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FULL-LENGTH HUMAN CPNAS ENCODING POTENTIALLY SECRETED 



PROTEINS 



CROSS REFERENCE TO RELATED APPLICATIONS 

5 The present application is related to and claims priority from US provisional Application 

Nos. 60/197,873, filed April 18, 2000, 60/224,009, filed August 7, 2000, 60/260,328, filed January 
8, 2001, and 60/224,006, filed August 4, 2000, the entire disclosures of each of which is herein 
incorporated by reference. 

10 FIELD OF THE INVENTION 

The present invention is directed to GENSET polypeptides, fragments thereof, and the 
regulatory regions located in the 5'- and 3'-ends of the genes encoding the polypeptides. The 
invention also concerns polypeptides encoded by GENSET genes and fragments thereof. The 
present invention also relates to recombinant vectors including the polynucleotides of the present 

15 invention, particularly recombinant vectors comprising a GENSET gene regulatory region or a 
sequence encoding a GENSET polypeptide, and to host cells containing the polynucleotides of the 
invention, as well as to methods of making such vectors and host cells. The present invention 
further relates to the use of these recombinant vectors and host cells in the production of the 
polypeptides of the invention. The invention further relates to antibodies that specifically bind to 

20 the polypeptides of the invention and to methods for producing such antibodies and fragments 
thereof The invention also provides methods of detecting the presence of the polynucleotides and 
polypeptides of the present invention in a sample, methods of diagnosis and screening of abnormal 
GENSET polypeptide expression and/or biological activity, methods of screening compounds for 
their ability to modulate the activity or expression of the GENSET polypeptides, and uses of such 

25 compounds. 

BACKGROUND OF THE INVENTION 

The estimated 30,000-12,000 genes scattered along the human chromosomes offer 
tremendous promise for the understanding, diagnosis, and treatment of human diseases. In addition, 
probes capable of specifically hybridizing to loci distributed throughout the human genome find 
30 application in the construction of high resolution chromosome maps and in the identification of 
individuals. 

In the past, the characterization of even a single human gene was a painstaking process, 
requiring years of effort. Recent developments in the areas of cloning vectors, DN A sequencing, 
and computer technology have merged to greatly accelerate the rate at which human genes can be 
35 isolated, sequenced, mapped, and characterized. 
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Currently, two different approaches are being pursued for identifying and characterizing the 
genes distributed along the human genome. In one approach, large fragments of genomic DNA are 
isolated, cloned, and sequenced. Potential open reading frames in these genomic sequences are 
identified using bio-informatics software. However, this approach entails sequencing large 
S stretches of human DNA which do not encode proteins in order to fmd the protein encoding 

sequences scattered throughout the genome. In addition to requiring extensive sequencing, the bio- 
informatics software may mischaracterize the genomic sequences obtained, i.e., labeling non-coding 
DNA as coding DNA and vice versa. 

An alternative approach takes a more direct route to identifying and characterizing human 

10 genes. In this approach, complementary DNAs (cDNAs) are synthesized from isolated messenger 
RNAs (mRNAs) which encode human proteins. Using this approach, sequencing is only performed 
on DNA which is derived from protein coding fragments of the genome. Often, only short stretches 
of the cDNAs are sequenced to obtain sequences called expressed sequence tags (ESTs). The ESTs 
may then be used to isolate or purify cDNAs which include sequences adjacent to the EST 

15 sequences. The cDNAs may contain all of the sequence of the EST which was used to obtain them 
or only a fragment of the sequence of the EST which was used to obtain them. In addition, the 
cDNAs may contain the full coding sequence of the gene from which the EST was derived or, 
alternatively, the cDNAs may include fragments of the coding sequence of the gene from which the 
EST was derived. It will be appreciated that there may be several cDNAs which include the EST 

20 sequence as a result of alternate splicing or the activity of alternative promoters. 

In the past, these short EST sequences were often obtained from oligo-dT primed cDN A 
libraries. Accordingly, they mainly corresponded to the 3* untranslated region of the mRNA. In 
part, the prevalence of EST sequences derived from the 3' end of the mRNA is a result of the fact 
that typical techniques for obtaining cDNAs are not well suited for isolating cDNA sequences 

25 derived from the 5' ends of mRNAs {Adams et al. Nature 377:3-174, 1996, Hillier et al. Genome 
Res. 6:807-828, 1996). in addition, in those reported instances where longer cDNA sequences have 
been obtained, the reported sequences typically correspond to coding sequences and do not include 
the full 5' untranslated region (5'UTR) of the mRNA from which the cDNA is derived. Indeed, 
S'UTRs have been shown to affect either the stability or translation of mRNAs. Thus, regulation of 

30 gene expression may be achieved through the use of alternative S'UTRs as shown, for instance, for 
the translation of the tissue inhibitor of metalloprotease mRNA in mitogenically activated cells 
(Waterhouse et a/., J Biol Chem. 265:5585-9. 1990). Furthermore, modification of 5*UTR through 
mutation, insertion or translocation events may even be implied in pathogenesis. For instance, the 
Fragile X syndrome, the most common cause of inherited mental retardation, is partly due to an 

35 insertion of multiple CGG trinucleotides in the 5*UTR of the Fragile X mRNA resulting in the 
inhibition of protein synthesis via ribosome stalling (Feng et al, Science 268:73 1-4, 1 995), An 
aberrant mutation in regions of the 5'UTR known to inhibit translation of the proto-oncogene c-myc 
was shown to result in upregulation of c-myc protein levels in cells derived from patients with 
multiple myelomas (Willis et aL Curr Top Microbiol Immunol 224:269-76, 1997). In addition, the 
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use of oligo-dT primed cDNA libraries does not allow the isolation of complete 5'UTRs since such 
incomplete sequences obtained by this process may not include the first exon of the mRNA, 
particularly in situations where the first exon is short. Furthermore, they may not include some 
exons, often short ones, which are located upstream of splicing sites. Thus, there is a need to obtain 

5 sequences derived firom the 5' ends of mRNAs. 

Moreover, despite the great amount of EST data that large-scale sequencing projects have 
yielded (Adams e/a/.,^flr/wre 377:174, 1996; H\\\m et aL, Genome Res, 6:807-828, 1996), 
information concerning the biological function of the mRNAs corresponding to such obtained 
cDNAs has revealed to be limited. Indeed, whereas the knowledge of the complete coding 

10 sequence is absolutely necessary to investigate the biological function of mRNAs, ESTs yield only 
partial coding sequences. So far, large-scale full-length cDNA cloning has been achieved only with 
limited success because of the poor efficiency of methods for constructing full-length cDNA 
libraries. Indeed, such methods require either a large amount of mRNA (Ederly e( al., 1995), thus 
resulting in non representative full-length libraries when small amounts of tissue are available or 

15 require PCR amplification (Maruyama et al, 1994; CLONTECHniques, 1996) to obtain a 
reasonable number of clones, thus yielding strongly biased cDNA libraries where rare and long 
cDNAs are lost. Thus, there is a need to obtain full-length cDNAs, Le, cDNAs containing the full 
coding sequence of their corresponding mRNAs. 

While many sequences derived from human chromosomes have practical applications, 

20 approaches based on the identification and characterization of those chromosomal sequences which 
encode a protein product are particularly relevant to diagnostic and therapeutic uses. Of the 30,000- 
120,000 protein coding genes, those genes encoding proteins which are secreted from the cell in 
which they are synthesized, as well as the secreted proteins themselves, are particularly valuable as 
potential therapeutic agents. Such proteins are often involved in cell to cell communication and 

25 may be responsible for producing a clinically relevant response in their target cells. In fact, several 
secretory proteins, including tissue plasminogen activator, G-CSF, GM-CSF, erythropoietin, human 
growth hormone, insulin, interferon-a, interferon-p, interferon-y, and interleukin-2, are currently in 
clinical use. These proteins are used to treat a wide range of conditions, including acute myocardial 
infarction, acute ischemic stroke, anemia, diabetes, growth hormone deficiency, hepatitis, kidney 

30 carcinoma, chemotherapy induced neutropenia and multiple sclerosis. For these reasons, cDNAs 
encoding secreted proteins or fragments thereof represent a particularly valuable source of 
therapeutic agents. Thus, there is a need for the identification and characterization of secreted 
proteins and the nucleic acids encoding them. 

In addition to being therapeutically useful themselves, secretory proteins include short 

35 peptides, called signal peptides, at their amino termini which direct their secretion. These signal 
peptides are encoded by the signal sequences located at the 5' ends of the coding sequences of genes 
encoding secreted proteins. Because these signal ]3eptides will direct the extracellular secretion of 
any protein to which they are operably linked, the signal sequences may be exploited to direct the 
efficient secretion of any protein by operably linking the signal sequences to a gene encoding the 
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protein for which secretion is desired. In addition, fragments of the signal peptides called 
membrane-translocating sequences may also be used to direct the intracellular import of a peptide 
or protein of interest. This may prove beneficial in gene therapy strategies in which it is desired to 
deliver a particular gene product to cells other than the cells in which it is produced. Signal 
5 sequences encoding signal peptides also find application in simplifying protein purification 

techniques. In such applications, the extracellular secretion of the desired protein greatly facilitates 
purification by reducing the number of undesired proteins from which the desired protein must be 
selected. Thus, there exists a need to identify and characterize the 5' fragments of the genes for 
secretory proteins which encode signal peptides. 

1 0 Sequences coding for secreted proteins may also find application as therapeutics or 

diagnostics. In particular, such sequences may be used to determine whether an individual is likely 
to express a detectable phenotype, such as a disease, as a consequence of a mutation in the coding 
sequence for a secreted protein. In instances where the individual is at risk of suffering from a 
disease or other undesirable phenotype as a result of a mutation in such a coding sequence, the 

1 5 undesirable phenotype may be corrected by introducing a normal coding sequence using gene 
therapy. Alternatively, if the undesirable phenotype results from overexpression of the protein 
encoded by the coding sequence, expression of the protein may be reduced using antisense or triple 
helix based strategies. 

The secreted human polypeptides encoded by the coding sequences may also be used as 

20 therapeutics by administering them directly to an individual having a condition, such as a disease, 
resulting from a mutation in the sequence encoding the polypeptide. In such an instance, the 
condition can be cured or ameliorated by administering the polypeptide to the individual. 

In addition, the secreted human polypeptides or fragments thereof may be used to generate 
antibodies useful in determining the tissue type or species of origin of a biological sample. The 

25 antibodies may also be used to determine the cellular localization of the secreted human 
polypeptides or the cellular localization of polypeptides which have been fused to the human 
polypeptides. In addition, the antibodies may also be used in immunoaffinity chromatography 
techniques to isolate, purify, or enrich the human polypeptide or a target polypeptide which has 
been fused to the human polypeptide. 

30 Public information on the number of human genes for which the promoters and upstream 

regulatory regions have been identified and characterized is quite limited. In part, this may be due 
to the difficulty of isolating such regulatory sequences. Upstream regulatory sequences such as 
transcription factor binding sites are typically too short to be utilized as probes for isolating 
promoters from human genomic libraries. Recently, some approaches have been developed to 

35 isolate human promoters. One of them consists of making a CpG island library (Cross et al, Nature 
Genetics 6: 236-244, 1994). The second consists of isolating human genomic DNA sequences 
containing Spel binding sites by the use of Spel binding protein (Mortlock et al.. Genome Res. 
6:327-335, 1996). Both of these approaches have their limits due to a lack of specificity and of 
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comprehensiveness. Thus, there exists a need to identiiy and systematically characterize the 5* 
fragments of the genes. 

cDNAs including the 5' ends of their corresponding mRNA may be used to efficiently 
identify and isolate 5'UTRs and upstream regulatory regions which control the location, 
5 developmental stage, rate, and quantity of protein synthesis, as well as the stability of the mRNA 
(Theil etaL, BioFactors 4:87-93, (1993). Once identified and characterized, these regulatory 
regions may be utilized in gene therapy or protein purification schemes to obtain the desired amount 
and locations of protein synthesis or to inhibit, reduce, or prevent the synthesis of undesirable gene 
products. 

10 In addition, cDNAs containing the 5' ends of secretory protein genes may include 

sequences useful as probes for chromosome mapping and the identification of individuals. Thus, 
there is a need to identiiy and characterize the sequences upstream of the 5' coding sequences of 
genes encoding secretory proteins. 

SUMMARY OF THE INVENTION 

1 5 The present invention provides a purified or isolated polynucleotide comprising, consisting 

of, or consisting essentially of a nucleotide sequence selected from the group consisting of: (a) the 
sequences of SEQ ID NOs:l-169, 339-455, 561-784; (b) the sequences of clone inserts of the 
deposited clone pool; (c) the coding sequences of SEQ IDNOs:l-169, 339-455, 561-784; (d) the 
coding sequences of the clone inserts of the deposited clone pool; (e) the sequences encoding one of 

20 the polypeptides of SEQ ID NOs: 1 70-338, 456-560, 785-9 1 8; (f) the sequences encoding one of the 
polypeptides encoded by the clone inserts of the deposited clone pool; (g) the genomic sequences 
coding for the GENSET polypeptides; (h) the 5' transcriptional regulatory regions of GENSET 
genes; (i) the 3' transcriptional regulatory regions of GENSET genes; 0) the polynucleotides 
comprising the nucleotide sequence of any combination of (gHi); (k) the variant polynucleotides of 

25 any of the polynucleotides of (a)-(j); (I) the polynucleotides comprising a nucleotide sequence of 
(a)-(k), wherein the polynucleotide is single stranded, double stranded, or a portion is single 
stranded and a portion is double stranded; (m) the polynucleotides comprising a nucleotide 
sequence complementary to any of the single stranded polynucleotides of (I). The invention further 
provides for fragments of the nucleic acid molecules of (a)-(m) described above. 

3d Further embodiments of the invention include purified or isolated polynucleotides that 

comprise, consist of, or consist essentially of a nucleotide sequence at least 70% identical, more 
preferably at least 75%, and eyen more preferably at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 
or 99% identical, to any of the nucleotide sequences in (a)-(m) above, e.g. over a region of at least 
about 25, 50, 100, 1 50, 250, 500, 1000, or more contiguous nucleotides, or a polynucleotide which 

35 hybridizes under stringent hybridization conditions to a polynucleotide in (a)-(m) above. 

The present invention also relates to recombinant vectors, which include the purified or 
isolated polynucleotides of the present invention, and to host cells recombinant for the 
polynucleotides of the present invention, as well as to methods of making such vectors and host 
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cells. The present invention further relates to the use of these recombinant vectors and reconribinant 
host cells in the production of GENSET polypeptides. 

The invention further provides a purified or isolated polypeptide comprising, consisting of, 
or consisting essentially of an amino acid sequence selected from the group consisting of: (a) the 
5 polypeptides of SEQ ID NOs; 1 70-338, 456-560, 785-9 1 8; (b) the polypeptides encoded by the 
clone inserts of the deposited clone pool; (c) the epitope-bearing fragments of the polypeptides of 
SEQ ID NOs: 170-338, 456-560, 785-918; (d) the epitope-bearing fragments of the polypeptides 
encoded by the clone inserts contained in the deposited clone pool; (e) the domains of the 
polypeptides of SEQ ID NOs: 170-338, 456-560, 785-918; (0 the domains of the polypeptides 

1 0 encoded by the clone inserts contained in the deposited clone pool; and (g) the allelic variant 
polypeptides of any of the polypeptides of (a)-(f). The invention further provides for fragments of 
the polypeptides of (a)-(g) above, such as those having biological activity or comprising 
biologically functional domain(s). 

The present invention further includes polypeptides with an amino acid sequence with at 

1 5 least 70% similarity, and more preferably at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 
99% similarity to those polypeptides described in (a)-(g), as well as polypeptides having an amino 
acid sequence at least 70% identical, more preferably at least 75% identical, and still more 
preferably 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical to those polypeptides described 
in (a)-(g), e.g. over a region of at least about 25, 50, 100, 1 50, 250, 500, 1000, or more amino acids. 

20 The invention further relates to methods of making the polypeptides of the present invention. 
The present invention further relates to transgenic plants or animals, wherein said 
transgenic plant or animal is transgenic for a polynucleotide of the present invention and expresses a 
polypeptide of the present invention. 

The invention farther relates to antibodies that specifically bind to GENSET polypeptides 

25 of the present invention and fragments thereof as well as to methods for producing such antibodies 
and fragments thereof. 

The invention also provides kits, uses and methods for detecting GENSET gene expression 
and/or biological activity in a biological sample. One such method involves assaying for the 
expression of a GENSET polynucleotide in a biological sample using the polymerase chain reaction 

30 (PGR) to amplify and detect GENSET polynucleotides or Southern and Northern blot hybridization 
to detect GENSET genomic DNA, cDNA or mRNA. Alternatively, a method of detecting 
GENSET gene expression in a test sample can be accomplished using a compound which binds to a 
GENSET polypeptide of the present invention or a portion of a GENSET polypeptide. 
The present invention also relates to diagnostic methods and uses of GENSET 

35 polynucleotides and polypeptides for identifying individuals or non-human animals having elevated 
or reduced levels of GENSET gene products, which individuals are likely to benefit from therapies 
to suppress or enhance GENSET gene expression, respectively, and to methods of identifying 
individuals or non-human animals at increased risk for developing, or at present having, certain 
diseases/disorders associated with GENSET polypeptide expression or biological activity. 
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The present invention also relates to kits, uses and methods of screening compounds for 
their ability to modulate (e,g, increase or inhibit) the activity or expression of GENSET 
polypeptides including compounds that interact with GENSET gene regulatory sequences and 
compounds that interact directly or indirectly with a GENSET polypeptide. Uses of such 
S compounds are also within the scope of the present invention. 

The present invention also relates to pharmaceutical or physiologically acceptable 
compositions comprising, an active agent, the polypeptides, polynucleotides or antibodies of the 
present invention, as well as, typically, a pharmaceutically acceptable carrier. 

The present invention also relates to computer systems containing cDNA codes and 

1 0 polypeptide codes of sequences of the invention and to computer-related methods of comparing 
sequences, identifying homology or features using GENSET polypeptides or GENSET 
polynucleotide sequences of the invention. 

In another aspect, the present invention provides an isolated polynucleotide, the 
polynucleotide comprising a nucleic acid sequence encoding: i) a polypeptide comprising an amino 

1 5 acid sequence having at least about 80% identity to any one of the sequences shown as SEQ ID 
NOs: 170-338, 456-560, 785-918 or any one of the sequences of polypeptides encoded by the clone 
inserts of the deposited clone pool; or a biologically active fragment of the polypeptide. 

In one embodiment, the polypeptide comprises any one of the sequences shown as SEQ ID 
NOs: 170-338, 456-560, 785-918 or any one of the sequences of the polypeptides encoded by the 

20 clone inserts of the deposited clone pool. In another embodiment, the polypeptide comprises a 
signal peptide. In another embodiment, the polypeptide is a mature protein. In another 
embodiment, the nucleic acid sequence has at least about 80% identity over at least about 100 
contiguous nucleotides to any one of the sequences shown as SEQ ID NOs: 1-169, 339-455, 561- 
784 or any one of the sequences of the clone inserts of the deposited clone pool. In another 

25 embodiment, the polynucleotide hybridizes under stringent conditions to a polynucleotide 

comprising any one of the sequences shown as SEQ ID NOs: 1-169, 339-455, 561-784 or any one of 
the sequences of the clone inserts of the deposited clone pool. In another embodiment, the nucleic 
acid sequence comprises any one of the sequences shown as SEQ ID NOs:l-169, 339-455, 561-784 
or any one the sequences of the clone inserts of the deposited clone pool. In another embodiment, 

30 the polynucleotide is operably linked to a promoter. 

In another aspect, the present invention provides an expression vector comprising any of the 
herein-described polynucleotides, operably linked to a promoter. In another aspect, the present 
invention provides a host cell recombinant for any of the herein-described polynucleotides. In 
another aspect, the present invention provides a non-human transgenic animal comprising the host 

35 cell. 

In another aspect, the present invention provides a method of making a GENSET 
polypeptide, the method comprising a) providing a population of host cells comprising a herein- 
described polynucleotide and b) culturing the population of host cells under conditions conducive to 
the production of the polypeptide within said host cells. 
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In one embodiment, the method further comprises purifying the polypeptide from the 
population of host cells. 

In another aspect, the present invention provides a method of making a GENSET 
polypeptide, the method comprising a) providing a population of cells comprising a polynucleotide 
5 encoding a herein-described polypeptide; b) culturing the population of cells under conditions 
conducive to the production of the polypeptide within the cells; and c) purifying the polypeptide 
from the population of cells. 

In another aspect, the present invention provides an isolated polynucleotide, the 
polynucleotide comprising a nucleic acid sequence having at least about 80% identity over at least 
10 about 100 contiguous nucleotides to any one of the sequences shown as SEQ ID NOs: 1-169, 339- 
455, 561-784 or any one of the sequences of the clone inserts of the deposited clone pool. 

In one embodiment, the polynucleotide hybridizes under stringent conditions to a 
polynucleotide comprising any one of the sequences shown as SEQ ID NOs: 1-169, 339-455, 561- 
784 or any one of the sequences of the clone inserts of the deposited clone pool. In another 
1 5 embodiment, the polynucleotide comprises any one of the sequences shown as SEQ ID NOs: 1 - 1 69, 
339-455, 561-784 or any one of the sequences of the clone inserts of the deposited clone pool. 

In another aspect, the present invention provides a biologically active polypeptide encoded 
by any of the herein-described polynucleotides. 

In another aspect, the present invention provides an isolated polypeptide or biologically 
20 active fragment thereof, the polypeptide comprising an amino acid sequence having at least about 
80% sequence identity to any one of the sequences shown as SEQ ID NOs: 1 70-338, 456-560, 785- 
91 8 or any one of the sequences of polypeptides encoded by the clone inserts of the deposited clone 
pool. 

In one embodiment, the polypeptide is selectively recognized by an antibody raised against 
25 an antigenic polypeptide, or an antigenic fragment thereof, the antigenic polypeptide comprising 
any one of the sequences shown as SEQ ID NOs: 1 70-338, 456-560, 785-91 8 or any one of the 
sequences of polypeptides encoded by the clone inserts of the deposited clone pool. In another 
embodiment, the polypeptide comprises any one of the sequences shown as SEQ ID NOs: 170-338, 
456-560, 785-91 8 or any one of the sequences of polypeptides encoded by the clone inserts of the 
30 deposited clone pool. In another embodiment, the polypeptide comprises a signal peptide. In 
another embodiment, the polypeptide is a mature protein. 

In another aspect, the present invention provides an antibody that specifically binds to any 
of ther herein-described polypeptides. 

In another aspect, the present invention provides a method of determining whether a 
35 GENSET gene is expressed within a mammal, the method comprising the steps of: a) providing a 
biological sample from said mammal; b) contacting said biological sample with either of: i) a 
polynucleotide that hybridizes under stringent conditions to any of the herein-described 
polynucleotides; or ii) a polypeptide that specifically binds to any of the herein-described 
polypeptides; and c) detecting the presence or absence of hybridization between the polynucleotide 
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and an RNA species within the sample, or the presence or absence of binding of the polypeptide to a 
protein within the sample; wherein a detection of the hybridization or of the binding indicates that 
the GENSET gene is expressed within the mammal. 

In one embodiment, the polynucleotide is a primer, and the hybridization is detected by 
5 detecting the presence of an amplification product comprising the sequence of the primer. In 
another embodiment, the polypeptide is an antibody. 

In another aspect, the present invention provides a method of determining whether a 
mammal has an elevated or reduced level of GENSET gene expression, the method comprising the 
steps of: a) providing a biological sample from the mammal; and b) comparing the amount of any of 
10 the herein-described polypeptides, or of an RNA species encoding the polypeptide, within the 
biological sample with a level detected in or expected from a control sample; wherein an increased 
amount of the polypeptide or the RNA species within the biological sample compared to the level 
detected in or expected from the control sample indicates that the mammal has an elevated level of 
the GENSET gene expression, and wherein a decreased amount of the polypeptide or the RNA 
1 5 species within the biological sample compared to the level detected in or expected from the control 
sample indicates that the mammal has a reduced level of the GENSET gene expression. 

In another aspect, the present invention provides a method of identifying a candidate 
modulator of a GENSET polypeptide, the method comprising: a) contacting any of the herein- 
described polypeptides with a test compound; and b) determining whether the compound 
20 specifically binds to the polypeptide; wherein a detection that the compound specifically binds to 
the polypeptide indicates that the compound is a candidate modulator of the GENSET polypeptide. 

In one embodiment, the method further comprises testing the biological activity of the 
GENSET polypeptide in the presence of the candidate modulator, wherein an alteration in the 
biological activity of the GENSET polypeptide in the presence of the compound in comparison to 
25 the activity in the absence of said compound indicates that the compound is a modulator of the 
GENSET polypeptide. 

In another aspect, the present invention provides a method for the production of a 
pharmaceutical composition, the method comprising a) identifying a modulator of a GENSET 
polypeptide using any of the herein-described methods; and b) combining the modulator with a 
30 pharmaceutically acceptable carrier. 

BRIEF DESCRIPTION OF DRAWINGS 
Figure 1 is a block diagram of an exemplary computer system. 

Figure 2 is a flow diagram illustrating one embodiment of a process 200 for comparing a 
35 new nucleotide or protein sequence with a database of sequences in order to determine the identity 
levels between the new sequence and the sequences in the database. 

Figure 3 is a flow diagram illustrating one embodiment of a process 250 in a computer for 
determining whether two sequences are homologous. 
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Figure 4 is a flow diagram illustrating one embodiment of an identifier process 300 for 
detecting the presence of a feature in a sequence. 



BRIEF DESCRIPTION OF THE TABLES 

5 Table I provides the SEQ ID Nos in the present application (with the SEQ ID Nos 

corresponding to nucleic acid sequences preceded by "NUC", and the SEQ ID Nos corresponding to 
the encoded polypeptide sequences preceded by "PRT") that correspond to a SEQ ID NO in priority 
application number 60/197,873. Applicants' internal designation number (Clone ID) corresponding 
to each sequence identification (SEQ ID) number is also provided. 

10 Table II lists the putative chromosomal location of the polynucleotides of the present 

invention. The SEQ ID NO listed for each polynucleotide is that from the priority application 
60/197,873; the corresponding SEQ IDNOs for the sequence in the present application can be 
determined by referring to Table I. 

Table III lists the number of hits in Genset's cDNA libraries of tissues and cell types for 

1 5 polynucleotides of the invention. The following abbreviations are used to refer to each cell or tissue 
type: A=Brain; B=Fetal brain; C=Fetal kidney; D=Fetal liver; E=Pituitary gland; F=Liver; 
G=Placenta; H=Prostate; I=Salivary gland; J=Stomach/Intestine; and K=Testis. The SEQ ID NO 
listed for each polynucleotide is that from the priority application 60/197,873; the corresponding 
SEQ ID NOs for the sequence in the present application can be determined by referring to Table I. 

20 Table IV lists the number of hits in publicly available library of tissues and cell types for 

polynucleotides of the invention. The SEQ ID NO listed for each polynucleotide is that from the 
priority application 60/1 97,873; the corresponding SEQ ID NOs for each sequence in the present 
application can be determined by referring to Table I. 

Table V lists the tissues and cell types in which the polynucleotide sequences of the present 

25 invention are over- or under-represented. The SEQ ID NO listed for each polynucleotide is that 
from the priority application 60/197,873; the corresponding SEQ IDNOs for each sequence in the 
present application can be determined by referring to Table I. 

BRIEF DESCRIPTION OF THE SEQUENCE LISTING 

30 SEQ ID NOs: 1 - 1 69, 339-455, 561-784 are the nucleotide sequences of cDNAs, with open 

reading frames as indicated as features (CDS). When appropriate, the locations of the potential 
polyadenylation site and polyadenylation signal are also indicated. 

SEQ ID NOs: 170-338, 456-560, 785-918 are the amino acid sequences of proteins encoded 
by the cDNAs of SEQ ID NOs: 1-1 69, 339-455, 561-784. 

35 SEQ lDNOs:l-85, 339-400, 406-407, 413-415, 561-594, and 634-651 are the nucleotide 

sequences of cDNAs encoding a potentially secreted protein. The locations of the ORFs and 
sequences encoding signal peptides are listed in the accompanying Sequence Listing. In addition, 
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the von Heijne score of the signal peptide computed as described below is listed as the "score" in 
the accompanying Sequence Listing. The sequence of the signal-peptide is listed as "seq" in the 
accompanying Sequence Listing. The "/'' in the signal peptide sequence indicates the location 
where proteolytic cleavage of the signal peptide occurs to generate a mature protein. When 
S appropriate, the locations of the first and last nucleotides of the coding sequences, eventually the 
locations of the first and last nucleotides of the polyA and the locations of the first and last 
nucleotides of the polyA sites are indicated. 

SEQ ID NOs:86-169, 401-405, 408-412, 416-455, 595-633, 652-784 are the nucleotide 
sequences of cDNAs in which no sequence encoding a signal peptide has been identified to date. 

10 However, it remains possible that subsequent analysis will identify a sequence encoding a signal 
peptide in these nucleic acids. The locations of the ORFs are listed in the accompanying Sequence 
Listing. When appropriate, the locations of the first and last nucleotides of the coding sequences, 
eventually the locations of the first and last nucleotides of the polyA and the locations of the first 
and last nucleotides of the polyA sites are indicated. 

1 5 SEQ ID NOs: 1 70-254, 456-5 1 7, 520-52 1 , 527-529, 785-81 8, and 858-875 are the amino 

acid sequences of polypeptides which contain a signal peptide. These polypeptides are encoded by 
the cDNAs of SEQ ID NOs: 1-85, 339-400, 406-407, 413-415, 561-594, and 634-651 . The location 
of the signal peptide is listed in the accompanying Sequence Listing. 

SEQ ID NOs:255.338, 517-519, 522-526, 530-560, 819-857. 876-91 8 are the amino acid 

20 sequences of polypeptides in which no signal peptide has been identified to date. However, it 

remains possible that subsequent analysis will identify a signal peptide in these polypeptides. These 
polypeptides are encoded by the nucleic acids of SEQ ID NOs: 86-169, 401-405, 408-412, 416-455, 
595-633, 652-784. 

In accordance with the regulations relating to Sequence Listings, the following codes have 
25 been used in the Sequence Listing to describes nucleotide sequences. The code "r" in the sequences 
indicates that the nucleotide may be a guanine or an adenine. The code *y in the sequences 
indicates that the nucleotide may be a thymine or a cytosine. The code ^^m" in the sequences 
indicates that the nucleotide may be an adenine or a cytosine. The code "'k'* in the sequences 
indicates that the nucleotide may be a guanine or a thymine. The code "s'' in the sequences 
30 indicates that the nucleotide may be a guanine or a cytosine. The code ' V in the sequences 
indicates that the nucleotide may be an adenine or an thymine. In addition, all instances of the 
symbol ''n" in the nucleic acid sequences mean that the nucleotide can be adenine, guanine, cytosine 
or thymine. 

In some instances, the polypeptide sequences in the Sequence Listing contain the symbol 
35 "Xaa." These "Xaa" symbols indicate either (1) a residue which cannot be identified because of 
nucleotide sequence ambiguity or (2) a stop codon in the determined sequence where applicants 
believe one should not exist (if the sequence were determined more accurately). In some instances, 
several possible identities of the unknown amino acids may be suggested by the genetic code. 
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In the case of secreted proteins, it should be noted that, in accordance with the regulations 
governing Sequence Listings, in the appended Sequence Listing the encoded protein (i.e. the protein 
containing the signal peptide and the mature protein or part thereof) extends from an amino acid 
residue having a negative number through a positively numbered amino acid residue. Thus, the first 
5 amino acid of the mature protein resulting from cleavage of the signal peptide is designated as 
amino acid number 1, and the first amino acid of the signal peptide is designated with the 
appropriate negative number. 

DETAILED DESCRIPTION OF THE INVENTION AND PREFERRED 
10 EMBODIMENTS 

DEFINITlOm 

Before describing the invention in greater detail, the following definitions are set forth to 
illustrate and define the meaning and scope of the terms used to describe the invention herein. 

The term "GENSET gene," when used herein, encompasses genomic, mRNA and cDNA 
1 5 sequences encoding a GENSET polypeptide, including the S' and 3' untranslated regions of said 
sequences. 

The term "GENSET polypeptide biological activity" or "GENSET biological activity" is 
intended for polypeptides exhibiting any activity similar, but not necessarily identical, to an activity 
of a GENSET polypeptide of the invention. The GENSET polypeptide biological activity of a | 
20 given polypeptide may be assessed using any suitable biological assay, a number of which are 
known to those skilled in the art. In contrast, the term "biological activity" refers to any activity 
that any polypeptide may have. 

The term "corresponding mRNA" refers to mRNA which was or can be a template for 
cDNA synthesis for producing a cDNA of the present invention. 
25 The term "corresponding genomic DNA" refers to genomic DNA which encodes an mRNA 

of interest, e.g. corresponding to a cDNA of the invention, which genomic DNA includes the 
sequence of one of the strands of the mRNA, in which thymidine residues in the sequence of the 
genomic DNA (or cDNA) are replaced by uracil residues in the mRNA. 

The term "deposited clone pool" is used herein to refer to the pool of clones entitled cDNA- 
30 8-2000, deposited with the ATCC on September 27, 2000, or the pool of clones entitled cDNA-1 1- 
2000, deposited with the ATCC on November 27, 2000, or any other deposited clone pool 
containing a clone corresponding to any of the herein-described sequences. 

The term "heterologous", when used herein, is intended to designate any polynucleotide or 
polypeptide other than a GENSET polynucleotide or GENSET polypeptide of the invention, 
35 respectively. 

"Providing" with respect to, e.g. a biological sample, population of cells, etc. indicates that 
the sample, population of cells, etc. is somehow used in a method or procedure. Significantly, 
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"providing" a biological sample or population of cells does not require that the sample or cells are 
specifically isolated or obtained for the purposes of the invention, but can instead refer, for 
example, to the use of a biological sample obtained by another individual, for another purpose. 

An "amplification product" refers to a product of any amplification reaction, e.g. PCR, RT- 
5 PCR,LCR,etc. 

A "modulator"' of a protein or other compound refers to any agent that has a functional 
effect on the protein, including physical binding to the protein, alterations of the quantity or quality 
of expression of the protein, altering any measurable or detectable activity, property, or behavior of 
the protein, or in any way interacts with the protein or compound. 

1 0 "A test compound" can be any molecule that is evaluated for its ability to modulate a 

protein or other compound. 

An antibody or other compound that specifically binds to a polypeptide or polynucleotide of 
the invention is also said to "selectively recognize" the polypeptide or polynucleotide. 

The term "isolated" with respect to a molecule requires that the molecule be removed from 

15 its original environment (e. g., the natural environment if it is naturally occurring). For example, a 
naturally-occurring polynucleotide or polypeptide present in a living animal is not isolated, but the 
same polynucleotide or DNA or polypeptide, separated from some or all of the coexisting materials 
in the natural system, is isolated. Such polynucleotide could be part of a vector and/or such 
polynucleotide or polypeptide could be part of a composition, and still be isolated in that the vector 

20 or composition is not part of its natural environment. For example, a naturally-occurring 

polynucleotide present in a living animal is not isolated, but the same polynucleotide, separated 
from some or all of the coexisting materials in the natural system, is isolated. Specifically excluded 
from the definition of "isolated" are: naturally-occurring chromosomes (such as chromosome 
spreads), artificial chromosome libraries, genomic libraries, and cDNA libraries that exist either as 

25 an in vitro nucleic acid preparation or as a transfected/transformed host cell preparation, wherein the 
host cells are either an in vitro heterogeneous preparation or plated as a heterogeneous population of 
single colonies. Also specifically excluded are the above libraries wherein a specified 
polynucleotide makes up less than 5% of the number of nucleic acid inserts in the vector molecules. 
Further specifically excluded are whole cell genomic DNA or whole cell RNA preparations 

30 (including said whole cell preparations which are mechanically sheared or enzymatically digested). 
Further specifically excluded are the above whole cell preparations as either an in vitro preparation 
or as a heterogeneous mixture separated by electrophoresis (including blot transfers of the same) 
wherein the polynucleotide of the invention has not further been separated from the heterologous 
polynucleotides in the electrophoresis medium (e.g., further separating by excising a single band 

35 from a heterogeneous band population in an agarose gel or nylon blot). 

The term "purified" does not require absolute purity; rather, it is intended as a relative 
definition. Purification of starting material or natural material to at least one order of magnitude, 
preferably two or three orders, and more preferably four or five orders of magnitude is expressly 
contemplated. As an example, purification from 0.1% concentration to 10% concentration is two 



13 



wo 02/083898 PCT/IBO 1/009 14 

orders of magnitude. To illustrate, individual cDNA clones isolated from a cDNA library have been 
conventionally purified to electrophoretic homogeneity. The sequences obtained from these clones 
could not be obtained directly either from the library or from total human DNA. The cDNA clones 
are not naturally occurring as such, but rather are obtained via manipulation of a partially purified 

5 naturally occurring substance (messenger RNA), The conversion of mRNA into a cDN A library 
involves the creation of a synthetic substance (cDNA) and pure individual cDNA clones can be 
isolated from the synthetic library by clonal selection. Thus, creating a cDNA library from 
messenger RNA and subsequently isolating individual clones from that library results in an 
approximately 10"*- 10^ fold purification of the native message. 

10 The term "purified" is further used herein to describe a polypeptide or polynucleotide of the 

invention which has been separated from other compounds including, but not limited to, 
polypeptides or polynucleotides, carbohydrates, lipids, etc. The term "purified" may be used to 
specify the separation of monomeric polypeptides of the invention from oligomeric forms such as 
homo- or hetero- dimers, trimers, etc. The term "purified" may also be used to specify the 

1 5 separation of covalently closed (i.e. circular) polynucleotides from linear polynucleotides. A 
polynucleotide is substantially pure when at least about 50%, preferably 60 to 75% of a sample 
exhibits a single polynucleotide sequence and conformation (linear versus covalently close). A 
substantially pure polypeptide or polynucleotide typically comprises about 50%, preferably 60 to 
90% weight/weight of a polypeptide or polynucleotide sample, respectively, more usually about 

20 95%, and preferably is over about 99% pure. Polypeptide and polynucleotide purity, or 
homogeneity, is indicated by a number of means well known in the art, such as agarose or 
polyacrylamide gel electrophoresis of a sample, followed by visualizing a single band upon staining 
the gel. For certain purposes higher resolution can be provided by using HPLC or other means well 
known in the art. As an alternative embodiment, purification of the polypeptides and polynucleotides 

25 of the present invention may be expressed as "at least" a percent purity relative to heterologous 

polypeptides and polynucleotides (DNA, RNA or both). As a preferred embodiment, the polypeptides 
and polynucleotides of the present invention are at least; 1 0%, 20%, 30%, 40%, 50%, 60%, 70%, 
80%, 90%, 95%, 96%, 96%, 98%, 99%, or 100% pure relative to heterologous polypeptides and 
polynucleotides, respectively. As a further preferred embodiment the polypeptides and 

30 polynucleotides have a purity ranging from any number, to the thousandth position, between 90% and 
1 00% (e.g., a polypeptide or polynucleotide at least 99.995% pure) relative to either heterologous 
polypeptides or polynucleotides, respectively, or as a weight/weight ratio relative to all compounds 
and molecules other than those existing in the carrier. Each number representing a percent purity, 
to the thousandth position, may be claimed as individual species of purity. 

35 As used interchangeably herein, the terms " nucleic acid molecule(sV \ " o!igonucleotide(sV ', 

and " polvnucleotidefsV include RNA or DNA (either single or double stranded, coding, 
complementary or antisense), or RNA/DNA hybrid sequences of more than one nucleotide in either 
single chain or duplex form (although each of the above species may be particularly specified). The 
term " nucleotide " is used herein as an adjective to describe molecules comprising RNA, DNA, or 
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RNA/DNA hybrid sequences of any length in single-stranded or duplex form. More precisely, the 
expression '^nucleotide sequence" encompasses the nucleic material itself and is thus not restricted 
to the sequence information (i.e. the succession of letters chosen among the four base letters) that 
biochemically characterizes a specific DNA or RNA molecule. The term "nucleotide" is also used 
S herein as a noun to refer to individual nucleotides or varieties of nucleotides, meaning a molecule, 
or individual unit in a larger nucleic acid molecule, comprising a purine or pyrimidine, a ribose or 
deoxyribose sugar moiety, and a phosphate group, or phosphodiester linkage in the case of 
nucleotides within an oligonucleotide or polynucleotide. The term '"nucleotide" is also used herein 
to encompass ''modified nucleotides" which comprise at least one modirication such as (a) an 

1 0 alternative linking group, (b) an analogous form of purine, (c) an analogous form of pyrimidine, or 
(d) an analogous sugar. For examples of analogous linking groups, purine, pyrimidines, and sugars, 
see, for example, PCT publication No. WO 95/04064, which disclosure is hereby incorporated by 
reference in its entirety. Preferred modifications of the present invention include, but are not 
limited to, 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4- 

15 acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- 
carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6- 
isopenteny (adenine, 1-methylguanine, I-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2- 
methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5'- 

20 methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5- 
oxyacetic acid (v) ybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2- 
thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, 
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, and 2,6-diaminopurine. The 
polynucleotide sequences of the invention may be prepared by any known method, including 

25 synthetic, recombinant, ex vivo generation, or a combination thereof, as well as utilizing any 
purification methods known in the art. Methylenemethylimino linked oligonucleosides as well as 
mixed backbone compounds having, may be prepared as described in U.S. Pat. Nos. 5,378,825; 
5,386,023; 5,489,677; 5,602,240; and 5,610,289, which disclosures are hereby incorporated by 
reference in their entireties. Formacetal and thioformacetal linked oligonucleosides may be prepared 

30 as described in U.S. Pat. Nos. 5,264,562 and 5,264,564, which disclosures are hereby incorporated 
by reference in their entireties. Ethylene oxide linked oligonucleosides may be prepared as 
described in U.S. Pat. No. 5,223,618, which disclosure is hereby incorporated by reference in its 
entirety. Phosphinate oligonucleotides may be prepared as described in U.S. Pat. No. 5,508,270, 
which disclosure is hereby incorporated by reference in its entirety. Alkyl phosphonate 

35 oligonucleotides may be prepared as described in U.S. Pat. No. 4,469,863, which disclosure is 
hereby incorporated by reference in its entirety. 3'-Deoxy-3'-methylene phosphonate 
oligonucleotides may be prepared as described in U.S. Pat. Nos. 5,610,289 or 5,625,050 which 
disclosures are hereby incorporated by reference in their entireties. Phosphoramidite 
oligonucleotides may be prepared as described in U.S. Pat. No. 5,256,775 or U.S. Pat. No. 
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5,366,878 which disclosures are hereby incorporated by reference in their entireties. 
Alkylphosphonothioate oligonucleotides may be prepared as described in published PCT 
applications WO 94/17093 and WO 94/02499 which disclosures are hereby incorporated by 
reference in their entireties. 3'-Deoxy-3'-amino phosphoramidate oligonucleotides may be prepared 
5 as described in U.S. Pat. No. 5,476,925, which disclosure is hereby incorporated by reference in its 
entirety. Phosphotriester oligonucleotides may be prepared as described in U.S. Pat. No. 5,023,243, 
which disclosure is hereby incorporated by reference in its entirety. Borano phosphate 
oligonucleotides may be prepared as described in U.S. Pat. Nos. 5,130,302 and 5,177,198 which 
disclosures are hereby incorporated by reference in their entireties. 

10 The term "upstream" is used herein to refer to a location which is toward the 5' end of the 

polynucleotide from a specific reference point. 

The terms "base paired" and "Watson & Crick base paired" are used interchangeably herein 
to refer to nucleotides which can be hydrogen bonded to one another by virtue of their sequence 
identities in a manner like that found in double-helical DNA with thymine or uracil residues linked 

1 5 to adenine residues by two hydrogen bonds and cytosine and guanine residues linked by three 
hydrogen bonds (see Stryer, 1995, which disclosure is hereby incorporated by reference in its 
entirety). 

The terms "complementary*' or "complement thereof are used herein to refer to the 
sequences of polynucleotides which is capable of forming Watson & Crick base pairing with 

20 another specified polynucleotide throughout the entirety of the complementary region. For the 
purpose of the present invention, a first polynucleotide is deemed to be complementary to a second 
polynucleotide when each base in the first polynucleotide is paired with its complementary base. 
Complementary bases are, generally. A and T (or A and U), or C and G. "Complement" is used 
herein as a synonym from "complementary polynucleotide", "complementary nucleic acid" and 

25 "complementary nucleotide sequence". These terms are applied to pairs of polynucleotides based 
solely upon their sequences and not any particular set of conditions under which the two 
polynucleotides would actually bind. Unless otherwise stated, all complementary polynucleotides 
are fully complementary on the whole length of the considered polynucleotide. 

The terms "polypeptide" and "protein", used interchangeably herein, refer to a polymer of 

30 amino acids without regard to the length of the polymer; thus, peptides, oligopeptides, and proteins 
are included within the definition of polypeptide. This term also does not specify or exclude 
chemical or post-expression modifications of the polypeptides of the invention, although chemical 
or post-expression modifications of these polypeptides may be included excluded as specific 
embodiments. Therefore, for example, modifications to polypeptides that include the covalent 

35 attachment of glycosyl groups, acetyl groups, phosphate groups, lipid groups and the like are 
expressly encompassed by the term polypeptide. Further, polypeptides with these modifications 
may be specified as individual species to be included or excluded from the present invention. The 
natural or other chemical modifications, such as those listed in examples above can occur anywhere 
in a polypeptide, including the peptide backbone, the amino acid side-chains and the amino or 
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carboxyl termini. It will be appreciated that the same type of modification may be present in the 
same or varying degrees at several sites in a given polypeptide. Also, a given polypeptide may 
contain many types of modifications. Polypeptides may be branched, for example, as a result of 
ubiquitination, and they may be cyclic, with or without branching. Modifications include 
5 acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of flavin, covalent 

attachment of a heme moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent 
attachment of a lipid or lipid derivative, covalent attachment of phosphotidy I inositol, cross-linking, 
cyclization, disulfide bond formation, demethylation, formation of covalent cross-links, formation 
of cysteine, formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, dPI 

10 anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation, pegylation, 
proteolytic processing, phosphorylation, prenylation, racemization, selenoylation, sulfation, 
transfer-RNA mediated addition of amino acids to proteins such as arginylation, and ubiquitination. 
(See, for instance Creighton (1993); Seifter et a/., (1990); Rattan et al, (1992)). Also included 
within the definition are polypeptides which contain one or more analogs of an amino acid 

1 5 (including, for example, non-naturally occurring amino acids, amino acids which only occur 

naturally in an unrelated biological system, modified amino acids from mammalian systems, etc.), 
polypeptides with substituted linkages, as well as other modifications known in the art, both 
naturally occurring and non-naturally occurring. 

As used herein, the terms "recombinant polynucleotide" and "polynucleotide construct" are 

20 used interchangeably to refer to linear or circular, purified or isolated polynucleotides that have 
been artificially designed and which comprise at least two nucleotide sequences that are not found 
as contiguous nucleotide sequences in their initial natural environment. In particular, these terms 
mean that the polynucleotide or cDNA is adjacent to "backbone" nucleic acid to which it is not 
adjacent in its natural environment. Additionally, to be "enriched" the cDNAs will represent 5% or 

25 more of the number of nucleic acid inserts in a population of nucleic acid backbone molecules. 
Backbone molecules according to the present invention include nucleic acids such as expression 
vectors, self-replicating nucleic acids, viruses, integrating nucleic acids, and other vectors or nucleic 
acids used to maintain or manipulate a nucleic acid insert of interest. Preferably, the enriched 
cDNAs represent 15% or more of the number of nucleic acid inserts in the population of 

30 recombinant backbone molecules. More preferably, the enriched cDNAs represent 50% or more of 
the number of nucleic acid inserts in the population of recombinant backbone molecules. In a 
highly preferred embodiment, the enriched cDNAs represent 90% or more (including any number 
between 90 and 100%, to the thousandth position, e.g., 99.5%) of the number of nucleic acid inserts 
in the population of recombinant backbone molecules. 

35 The term "recombinant polypeptide" is used herein to refer to polypeptides that have been 

artificially designed and which comprise at least two polypeptide sequences that are not found as 
contiguous polypeptide sequences in their initial natural environment, or to refer to polypeptides 
which have been expressed from a recombinant polynucleotide. 
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As used herein, the term "operably linked" refers to a linkage of polynucleotide elements in 
a functional relationship. A sequence which is "operably linked" to a regulatory sequence such as a 
promoter means that said regulatory element is in the correct location and orientation in relation to 
the nucleic acid to control RNA polymerase initiation and expression of the nucleic acid of interest. 
5 For instance, a promoter or enhancer is operably linked to a coding sequence if it affects the 
transcription of the coding sequence. 

As used herein, the term "non-human animal" refers to any non-human animal, including 
insects, birds, rodents and more usually mammals. Preferred non-human animals include: primates; 
farm animals such as swine, goats, sheep, donkeys, cattle, horses, chickens, rabbits; and rodents, 
1 0 preferably rats or mice. As used herein, the term "animal" is used to refer to any species in the 
animal kingdom, preferably vertebrates, including birds and fish, and more preferable a mammal. 
Both the terms "animal" and "mammal" expressly embrace human subjects unless preceded with 
the term "non-human". 

The term "domain" refers to an amino acid fragment with specific biological properties. 
15 This term encompasses all known structural and linear biological motifs. Examples of such motifs 
include but are not limited to leucine zippers, helix-tum-helix motifs, glycosylation sites, 
ubiquitination sites, alpha helices, and beta sheets, signal peptides which direct the secretion of 
proteins, sites for post-translational modification, en2ymatic active sites, substrate binding sites, and 
enzymatic cleavage sites. 

20 Although each of these terms has a distinct meaning, the terms "comprising", "consisting 

of and "consisting essentially of may be interchanged for one another throughout the instant 
application. The term "having" has the same meaning as "comprising" and may be replaced with 
either the term "consisting of* or "consisting essentially of*. 

Unless otherwise specified in the application, nucleotides and amino acids of 

25 polynucleotides and polypeptides, respectively, of the present invention are contiguous and not 
interrupted by heterologous sequences. 

Identity Between Nucleic Acids Or Polypeptides 

The terms "percentage of sequence identity" and "percentage homology" are used 
interchangeably herein to refer to comparisons among polynucleotides and polypeptides, and are 

30 determined by comparing two optimally aligned sequences over a comparison window, wherein the 
portion of the polynucleotide or polypeptide sequence in the comparison window may comprise 
additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise 
additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by 
determining the number of positions at which the identical nucleic acid base or amino acid residue 

35 occurs in both sequences to yield the number of matched positions, dividing the number of matched 
positions by the total number of positions in the window of comparison and multiplying the result 
by 100 to yield the percentage of sequence identity. Homology is evaluated using any of the variety 
of sequence comparison algorithms and programs known in the art. Such algorithms and programs 
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include, but are by no means limited to, TBLASTN, BLASTP, FASTA, TFASTA, CLUSTALW, 
FASTDB (Pearson and Lipman, 1988; Altschul e/ a/., 1990; Thompson ^/ a/., 1994; Higgins e/ 
al,, 1996; Altschul a/., 1990; Altschul e/a/., 1993; Brutlage/a/, 1990), the disclosures of which 
are incorporated by reference in their entireties. 
5 In a particularly preferred embodiment, protein and nucleic acid sequence homologies are 

evaluated using the Basic Local Alignment Search Tool ("BLAST") which is well known in the art 
(see, e.g., Karlin and Altschul, 1990; Altschul et aL, 1990, 1993, 1997), the disclosures of which 
are incorporated by reference in their entireties. In particular, five specific BLAST programs are 
used to perform the following task: 
10 ( 1 ) BLASTP and BLASTS compare an amino acid query sequence against a protein 

sequence database; 

(2) BLASTN compares a nucleotide query sequence against a nucleotide sequence 

database; 

(3) BLASTX compares the six-frame conceptual translation products of a query nucleotide 
1 5 sequence (both strands) against a protein sequence database; 

(4) TBLASTN compares a query protein sequence against a nucleotide sequence database 
translated in all six reading frames (both strands); and 

(5) TBLASTX compares the six-frame translations of a nucleotide query sequence against 
the six-frame translations of a nucleotide sequence database. 

20 The BLAST programs identify homologous sequences by identifying similar segments, 

which are referrejl to herein as "high-scoring segment pairs," between a query amino or nucleic acid 
sequence and a test sequence which is preferably obtained from a protein or nucleic acid sequence 
database. High-scoring segment pairs are preferably identified (i,e., aligned) by means of a scoring 
matrix, many of which are known in the art. Preferably, the scoring matrix used is the BLOSUM62 

25 matrix (Gonnet et al., 1 992; HenikofF and HenikofF, 1 993, the disclosures of which are 

incorporated by reference in their entireties). Less preferably, the PAM or PAM250 matrices may 
also be used (see, e.g., Schwartz and DayhofT, eds., 1978, the disclosure of which is incorporated by 
reference in its entirety). The BLAST programs evaluate the statistical significance of all high- 
scoring segment pairs identified, and preferably selects those segments which satisfy a user- 

30 specified threshold of significance, such as a user-specified percent homology. Preferably, the 
statistical significance of a high-scoring segment pair is evaluated using the statistical significance 
formula of Karlin (see, e.g., Karlin and Altschul, 1990), the disclosure of which is incorporated by 
reference in its entirety. The BLAST programs may be used with the default parameters or with 
modified parameters provided by the user. 

35 Another preferred method for determining the best overall match between a query 

nucleotide sequence (a sequence of the present invention) and a subject sequence, also referred to as 
a global sequence alignment, can be determined using the FASTDB computer program based on the 
algorithm of Brutlag et aL (1990), the disclosure of which is incorporated by reference in its 
entirety. In a sequence alignment the query and subject sequences are both DNA sequences. An 
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RNA sequence can be compared by first converting U's to Vs. The result of said global sequence 
, alignment is in percent identity. Preferred parameters used in a FASTDB alignment of DNA 
sequences to calculate percent identity are: Matrix = Unitary, k-tuple = 4, Mismatch Penalty = 1, 
Joining Penalty = 30, Randomization Group Length = 0, Cutoff Score = 1, Gap Penalty = 5, Gap 
5 Size Penalty = 0.05, Window Size = 500 or the length of the subject nucleotide sequence, 

whichever is shorter. If the subject sequence is shorter than the query sequence because of 5* or 3* 
deletions, not because of internal deletions, a manual correction must be made to the results. This is 
because the FASTDB program does not account for 5' and 3* truncations of the subject sequence 
when calculating percent identity. For subject sequences truncated at the 5' or 3'ends, relative to the 

10 query sequence, the percent identity is corrected by calculating the number of bases of the query 
sequence that are 5' and 3' of the subject sequence, which are not matched/aligned, as a percent of 
the total bases of the query sequence. Whether a nucleotide is matched/aligned is determined by 
results of the FASTDB sequence alignment. This percentage is then subtracted from the percent 
identity, calculated by the above FASTDB program using 10, the specified parameters, to arrive at a 

15 final percent identity score. This corrected score is what is used for the purposes of the present 
invention. Only nucleotides outside the 5' and 3' nucleotides of the subject sequence, as displayed 
by the FASTDB alignment, which are not matched/aligned with the query sequence, are calculated 
for the purposes of manually adjusting the percent identity score. For example, a 90 nucleotide 
subject sequence is aligned to a 100 nucleotide query sequence to determine percent identity. The 

20 deletions occur at the 5' end of the subject sequence and therefore, the FASTDB alignment does not 
show a matched/alignment of the first 10 nucleotides at 5' end. The 10 unpaired nucleotides 
represent 10% of the sequence (number of nucleotides at the 5' and 3* ends not matched/total 
number of nucleotides in the query sequence) so 10% is subtracted from the percent identity score 
calculated by the FASTDB program. If the remaining 90 nucleotides were perfectly matched the 

25 final percent identity would be 90%. In another example, a 90 nucleotide subject sequence is 
compared with a 100 nucleotide query sequence. This time the deletions are internal deletions so 
that there are no nucleotides on the S' or 3' of the subject sequence which are not matched/aligned 
with the query. In this case the percent identity calculated by FASTDB is not manually corrected. 
Once again, only nucleotides 5* and 3* of the subject sequence which are not matched/aligned with 

30 the query sequence are manually corrected. No other manual corrections are made for the purposes 
of the present invention. 

Another preferred method for determining the best overall match between a query amino 
acid sequence (a sequence of the present invention) and a subject sequence, also referred to as a 
global sequence alignment, can be determined using the FASTDB computer program based on the 

35 algorithm of Brullag et al, ( 1 990). In a sequence alignment the query and subject sequences are 
both amino acid sequences. The result of said global sequence alignment is in percent identity. 
Preferred parameters used in a FASTDB amino acid alignment are: Matrix = PAM 0, k-tuple = 2, 
Mismatch Penalty = 1, Joining Penalty = 20, Randomization Group Length = 0, Cutoff Score = 1, 
Window Size = sequence length. Gap Penalty = 5, Gap Size Penalty = 0.05, Window Size = 500 or 
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the length of the subject amino acid sequence, whichever is shorter. If the subject sequence is 
shorter than the query sequence due to N-or C-terminal deletions, not because of internal deletions, 
the results, in percent identity, must be manually corrected. This is because the FASTDB program 
does not account for N- and C-terminal truncations of the subject sequence when calculating global 
5 percent identity. For subject sequences truncated at the N- and C-termini, relative to the query 
sequence, the percent identity is corrected by calculating the number of residues of the query 
sequence that are N- and C- terminal of the subject sequence, which are not matched/aligned with a 
corresponding subject residue, as a percent of the total bases of the query sequence. Whether a 
residue is matched/aligned is determined by results of the FASTDB sequence alignment. This 

1 0 percentage is then subtracted from the percent identity, calculated by the above FASTDB program 
using the specified parameters, to arrive at a final percent identity score. This final percent identity 
score is what is used for the purposes of the present invention, Only residues to the N- and C- 
termini of the subject sequence, which are not matched/aligned with the query sequence, are 
considered for the purposes of manually adjusting the percent identity score. That is, only query 

1 5 amino acid residues outside the farthest N- and C-terminal residues of the subject sequence. For 
example, a 90 amino acid residue subject sequence is aligned with a 100-residue query sequence to 
determine percent identity. The deletion occurs at the N-terminus of the subject sequence and 
therefore, the FASTDB alignment does not match/align with the first residues at the N-terminus. 
The 10 unpaired residues represent 10% of the sequence (number of residues at the N- and C- 

20 termini not matched/total number of residues in the query sequence) so 10% is subtracted from the 
percent identity score calculated by the FASTDB program. If the remaining 90 residues were 
perfectly matched the final percent identity would be 90%. In another example, a 90-residue 
subject sequence is compared with a 100-residue query sequence. This time the deletions are 
internal so there are no residues at the N- or C-termini of the subject sequence, which are not 

25 matched/aligned with the query. In this case the percent identity calculated by FASTDB is not 
manually corrected. Once again, only residue positions outside the N- and C-terminal ends of the 
subject sequence, as displayed in the FASTDB alignment, which are not matched/aligned with the 
query sequence are manually corrected. No other manual corrections are made for the purposes of 
the present invention. 

30 The term "percentage of sequence similarity" refers to comparisons between polypeptide 

sequences and is determined by comparing two optimally aligned sequences over a comparison 
window, wherein the portion of the polypeptide sequence in the comparison window may comprise 
additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise 
additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by 

35 determining the number of positions at which an identical or equivalent amino acid residue occurs 
in both sequences to yield the number of matched positions, dividing the number of matched 
positions by the total number of positions in the window of comparison and multiplying the result 
by 100 to yield the percentage of sequence similarity. Similarity is evaluated using any of the 
variety of sequence comparison algorithms and programs known in the art, including those 
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described above in this section, 
polypeptides" section. 
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Equivalent amino acid residues are defined herein in the "Mutated 



Polynucleotides of the invention 

5 The present invention concerns GENSET genomic and cDNA sequences. The present 

invention encompasses GENSET genes, polynucleotides comprising GENSET genomic and cDNA 
sequences, as well as fragments and variants thereof. These polynucleotides may be purified, 
isolated, or recombinant. 

Also encompassed by the present invention are allelic variants, orthologs, splice variants, 

1 0 and/or species homolpgues of the GENSET genes. Procedures known in the art can be used to 
obtain full-length genes and cDNAs, allelic variants, splice variants, full-length coding portions, 
orthologs, and/or species homologues of genes and cDNAs corresponding to a nucleotide sequence 
selected from the group consisting of sequences of SEQ ID NOs:l-169, 339-455, 561-784 and 
sequences of clone inserts of the deposited clone pool, using information from the sequences 

1 5 disclosed herein or the clone pool deposited with the ATCC or other depositary authority. For 
example, allelic variants, orthologs and/or species homologues may be isolated and identified by 
making suitable probes or primers from the sequences provided herein and screening a suitable 
nucleic acid source for allelic variants and/or the desired homologue using any technique known to 
those skilled in the art including those described into the section entitled "To find similar 

20 sequences". 

In a specific embodiment, the polynucleotides of the invention are at least 15, 30, 50, 100, 
125, 500, or 1000 continuous nucleotides. In another embodiment, the polynucleotides are less than 
or equal to 300kb, 200kb, lOOkb, 50kb, lOkb, 7.5kb, 5kb, 2.5kb, 2kb, 1.5kb, or Ikb in length. In a 
further embodiment, polynucleotides of the invention comprise a portion of the coding sequences, 
25 as disclosed herein, but do not comprise all or a portion of any intron. In another embodiment, the 
polynucleotides comprising coding sequences do not contain coding sequences of a genomic 
flanking gene (i.e., 5' or 3' to the gene of interest in the genome). In other embodiments, the 
polynucleotides of the invention do not contain the coding sequence of more than 1000, 500, 250, 
100, 75, 50, 25, 20, 15, 10, 5, 4, 3, 2, or 1 naturally occurring genomic flanking gene(s). 

30 Deposited clone pool of the invention 

Expression of GENSET genes has been shown to lead to the production of at least one 
mRNA species per GENSET gene, which cDNA sequence is set forth in the appended sequence 
listing as SEQ ID NOs: 1-169, 339-455, 561-784. The cDNAs (SEQ ID NOs:l-169, 339-455, 561- 
784) corresponding to these GENSET mRNA species were cloned either in the vector pBluescriptll 
35 SK" (Stratagene) or in a vector called pPT. Cells containing the cloned cDNAs of the present 
invention are maintained in permanent deposit by the inventors at Censet, S.A., 24 Rue Royale, 
75008 Paris, France. Each cDNA can be removed from the Bluescript vector in which it was 
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inserted by performing a Not! Pst I double digestion, or from the pPT vector by performing a Muni 
Hindlll double digestion, to produce the appropriate fragment for each clone, provided the cDNA 
sequence does not contain any of the corresponding restriction sites within its sequence. 
Alternatively, other restriction enzymes of the multicloning site of the vector may be used to 
5 recover the desired insert as indicated by the manufacturer. 

Pools of celts containing certain cDNAs of the invention, from which the cells containing a 
particular polynucleotide is obtainable, have also been deposited with the American Tissue Culture 
Collection (ATCC), 10801 University Boulevard, Manassas, VA 201 10-2209, United States. These 
cDNA clones have been transfected into separate bacterial cells (E-coli) for these composite 
10 deposits. 

Bacterial cells containing a particular clone can be obtained from the composite deposit as 

follows: 

An oligonucleotide probe or probes should be designed to the sequence that is known for 
that particular clone. This sequence can be derived from the sequences provided herein, or from a 
15 combination of those sequences. The design of the oligonucleotide probe should preferably follow 
these parameters: 

(a) It should be designed to an area of the sequence which has the fewest ambiguous bases 
("N's"), if any; 

(b) Preferably, the probe is designed to have a Tm of approximately 80 degrees Celsius 
20 (assuming 2 degrees for each A or T and 4 degrees for each G or C). However, probes having 

melting temperatures between 40 degrees Celsius and 80 degrees Celsius may also be used provided 
that specificity is not lost. 

The oligonucleotide should preferably be labeled with gamma["P] ATP (specific activity 
6000 Ci/mmole) and T4 polynucleotide kinase using commonly employed techniques for labeling 
25 oligonucleotides. Other labeling techniques can also be used. Unincorporated label should 

preferably be removed by gel filtration chromatography or other established methods. The amount 
of radioactivity incorporated into the probe should be quantified by measurement in a scintillation 
counter. Preferably, specific activity of the resulting probe should be approximately 4x10* 
dpm/pmole. 

30 The bacterial culture containing the pool of full-length clones should preferably be thawed 

and 100 ul of the stock used to inoculate a sterile culture flask containing 25 ml of sterile L-broth 
containing ampicillin at 100 ug/ml. The culture should preferably be grown to saturation at 37 
degrees Celsius, and the saturated culture should preferably be diluted in fresh L-broth. Aliquots of 
these dilutions should preferably be plated to determine the dilution and volume which will yield 

35 approximately 5000 distinct and well-separated colonies on solid bacteriological media containing 
L-broth containing ampicillin at 100 ug/ml and agar at 1.5% in a 150 mm petri dish when grown 
overnight at 37 degrees Celsius. Other known methods of obtaining distinct, well-separated 
colonies can also be employed. 
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Standard colony hybridization procedures should then be used to transfer the colonies to 
nitrocellulose filters and lyse, denature and bake them. 

The filter is then preferably incubated at 65 degrees Celsius for 1 hour with gentle agitation 
in 6X SSC (20X stock is 1 75.3 g NaCl/liter, 88.2 g Na citrate/liter, adjusted to pH 7.0 with NaOH) 
5 containing 0.5% SDS, 100 pg/ml of yeast RNA, and 10 mM EDTA (approximately 10 ml per 150 
mm filter). Preferably, the probe is then added to the hybridization mix at a concentration greater 
than or equal to 1x10^ dpm/ml. The filter is then preferably incubated at 65 degrees Celsius with 
gentle agitation overnight. The filter is then preferably washed in 500 ml of 2X SSC/0.1% SDS at 
room temperature with gentle shaking for 15 minutes. A third wash with O.IX SSC/0.5% SDS at 65 
1 0 degrees Celsius for 30 minutes to 1 hour is optional. The filter is then preferably dried and 

subjected to autoradiography for sufficient time to visualize the positives on the X-ray film. Other 
known hybridization methods can also be employed. 

The positive colonies are picked, grown in culture, and plasmid DNA isolated using 
standard procedures. The clones can then be verified by restriction analysis, hybridization analysis, 
1 5 or DNA sequencing. The plasmid DNA obtained using these procedures may then be manipulated 
using standard cloning techniques familiar to those skilled in the art. 

Alternatively, to recover cDNA inserts from the pool of bacteria, a PCR can be performed 
on plasmid DNA isolated using standard procedures and primers designed at both ends of the cDNA 
insertion, including primers designed in the multicloning site of the vector. If a specific cDNA of 
20 interest is to be recovered, primers may be designed in order to be specific for the 5' end and the 3' 
end of this cDNA using sequence information available from the appended sequence listing. The 
PCR product which corresponds to the cDNA of interest can then be manipulated using standard 
cloning techniques familiar to those skilled in the art. 

Therefore, an object of the invention is an isolated, purified, or recombinant polynucleotide 
25 comprising a nucleotide sequence selected from the group consisting of cDNA inserts of the 
deposited clone pool. Moreover, preferred polynucleotides of the invention include purified, 
isolated, or recombinant GENSET cDNAs consisting of, consisting essentially of, or comprising a 
nucleotide sequence selected from the group consisting of cDNA inserts of the deposited clone 
pool. 

30 cDNA sequences of the invention 

Another object of the invention is a purified, isolated, or recombinant polynucleotide 
comprising a nucleotide sequence selected from the group consisting of sequences of SEQ ID 
NOs:M69, 339-455, 561-784, complementary sequences thereto, and fragments thereof 
Moreover, preferred polynucleotides of the invention include purified, isolated, or recombinant 
35 GENSET cDNAs consisting of, consisting essentially of, or comprising a sequence selected from 
the group consisting of SEQ ID NOs:l-169, 339-455, 561-784. 

Accordingly, the coding sequence (CDS) or open reading frame (ORF) of each cDNA of 
the invention refers to the nucleotide sequence beginning with the first nucleotide of the start codon 
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and ending with the last nucleotide of the stop codon. Similarly, the 5* untranslated region (or 
5'UTR) of each cDNA of the invention refers to the nucleotide sequence starting at nucleotide 1 
and ending at the nucleotide immediately 5' to the first nucleotide of the start codon. The 3' 
untranslated region (or 3'UTR) of each cDNA of the invention refers to the nucleotide sequence 
S starting at the nucleotide immediately 3' to the last nucleotide of the stop codon and ending at the 
last nucleotide of the cDNA. 

Untranslated regions 

In addition, the invention concerns a purified, isolated, and recombinant nucleic acid 
comprising a nucleotide sequence selected from the group consisting of the 5'UTRs of sequences of 

10 SEQ ID >JOs:l-l69, 339-455, 561-784 and sequences of clone inserts of the deposited clone pool, 
sequences complementary thereto, and allelic variants thereof. The invention also concerns a 
purified, isolated, and/or recombinant nucleic acid comprising a nucleotide sequence selected from 
the group consisting of the 3'UTRs of sequences of SEQ ID NOs:l-169, 339-455, 561-784 and 
sequences of clone inserts of the deposited clone pool, sequences complementary thereto, and allelic 

15 variants thereof. 

These polynucleotides may be used to detect the presence of GENSET mRNA species in a 
biological sample using either hybridization or RT-PCR techniques well known to those skilled in 
the art. 

In addition, these polynucleotides may be used as regulatory molecules able to affect the 
20 processing and maturation of any polynucleotide including them (either a GENSET polynucleotide 
or an heterologous polynucleotide), preferably the localization, stability and/or translation of said 
polynucleotide including them (for a review on UTRs see Decker and Parker, 1995, Derrigo et ai, 
2000). In particular, 3 'UTRs may be used in order to control the stability of heterologous mRNAs 
in recombinant vectors using any methods known to those skilled in the art including Makrides 
25 (( 1 999) Protein Expr Purif 1 999 Nov; 1 7(2): 1 83-202), US Patents 5,925.564; 5,807,707 and 
5,756,264, which disclosures are hereby incorporated by reference in their entireties. 

Coding sequences 

Another object of the invention is an isolated, purified or recombinant polynucleotide 
comprising the coding sequence of a sequence selected from the group consisting of sequences of 
30 SEQ ID NOs: 1-169, 339-455, 561-784, clone inserts of the deposited clone pool, and variants 
thereof. 

A further object of the invention is an isolated, purified or recombinant polynucleotide 
encoding a polypeptide comprising a sequence selected fi'om the group consisting of sequences of 
SEQ ID NOs: 1 70-338, 456-560, 785-9 1 8 and allelic variants thereof. Another object of the 
35 invention is an isolated, purified or recombinant polynucleotide encoding a polypeptide comprising 
a sequence selected from the group consisting of polypeptides encoded by cDNA inserts of the 
deposited clone pool and allelic variants thereof 
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It will be appreciated that should the extent of the coding sequence differ from that 
indicated in the appended sequence listing as a result of a sequencing error, reverse transcription or 
amplification error, mRNA splicing, post-translational modification of the encoded protein, 
enzymatic cleavage of the encoded protein, or other biological factors, one skilled in the art would 
5 be readily able to identify the extent of the coding sequences in the sequences of SEQ ID NOs: 1 - 
169, 339-4SS, 561-784. Accordingly, the scope of any claims herein relating to nucleic acids 
containing the coding sequence of one of SEQ ID NOs: 1-169, 339-455, 561-784 is not to be 
construed as excluding any readily identifiable variations from or equivalents to the coding 
sequences described in the appended sequence listing. Equivalents includes any alterations in a 

1 0 nucleotide coding sequence that does not result in an amino acid change, or that results in a 
conservative amino acid substitution, as defined below, in the polypeptide encoded by the 
nucleotide sequence. Similarly, should the extent of the polypeptides differ from those indicated in 
the appended sequence listing as a result of any of the preceding factors, the scope of claims 
relating to polypeptides comprising the amino acid sequence of the polypeptides of SEQ ID 

15 NOs: 170-338, 456-560, 785-91 8 is not to be construed as excluding any readily identifiable 
variations from or equivalents to the sequences described in the appended sequence listing. 

The above-disclosed polynucleotides that contain the coding sequence of the GENSET 
genes may be expressed in a desired host cell or a desired host organism, when this polynucleotide 
is placed under the control of suitable expression signals. The expression signals may be either the 

20 expression signals contained in the regulatory regions in the GENSET genes of the invention or, in 
contrast, the signals may be exogenous regulatory nucleic sequences. Such a polynucleotide, when 
placed under the suitable expression signals, tnay also be inserted in a vector for its expression 
and/or amplification. 

Further included in the present invention are polynucleotides encoding the polypeptides of 
25 the present invention that are fused in frame to the coding sequences for additional heterologous 
amino acid sequences. Also included in the present invention are nucleic acids encoding 
polypeptides of the present invention together with additional, non-coding sequences, including, but 
not limited to, non-coding 5' and 3' sequences, vector sequence, sequences used for purification, 
probing, or priming. For example, heterologous sequences include transcribed, untranslated 
30 sequences that may play a role in transcription and mRNA processing, such as ribosome binding 
and stability of mRNA. The heterologous sequences may alternatively comprise additional coding 
sequences that provide additional functionalities. Thus, a nucleotide sequence encoding a 
polypeptide may be fused to a tag sequence, such as a sequence encoding a peptide that facilitates 
purification or detection of the fused polypeptide. In certain preferred embodiments of this aspect 
35 of the invention, the tag amino acid sequence is a hexa-histidine peptide, such as the tag provided in 
a pQE vector (QIAGEN), or in any of a number of additional, commercially available vectors. For 
instance, hexa-histidine provides for the convenient purification of the fusion protein (see, Gentz et 
a/., 1989, Proc Natl Acad Sci U S A Feb;86(3):821-4, the disclosure of which is incorporated by 
reference in its entirety). The "HA" tag is another peptide useful for purification which corresponds 
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to an epitope derived from the influenza hemagglutinin protein (see, Wilson et al., 1984, Cell 
Jul;37(3):767-78, the disclosure of which is incorporated by reference in its entirety). As discussed 
below, other such fusion proteins include a GEIMSET polypeptide fused to Fc at the N- or 
C-terminus. 

S Suitable recombinant vectors that contain a polynucleotide such as described herein are 

disclosed elsewhere in the specification. Expression vectors encoding GENSET polypeptides or 
fragments thereof are described in the section entitled "Preparation of the polypeptides". 

Regulatory sequences of the invention 

As mentioned, the genomic sequence of GENSET genes contain regulatory sequences in 
10 the non-coding 5'-flanking region and possibly in the non-coding 3'-flanking region that border the 
GENSET polypeptide coding regions containing the exons of these genes. 

Polynucleotides derived from GENSET polynucleotide 5* and 3' regulatory regions are 
useful in order to detect the presence of at least a copy of a genomic nucleotide sequence of the 
GENSET gene or a fragment thereof in a test sample. 

15 Preferred regulatorv sequences 

Polynucleotides carrying the regulatory elements located at the 5' end and at the 3' end of 
GENSET polypeptide coding regions may be advantageously used to control, e.g., the 
transcriptional and translational activity of a heterologous polynucleotide of interest. 

Thus, the present invention also concerns a purified or isolated nucleic acid comprising a 

20 polynucleotide which is selected from the group consisting of the 5' and 3' GENSET 

polynucleotide regulatory regions, sequences complementary thereto, regulatory active fragments 
and variants thereof. The invention also pertains to a purified or isolated nucleic acid comprising a 
polynucleotide having at least 95% nucleotide identity with a polynucleotide selected from the 
group consisting of GENSET polynucleotide 5' and 3' regulatory regions, advantageously 99 % 

25 nucleotide identity, preferably 99.5% nucleotide identity and most preferably 99.8% nucleotide 
identity with a polynucleotide selected from the group consisting of GENSET polynucleotide 5' and 
3' regulatory regions, sequences complementary thereto, variants and regulatory active fragments 
thereof 

Another object of the invention consists of purified, isolated or recombinant nucleic acids 
30 comprising a polynucleotide that hybridizes, under the stringent hybridization conditions defined 
herein, with a polynucleotide selected from the group consisting of the nucleotide sequences of 
GENSET polynucleotide 5' and 3' regulatory regions, sequences complementary thereto, variants 
and regulatory active fragments thereof. 

Preferred fragments of 5' regulatory regions have a length of about 1500 or 1000 
35 nucleotides, preferably of about 500 nucleotides, more preferably about 400 nucleotides, even more 
preferably 300 nucleotides and most preferably about 200 nucleotides. 
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Preferred fragments of 3' regulatory regions are at least 20, 50. 1 00, 1 50, 200. 300 or 400 
bases in length. 

"Regulatory active" polynucleotide derivatives of the 5' or 3' regulatory region are 
polynucleotides comprising or alternatively consisting of a fragment of said polynucleotide which is 
5 functional as a regulatory region for expressing a recombinant polypeptide or a recombinant 
polynucleotide in a recombinant cell host. It could act either as an enhancer or as a repressor. For 
the purpose of the invention, a nucleic acid or polynucleotide is ''functional" as a regulatory region 
for expressing a recombinant polypeptide or a recombinant polynucleotide if said regulatory 
polynucleotide contains nucleotide sequences which contain transcriptional and translational 

10 regulatory information, and such sequences are "operably linked" to nucleotide sequences which 
encode the desired polypeptide or the desired polynucleotide. 

The regulatory polynucleotides of the invention may be prepared from the nucleotide 
sequence of GENSET genomic or cDNA sequence, for example, by cleavage using suitable 
restriction enzymes, or by PCR. The regulatory polynucleotides may also be prepared by digestion 

15 of a GENSET gene-containing genomic clone by an exonuclease enzyme, such as Bal3 1 (Wabiko et 
aL, DNA 5(4):305-l4 (1986), the disclosure of which is incorporated by reference in its entirety). 
These regulatory polynucleotides can also be prepared by nucleic acid chemical synthesis, as 
described elsewhere in the specification. 

The regulatory polynucleotides according to the invention may be part of a recombinant 

20 expression vector that may be used to express a coding sequence in a desired host ceil or host 
organism. The recombinant expression vectors according to the invention are described elsewhere 
in the specification. 

Preferred 5'-regulatory polynucleotides of the invention include 5'-UTRs of GENSET 
cDNAs, or regulatory active fragments or variants thereof. More preferred 5'-regulatory 
25 polynucleotides of the invention include sequences selected from the group consisting of 5'-UTRs 
of sequences of SEQ ID NOs:l-169, 339-455, 561-784, 5'-UTRs of clone inserts of the deposited 
clone pool, regulatory active fragments and variants thereof. 

Preferred 3 '-regulatory polynucleotide of the invention include 3'-UTRs of GENSET 
cDNAs, or regulatory active fragments or variants thereof. More preferred 3'-regulatory 
30 polynucleotides of the invention include sequences selected from the group consisting of 3'-UTRs 
of sequences of SEQ ID NOs:l-169, 339-455, 561-784, 3'-UTRs of clone inserts of the deposited 
clone pool, regulatory active fragments and variants thereof. 

A further object of the invention consists of a purified or isolated nucleic acid comprising: 
a) a polynucleotide comprising a 5' regulatory nucleotide sequence selected from the group 
35 consisting of: 

(i) a nucleotide sequence comprising a polynucleotide of a GENSET polynucleotide 5* 
regulatory region or a complementary sequence thereto; 
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(ii) a nucleotide sequence comprising a polynucleotide having at least 95% of nucleotide 
identity with the nucleotide sequence of a GENSET polynucleotide 5' regulatory region or a 
complementary sequence thereto; 

(iii) a nucleotide sequence comprising a polynucleotide that hybridizes under stringent 

5 hybridization conditions with the nucleotide sequence of a GENSET polynucleotide 5' regulatory 
region or a complementary sequence thereto; and 

(iv) a regulatory active fragment or variant of the polynucleotides in (i), (ii) and (iii); 

b) a nucleic acid molecule encoding a desired polypeptide or a nucleic acid molecule of 
interest, wherein said nucleic acid molecule is operably linked to the polynucleotide defined in (a); 

10 and 

c) optionally, a polynucleotide comprising a 3*- regulatory polynucleotide, preferably a 3'- 
regulatory polynucleotide of a GENSET gene. 

In a specific embodiment, the nucleic acid defined above includes the 5'-UTR of a 

GENSET cDNA, or a regulatory active fragment or variant thereof 
15 In a second specific embodiment, the nucleic acid defined above includes the 3'-UTR of a 

GENSET cDNA, or a regulatory active fragment or variant thereof. 

The regulatory polynucleotide of the 5' regulatory region, or its regulatory active fragments 

or variants, is operably linked at the S'-end of the nucleic acid molecule encoding the desired 

polypeptide or nucleic acid molecule of interest. 
20 The regulatory polynucleotide of the 3' regulatory region, or its regulatory active fragments 

or variants, is advantageously operably linked at the 3 '-end of the nucleic acid molecule encoding 

the desired polypeptide or nucleic acid molecule of interest. 

The desired polypeptide encoded by the above-described nucleic acid may be of various 

nature or origin, encompassing proteins of prokaryotic viral or eukaryotic origin. Among the 
25 polypeptides expressed under the control of a GENSET polynucleotide regulatory region include 

bacterial, fungal or viral antigens. Also encompassed are eukaryotic proteins such as intracellular 

proteins, such as "house-keeping" proteins, membrane-bound proteins, such as mitochondrial 

membrane-bound proteins and cell surface receptors, and secreted proteins such as endogenous 

mediators such as cytokines. The desired polypeptide may be an heterologous polypeptide or a 
30 GENSET polypeptide, especially a protein with an amino acid sequence selected from the group 

consisting of sequences of SEQ ID NOs: 1 70-338, 456-560, 785-9 1 8, fragments and variants 

thereof 

The desired nucleic acids encoded by the above-described polynucleotides, usually an RNA 
molecule, may be complementary to a desired coding polynucleotide, for example to a GENSET 
35 coding sequence, and thus useful as an antisense polynucleotide. Such a polynucleotide may be 
included in a recombinant expression vector in order to express the desired polypeptide or the 
desired nucleic acid in host cell or in a host organism. Suitable recombinant vectors that contain a 
polynucleotide such as described herein are disclosed elsewhere in the specification. 
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The invention also relates to variants of the polynucleotides described herein and fragments 
thereof "Variants" of polynucleotides, as the term is used herein, are polynucleotides that differ 
from a reference polynucleotide. Generally, differences are limited so that the nucleotide sequences 
5 of the reference and the variant are closely similar overall and, in many regions, identical. The 
present invention encompasses both allelic variants and degenerate variants. 

Examples of variant sequences of polynucleotides of the invention are given in the 
appended sequence listing. Specifically, Table I includes sequences for which a plurality of closely 
related sequences, e.g. variants, are provided. 

10 Allelic variants 

A variant of a polynucleotide may be a naturally occurring variant such as a naturally 
occurring allelic variant, or it may be a variant that is not known to occur naturally. By an "allelic 
variant" is intended one of several alternate forms of a gene occupying a given locus on a 
chromosome of an organism (see Lewin, 1990), the disclosure of which is incorporated by reference 
1 5 in its entirety. Diploid organisms may be homozygous or heterozygous for an allelic form. Non- 
naturally occurring variants of the polynucleotide may be made by art-known mutagenesis 
techniques, including those applied to polynucleotides, cells or organisms. See, for example. Table 
I, which includes sequences for which a plurality of closely related sequences, e.g. allelic variants of 
a single gene, are provided. 

20 Degenerate variant 

In addition to the isolated polynucleotides of the present invention, and fragments thereof, 
the invention further includes polynucleotides which comprise a sequence substantially different 
from those described above but which, due to the degeneracy of the genetic code, still encode a 
GENSET polypeptide of the present invention. These polynucleotide variants are referred to as 

25 "degenerate variants" throughout the instant application. That is, all possible polynucleotide 
sequences that encode the GENSET polypeptides of the present invention are contemplated. This 
includes the genetic code and species-specific codon preferences known in the art. Thus, it would 
be routine for one skilled in the art to generate the degenerate variants described above, for instance, 
to optimize codon expression for a particular host (e.g., change codons in the human mRNA to 

30 those preferred by other mammalian or bacterial host cells). 

Nucleotide changes present in a variant polynucleotide may be silent, which means that 
they do not alter the amino acids encoded by the polynucleotide. However, nucleotide changes may 
also result in amino acid substitutions, additions, deletions, fusions and truncations in the 
polypeptide encoded by the reference sequence. The substitutions, deletions or additions may 

35 involve one or more nucleotides. The variants may be altered in coding or non-coding regions or 
both. Alterations in the coding regions may produce conservative or non-conservative amino acid 
substitutions, deletions or additions. In the context of the present invention, preferred embodiments 
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are those in which the polynucleotide variants encode polypeptides which retain substantially the 
same biological properties or activities as the GENSET protein. More preferred polynucleotide 
variants are those containing conservative substitutions. 

Similar polynucleotides 

S Other embodiments of the present invention provide a purified, isolated or recombinant 

polynucleotide which is at least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to a 
polynucleotide selected from the group consisting of sequences of SEQ ID NOs:l-169, 339-455, 
561-784 and the clone inserts of the deposited clone pool. The above polynucleotides are included 
regardless of whether they encode a polypeptide having a GENSET biological activity. This is 

10 because even where a particular nucleic acid molecule does not encode a polypeptide having 
activity, one of skill in the art would still know how to use the nucleic acid molecule, for instance, 
as a hybridization probe or primer. Uses of the nucleic acid molecules of the present invention that 
do not encode a polypeptide having GENSET activity include, inter alia, isolating a GENSET gene . 
or allelic variants thereof from a DNA library, and detecting GENSET mRNA expression in 

1 5 biological samples suspected of containing GENSET mRNA or DNA, e.g., by Northern Blot or 
PCR analysis. 

The present invention is further directed to polynucleotides having sequences at least 50%. 
60%. 70%. 80%, 90%, 95%, 96%. 97%, 98% or 99% identity to a polynucleotide selected from the 
group consisting of sequences of SEQ ID NOs: 1-169, 339-455, 561-784 and clone inserts of the 

20 deposited clone pool, where said polynucleotide do, in fact, encode a polypeptide having a 

GENSET biological activity. Of course, due to the degeneracy of the genetic code, one of ordinary 
skill in the art will immediately recognize that a large number of the polynucleotides at least 50%. 
60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, or 99% identical to a polynucleotide selected from 
the group consisting of sequences of SEQ ID NOs: 1-1 69, 339-455, 561-784 and clone inserts of the 

25 deposited clone pool will encode a polypeptide having biological activity. In fact, since degenerate 
variants of these nucleotide sequences all encode the same polypeptide, this will be clear to the 
skilled artisan even without performing the above described comparison assay. It will be further 
recognized in the art that, for such nucleic acid molecules that are not degenerate variants, a 
reasonable number will also encode a polypeptide having biological activity. This is because the 

30 skilled artisan is fully aware of amino acid substitutions that are either less likely or not likely to 
significantly affect protein function (e.g., replacing one aliphatic amino acid with a second aliphatic 
amino acid), as further described below. By a polynucleotide having a nucleotide sequence at least, 
for example, 95% "identical" to a reference nucleotide sequence of the present invention, it is 
intended that the nucleotide sequence of the polynucleotide is identical to the reference sequence 

35 except that the polynucleotide sequence may include up to five point mutations per each 100 
nucleotides of the reference nucleotide sequence encoding the GENSET polypeptide. In other 
words, to obtain a polynucleotide having a nucleotide sequence at least 95% identical to a reference 
nucleotide sequence, up to 5% of the nucleotides in the reference sequence may be deleted, inserted, 
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or substituted with another nucleotide. The query sequence may be an entire sequence selected 
from the group consisting of sequences of SEQ ID NOs:l-169, 339-455, 561-784 and sequences of 
clone inserts of the deposited clone pool, or the ORF (open reading frame) of a polynucleotide 
sequence selected from said group, or any fragment specified as described herein. 

5 Hybridizing Polynucleotides 

In another aspect, the invention provides an isolated or purified nucleic acid molecule 
comprising a polynucleotide which hybridizes under stringent hybridization conditions to any 
polynucleotide of the present invention using any methods known to those skilled in the art 
including those disclosed herein and in particular in the "To find similar sequences" section. Also 

10 contemplated are nucleic acid molecules that hybridize to the polynucleotides of the present 
invention at lower stringency hybridization conditions, preferably at moderate or low stringency 
conditions as defined herein. Such hybridizing polynucleotides may be of at least 15,18, 20, 23, 25, 
28, 30, 35, 40, 50, 75, 100, 200, 300, 500 or 1000 nucleotides in length. 

Of particular interest are polynucleotides hybridizing to any polynucleotide of the invention 

1 5 and encoding GENSET polypeptides, particularly GENSET polypeptides exhibiting a GENSET 
biological activity. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such as any 3' 
terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 5' complementary stretch 
of T (or U) residues, would not be included in the definition of "polynucleotide," since such a 
20 polynucleotide would hybridize to any nucleic acid molecule containing a poly(A) stretch or the 
complement thereof (e.g., practically any double-stranded cDNA clone generated using oligo dT as 
a primer). 

Complementary polynucleotides 

The invention further provides isolated nucleic acid molecules having a nucleotide 
25 sequence fully complementary to any polynucleotide of the invention. The present invention 
encompasses a purified, isolated or recombinant polynucleotide having a nucleotide sequence 
complementary to a sequence selected from the group consisting of sequences of SEQ ID NOs:l- . 
169, 339-455, 561-784, sequences of clone inserts of the deposited clone pool and fragments 
thereof Such isolated molecules, particularly DN A molecules, are useful as probes for gene 
30 mapping and for identifying GENSET mRNA in a biological sample, for instance, by PGR or 
Northern blot analysis. 

Polynucleotide fragments 

The present invention is further directed to polynucleotides encoding portions or fragments 
of the nucleotide sequences described herein. Uses for the polynucleotide fragments of the present 
35 invention include probes, primers, molecular weight markers and for expressing the polypeptide 
fragments of the present invention. Fragments include portions of polynucleotides selected from 
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the group consisting of a) the sequences of SEQ IDNOs:l-169, 339-455, 561-784, b) genomic 
GENSET sequences, c) the polynucleotides encoding a polypeptide selected from the group 
consisting of the sequences of SEQ ID NOs: 170-338, 456-560, 785-918, d) the sequences of clone 
inserts of the deposited clone pool, and e) the polynucleotides encoding the polypeptides encoded 
5 by the clone inserts of the deposited clone pool. Particularly included in the present invention is a 
purified or isolated polynucleotide coniprising at least 8 consecutive bases of a polynucleotide of 
the present invention. In one aspect of this embodiment, the polynucleotide comprises at least 10, 
12, 15, 18, 20, 25, 28, 30, 35, 40. 50, 75, 100, 150, 200, 300, 400, 500, 800, 1000, 1500, or 2000 
consecutive nucleotides of a polynucleotide of the present invention. 

10 In addition to the above preferred polynucleotide sizes, further preferred sub-genuses of 

polynucleotides comprise at least 8 nucleotides, wherein "at least 8" is defined as any integer 
between 8 and the integer representing the 3* most nucleotide position as set forth in the sequence 
listing or elsewhere herein. Further included as preferred polynucleotides of the present invention 
are polynucleotide fragments at least 8 nucleotides in length, as described above, that are further 

15 specified in terms of their 5' and 3' position. The 5* and 3' positions are represented by the position 
numbers set forth in the appended sequence listing. For allelic, degenerate and other variants, 
position 1 is defined as the 5* most nucleotide of the ORF, i.e., the nucleotide "A" of the start codon 
with the remaining nucleotides numbered consecutively. Therefore, every combination of a 5' and 
3' nucleotide position that a polynucleotide fragment of the present invention, at least 8 contiguous 

20 nucleotides in length, could occupy on a polynucleotide of the invention is included in the invention 
as an individual species. The polynucleotide fragments specified by 5' and 3* positions can be 
immediately envisaged and are therefore not individually listed solely for the purpose of not 
unnecessarily lengthening the specification. 

It is noted that the above species of polynucleotide fragments of the present invention may 

25 alternatively be described by the formula "a to b"; where "a" equals the 5' most nucleotide position 
and "b" equals the 3' most nucleotide position of the polynucleotide; and further where "a" equals 
an integer between 1 and the number of nucleotides of the polynucleotide sequence of the present 
invention minus 8, and where "b" equals an integer between 9 and the number of nucleotides of the 
polynucleotide sequence of the present invention; and where "a" is an integer smaller then "b" by at 

30 leasts. 

Therefore, the present invention encompasses isolated, purified, or recombinant 

polynucleotides which consist of, consist essentially of, or comprise a contiguous span of at least 8, 
10, 12, 15, 18,20,25, 28, 30,35,40, 50,75. 100, 150,200,300,400, 500, 1000 or 2000 nucleotides 
of a sequence selected from the group consisting of the sequences of SEQ ID NOs: 1-169, 339-455, 
35 561-784 and sequences fully complementary thereto. 

Other preferred fragments of the invention are polynucleotides comprising polynucleotides 
encoding domains of polypeptides. Such fragments may be used to obtain other polynucleotides 
encoding polypeptides having similar domains using hybridization or RT-PCR techniques. 
Alternatively, these fragments may be used to express a polypeptide domain which may have a 
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specific biological property. Thus, another object of the invention is an isolated, purified or 
recombinant polynucleotide encoding a polypeptide consisting of, consisting essentially of, or 
comprising a contiguous span of at least 5, 6, 8, 10, 12, 15,20, 25, 30,35,40, 50, 60, 75, 100, 150 
or 200 consecutive amino acids of a sequence selected from the group consisting of the sequences 
5 of SEQ ID NOs: 1 70-338, 456-560, 785-91 8, to the extent that a contiguous span of these lengths is 
consistent with the lengths of said selected sequence, where said contiguous span comprises at least 
1, 2, 3, 5, or 10 of the amino acid positions of a domain of said selected sequence. The present 
invention also encompasses isolated, purified or recombinant polynucleotides encoding a 
polypeptide comprising a contiguous span of at least 5, 6, 8, 10, 12, 15, 20, 25, 30, 35, 40, 50, 60, 

10 75, 1 00, 1 50 or 200 consecutive amino acids of a sequence selected from the group consisting of 
sequences of SEQ ID NOs: 1 70-338, 456-560, 785-91 8, to the extent that a contiguous span of these 
lengths is consistent with the lengths of said selected sequence, where said contiguous span is a 
domain of said selected sequence. The present invention also encompasses isolated, purified or 
recombinant polynucleotides encoding a polypeptide comprising a domain of a sequence selected 

15 from the group consisting of the sequences of SEQ ID NOs: 170-338, 456-560, 785-918. 

The present invention further encompasses any combination of the polynucleotide 
fragments listed in this section. 

Oligonucleotide primers and probes 

The present invention also encompasses fragments of GENSET polynucleotides for use as 
20 primers and probes. Polynucleotides derived from the GENSET genomic and cDNA sequences are 
useful in order to detect the presence of at least a copy of a GENSET polynucleotide or fragment, 
complement, or variant thereof in a test sample. 

Structural definition 

Any polynucleotide of the invention may be used as a primer or probe. Particularly 
25 preferred probes and primers of the invention include isolated, purified, or recombinant 

polynucleotides comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 
80, 90, 100, 150, 200, 500, or 1000 nucleotides of a sequence selected from the group consisting of 
the GENSET genomic sequences, the cDNA sequences and the sequences fully complementary 
thereto. Another object of the invention is a purified, isolated, or recombinant polynucleotide 
30 comprising the nucleotide sequence of a sequence selected from the group consisting of the 
sequences of SEQ ID NOs: 1-1 69, 339-455, 561-784, sequences of clone inserts of the deposited 
clone pool, sequences fully complementary thereto, allelic variants thereof, and fragments thereof. 
Moreover, preferred probes and primers of the invention include purified, isolated, or recombinant 
GENSET cDNAs consisting of, consisting essentially of, or comprising the sequences of SEQ ID 
35 NOs: 1-1 69, 339-455, 561-784 and sequences of clone inserts of the deposited clone pool. 

Particularly preferred probes and primers of the invention include isolated, purified, or recombinant 
polynucleotides comprising a contiguous span of at least 12, 1 5, 1 8, 20, 25, 30, 35, 40, 50, 60, 70, 
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80, 90, 100, 150, 200, 500, or 1000 nucleotides of a sequence selected from the group consisting of 
the sequences of SEQ ID NOs:l-169, 339-455, 561-784 and the sequences fully complementary 
thereto. 

Design of primers and probes 

5 A probe or a primer according to the invention has between 8 and 1000 nucleotides in 

length, or is specified to be at least 12. 1 5, 18, 20, 25, 35, 40. 50, 60, 70, 80, 100, 250, 500 or 1000 
nucleotides in length. More particularly, the length of these probes and primers can range from 8, 
10, 15, 20, or 30 to 100 nucleotides, preferably from 10 to 50, more preferably from 15 to 30 
nucleotides. Shorter probes and primers tend to lack specificity for a target nucleic acid sequence 

10 and generally require cooler temperatures to form sufficiently stable hybrid complexes with the 
template. Longer probes and primers are expensive to produce and can sometimes self-hybridize to 
form hairpin structures. The appropriate length for primers and probes under a particular set of 
assay conditions may be empirically determined by one of skill in the art. The formation of stable 
hybrids depends on the melting temperature (Tm) of the DNA. The Tm depends on the length of 

1 5 the primer or probe, the ionic strength of the solution and the G+C content. The higher the G+C 
content of the primer or probe, the higher is the melting temperature because G:C pairs are held by 
three H bonds whereas A:T pairs have only two. The GC content in the probes of the invention 
usually ranges between 10 and 75%, preferably between 35 and 60%, and more preferably between 
40 and 55%. 

20 For amplification purposes, pairs of primers with approximately the same Tm are 

preferable. Primers may be designed using the OSP software (Hillier and Green, 1991), the 
disclosure of which is incorporated by reference in its entirety, based on GC content and melting 
temperatures of oligonucleotides, or using PC-Rare (http:// 

bioinformatics.weizmann.ac.il/software/PC-Rare/doc/manuel.html) based on the octamer frequency 
25 disparity method (Griffais et aL , 1 99 1 ), the disclosure of which is incorporated by reference in its 
entirety. DNA amplification techniques are well known to those skilled in the art. Amplification 
techniques that can be used in the context of the present invention include, but are not limited to, the 
ligase chain reaction (LCR) described in EP-A- 320 308, WO 9320227 and EP-A-439 1 82, the 
polymerase chain reaction (PGR, RT-PCR) and techniques such as the nucleic acid sequence based 
30 amplification (NASBA) described in Guatelli etal{\99Q) and in Compton (1991), Q-beta 
amplification as described in European Patent Application No 4544610, strand displacement 
amplification as described in Walker a/. (1996) and EP A 684 315 and, target mediated 
amplification as described in PCT Publication WO 9322461, the disclosures of which are 
incorporated by reference in their entireties. 
35 LCR and Gap LCR are exponential amplification techniques, both depending on DNA 

ligase to join adjacent primers annealed to a DNA molecule. In Ligase Chain Reaction (LCR), 
probe pairs are used which include two primary (first and second) and two secondary (third and 
fourth) probes, all of which are employed in molar excess to target. The first probe hybridizes to a 
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first segment of the target strand and the second probe hybridizes to a second segment of the target 
strand, the first and second segments being contiguous so that the primary probes abut one another 
in 5* phosphate-3'hydroxyl relationship, and so that a ligase can covalently fuse or ligate the two 
probes into a fused product. In addition, a third (secondary) probe can hybridize to a portion of the 
5 first probe and a fourth (secondary) probe can hybridize to a portion of the second probe in a similar 
abutting fashion. Of course, if the target is initially double stranded, the secondary probes also will 
hybridize to the target complement in the first instance. Once the ligated strand of primary probes 
is separated from the target strand, it will hybridize with the third and fourth probes, which can be 
ligated to form a complementary, secondary ligated product. It is important to realize that the 

1 0 ligated products are functionally equivalent to either the target or its complement. By repeated 
cycles of hybridization and ligation, amplification of the target sequence is achieved. A method for 
multiplex LCR has also been described (WO 9320227), the disclosure of which is incorporated by 
reference in its entirety. Gap LCR (GLCR) is a version of LCR where the probes are not adjacent 
but are separated by 2 to 3 bases. 

15 For amplification of mRNAs, it is within the scope of the present invention to reverse 

transcribe mRNA into cDNA followed by polymerase chain reaction (RT-PCR); or, to use a single 
enzyme for both steps as described in U.S. Patent No. 5,322,770 or, to use Asymmetric Gap LCR 
(RT-AGLCR) as described by Marshall et a/.(1994), the disclosures of which are incorporated by 
reference in its entireties. AGLCR is a modification of GLCR that allows the amplification of 

20 RNA. 

PCR technology is the preferred amplification technique used in the present invention. A 
variety of PCR techniques are familiar to those skilled in the art. For a review of PCR technology, 
see White (1997), Eriich (1992) and the publication entitled "PCR Methods and Applications" 
((1991) Cold Spring Harbor Laboratory Press), the disclosures of which are incorporated by 

25 reference in their entireties. In each of these PCR procedures, PCR primers on either side of the 
nucleic acid sequences to be amplified are added to a suitably prepared nucleic acid sample along 
with dNTPs and a thermostable polymerase such as Taq polymerase, Pfu polymerase, Tth 
polymerase or Vent polymerase. The nucleic acid in the sample is denatured and the PCR primers 
are specifically hybridized to complementary nucleic acid sequences in the sample. The hybridized 

30 primers are extended. Thereafter, another cycle of denaturation, hybridization, and extension is 
initiated. The cycles are repeated multiple times to produce an amplified fragment containing the 
nucleic acid sequence between the primer sites. PCR has further been described in several patents 
including US Patent Nos. 4,683,195; 4,683,202; and 4,965,188, the disclosures of which are 
incorporated herein by reference in their entireties. 

35 Preparation of primers and probes 

Primers and probes can be prepared by any suitable method, including, for example, 
cloning and restriction of appropriate sequences and direct chemical synthesis by a method such as 
the phosphodiester method of Narang et a/.(1979), the phosphodiester method of Brown et 
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(7/.(1979), the diethylphosphoramidite method of Beaucage et al.{\9Z\) and the solid support 
method described in EP 0 707 592, which disclosures are hereby incorporated by reference in their 
entireties. 

Detection probes are generally nucleic acid sequences or uncharged nucleic acid analogs 
5 such as, for example peptide nucleic acids which are disclosed in International Patent Application 
WO 92/20702, morpholino analogs which are described in U.S. Patent Nos, 5,1 85,444; 5,034,506 
and 5,142,047, which disclosures are hereby incorporated by reference in their entireties. The probe 
may have to be rendered "non-extendable" in that additional dNTPs cannot be added to the probe. 
In and of themselves analogs usually are non-extendable and nucleic acid probes can be rendered 

10 non-extendable by modifying the 3' end of the probe such that the hydroxyl group is no longer 
capable of participating in elongation. For example, the 3' end of the probe can be functional ized 
with the capture or detection label to thereby consume or otherwise block the hydroxyl group. 
Alternatively, the 3* hydroxyl group simply can be cleaved, replaced or modified, U.S. Patent 
Application Serial No. 07/049,061 filed April 19, 1993, which disclosure is hereby incorporated by 

15 reference in its entirety, describes modifications, which can be used to render a probe non- 
extendable. 

Labeling of probes 

Any of the polynucleotides of the present invention can be labeled, if desired, by 

20 incorporating any label known in the art to be detectable by spectroscopic, photochemical, 

biochemical, immunochemical, or chemical means. For example, useful labels include radioactive 
substances (including, "P, ^^S, *^H, '^^I), fluorescent dyes (including, 5-bromodesoxyuridin, 
fluorescein, acetylaminofluorene, digoxigenin) or biotin. Preferably, polynucleotides are labeled at 
their 3' and 5' ends. Examples of non-radioactive labeling of nucleic acid fragments are described 

25 in the French patent No. FR.78 1 0975 or by Urdea €tal{\ 988) or Sanchez-Pescador e(al{\ 988), 
which disclosures are hereby incorporated by reference in their entireties. In addition, the probes 
according to the present invention may have structural characteristics such that they allow the signal 
amplification, such structural characteristics being, for example, branched DNA probes as those 
described by Urdea et al in 1 991 or in the European patent No. EP 0 225 807 (Chiron), which 

30 disclosures are hereby incorporated by reference in their entireties. 

The detectable probe may be single stranded or double stranded and may be made using 
techniques known in the art, including in vitro transcription, nick translation, or kinase reactions, A 
nucleic acid sample containing a sequence capable of hybridizing to the labeled probe is contacted 
with the labeled probe. If the nucleic acid in the sample is double stranded, it may be denatured 

35 prior to contacting the probe. In some applications, the nucleic acid sample may be immobilized on 
a surface such as a nitrocellulose or nylon membrane. The nucleic acid sample may comprise 
nucleic acids obtained from a variety of sources, including genomic DNA, cDNA libraries, RNA, or 
tissue samples. 
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Procedures used to detect the presence of nucleic acids capable of hybridizing to the 
detectable probe include well known techniques such as Southern blotting, Northern blotting, dot 
blotting, colony hybridization, and plaque hybridization. In some applications, the nucleic acid 
capable of hybridizing to the labeled probe may be cloned into vectors such as expression vectors, 
S sequencing vectors, or in vitro transcription vectors to facilitate the characterization and expression 
of the hybridizing nucleic acids in the sample. For example, such techniques may be used to isolate 
and clone sequences in a genomic library or cDNA library which are capable of hybridizing to the 
detectable probe as described herein. 

Immobilization of probes 

10 A label can also be used to capture the primer, so as to facilitate the immobilization of 

either the primer or a primer extension product, such as amplified DNA, on a solid support. A 
capture label is attached to the primers or probes and can be a specific binding member which forms 
a binding pair with the solid phase reagent's specific binding member (e.g. biotin and streptavidin). 
Therefore depending upon the type of label carried by a polynucleotide or a probe, it may be 

1 5 employed to capture or to detect the target DNA. Further, it will be understood that the 

polynucleotides, primers or probes provided herein, may, themselves, serve as the capture label. 
For example, in the case where a solid phase reagent's binding member is a nucleic acid sequence, 
it may be selected such that it binds a complementary portion of a primer or probe to thereby 
immobilize the primer or probe to the solid phase. In cases where a polynucleotide probe itself 

20 serves as the binding member, those skilled in the art will recognize that the probe will contain a 
sequence or "tail" that is not complementary to the target. In the case where a polynucleotide 
primer itself serves as the capture label, at least a portion of the primer will be free to hybridize with 
a nucleic acid on a solid phase. DNA Labeling techniques are well known to the skilled technician. 
The probes of the present invention are useful for a number of purposes. They can notably 

25 be used in Southern hybridization to genomic DNA. The probes can also be used to detect PCR 
amplification products. They may also be used to detect mismatches in the GENSET gene or 
mRNA using other techniques. They may also be used for in situ hybridization. 

Any of the polynucleotides, primers and probes of the present invention can be 
conveniently immobilized on a solid support. The solid support is not critical and can be selected 

30 by one skilled in the art. Thus, latex particles, microparticles, magnetic beads, non-magnetic beads 
(including polystyrene beads), membranes (including nitrocellulose strips), plastic tubes, walls of 
microtiter wells, glass or silicon chips, sheep (or other suitable animal's) red blood cells and 
duracytes are all suitable examples. Suitable methods for immobilizing nucleic acids on solid 
phases include ionic, hydrophobic, covalent interactions and the like. A solid support, as used 

35 herein, refers to any material which is insoluble, or can be made insoluble by a subsequent reaction. 
The solid support can be chosen for its intrinsic ability to attract and immobilize the capture reagent. 
Alternatively, the solid phase can retain an additional receptor which has the ability to attract and 
immobilize the capture reagent. The additional receptor can include a charged substance that is 
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oppositely charged with respect to the capture reagent itself or to a charged substance conjugated to 
the capture reagent. As yet another alternative, the receptor molecule can be any specific binding 
member which is immobilized upon (attached to) the solid support and which has the ability to 
immobilize the capture reagent through a specific binding reaction. The receptor molecule enables 
5 the indirect binding of the capture reagent to a solid support material before the performance of the 
assay or during the performance of the assay. The solid phase thus can be a plastic, derivatized 
plastic, magnetic or non-magnetic metal, glass or silicon surface of a test tube, mtcrotiter well, 
sheet, bead, microparticle, chip, sheep (or other suitable animal's) red blood cells, duracytes® and 
other configurations known to those of ordinary skill in the art. The polynucleotides of the 
10 invention can be attached to or immobilized on a solid support individually or in groups of at least 
2, 5, 8, 10, 12, 15, 20, or 25 distinct polynucleotides of the invention to a single solid support. In 
addition, polynucleotides other than those of the invention may be attached to the same solid 
support as one or more polynucleotides of the invention. 

15 Oligonucleotide arrav 

A substrate comprising a plurality of oligonucleotide primers or probes of the invention 
may be used either for detecting or amplifying targeted sequences in GENSET genes, may be used 
for detecting mutations in the coding or in the non-coding sequences of GENSET genes, and may 
also be used to determine GENSET gene expression in different contexts such as in different 

20 tissues, at different stages of a process (embryo development, disease treatment), and in patients 
versus healthy individuals as described elsewhere in the application. 

As used herein, the term "array" means a one dimensional, two dimensional, or 
multidimensional arrangement of nucleic acids of sufficient length to permit specific detection of 
gene expression. For example, the array may contain a plurality of nucleic acids derived from 

2S genes whose expression levels are to be assessed. The array may include a GENSET genomic 
DNA, a GENSET cDNA, sequences complementary thereto or fragments thereof. Preferably, the 
fragments are at least 12, 15, 18, 20, 25, 30, 35, 40 or 50 nucleotides in length. More preferably, the 
fragments are at least 1 00 nucleotides in length. Even more preferably, the fragments are more than 
100 nucleotides in length. In some embodiments the fragments may be more than 500 nucleotides 

30 in length. 

Any polynucleotide provided herein may be attached in overlapping areas or at random 
locations on the solid support. Alternatively the polynucleotides of the invention may be attached 
in an ordered array wherein each polynucleotide is attached to a distinct region of the solid support 
which does not overlap with the attachment site of any other polynucleotide. Preferably, such an 
35 ordered array of polynucleotides is designed to be "addressable" where the distinct locations are 
recorded and can be accessed as part of an assay procedure. Addressable polynucleotide arrays 
typically comprise a plurality of different oligonucleotide probes that are coupled to a surface of a 
substrate in different known locations. The knowledge of the precise location of each 
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polynucleotides location makes these "addressable" arrays particularly useful in hybridization 
assays. Any addressable array technology known in the art can be employed with the 
polynucleotides of the invention. One particular embodiment of these polynucleotide arrays is 
known as the Genechips^*^, and has been generally described in US Patent No. 5,143,854; PCT 
5 publications WO 90/1 5070 and 92/1 0092, which disclosures are hereby incorporated by reference in 
their entireties. These arrays may generally be produced using mechanical synthesis methods or 
light directed synthesis methods which incorporate a combination of photolithographic methods and 
solid phase oligonucleotide synthesis (Fodor et a/., 1991X which disclosure is hereby incorporated 
by reference in its entirety. The immobilization of arrays of oligonucleotides on solid supports has 

1 0 been rendered possible by the development of a technology generally identified as "Very Large 
Scale Immobilized Polymer Synthesis" (VLSIPS"^"^) in which, typically, probes are immobilized in 
a high density array on a solid surface of a chip. Examples of VLSIPS^*^ technologies are provided 
in US Patents 5,143,854; and 5,412,087 and in PCT Publications WO 90/15070, WO 92/10092 and 
WO 95/1 1995, which disclosures are hereby incorporated by reference in their entireties, which 

1 5 describe methods for forming oligonucleotide arrays through techniques such as light-directed 
synthesis techniques. In designing strategies aimed at providing arrays of nucleotides immobilized 
on solid supports, further presentation strategies were developed to order and display the 
oligonucleotide arrays on the chips in an attempt to maximize hybridization patterns and sequence 
information. Examples of such presentation strategies are disclosed in PCT Publications WO 

20 94/12305, WO 94/1 1530, WO 97/29212 and WO 97/31256, the disclosures of which are 
incorporated herein by reference in their entireties. 

Consequently, the invention concerns an array of nucleic acid molecules comprising at least 
one polynucleotide of the invention, particularly a probe or primer as described herein. Preferably, 
the invention concerns an array of nucleic acids comprising at least two polynucleotides of the 

25 invention, particularly probes or primers as described herein. Preferably, the invention concerns an 
array of nucleic acids comprising at least five polynucleotides of the invention, particularly probes 
or primers as described herein. 

A preferred embodiment of the present invention is an array of polynucleotides of at least 
12, 1 S, 1 8, 20, 25, 30, 35, 40, 50, 100 or 500 nucleotides in length which includes at least 1, 2, 5, 

30 10, 1 5, 20, 35, 50 or 1 00 sequences selected from the group consisting of the sequences of SEQ ID 
NOs: 1-169, 339-455, 561-784 and sequences of clone inserts of the deposited clone pool, sequences 
fully complementary thereto, and fragments thereof. 

Methods of making the polynucleotides of the invention 

The present invention also comprises methods of making the polynucleotides of the 
35 invention, including the polynucleotides of SEQ ID NOs:l-169, 339-455, 561-784, genomic DNA 
obtainable therefrom, or fragments thereof. These methods comprise sequentially linking together 
nucleotides to produce the nucleic acids having the preceding sequences. Polynucleotides of the 
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invention may be synthesized either enzymatically using techniques well known to those skilled in 
the art including amplification or hybridization-based methods as described herein, or chemically. 

A variety of chemical methods of synthesizing nucleic acids are known to those skilled in 
the art. In many of these methods, synthesis is conducted on a solid support. These included the 3' 
5 phosphoramidite methods in which the 3' terminal base of the desired oligonucleotide is 

immobilized on an insoluble carrier. The nucleotide base to be added is blocked at the S' hydroxy! 
and activated at the 3' hydroxyl so as to cause coupling with the immobilized nucleotide base. 
Deblocking of the new immobilized nucleotide compound and repetition of the cycle will produce 
the desired polynucleotide. Alternatively, polynucleotides may be prepared as described in U.S. 
10 Patent No. 5,049,656, which disclosure is hereby incorporated by reference in its entirety. In some 
embodiments, several polynucleotides prepared as described above are ligated together to generate 
longer polynucleotides having a desired sequence. 

Pol ypeptides of the invention 

The term " GENSET polypeptides" is used herein to embrace all of the proteins and 
15 polypeptides of the present invention. The present invention encompasses GENSET polypeptides, 
including recombinant, isolated or purified GENSET polypeptides consisting of, consisting 
essentially of, or comprising a sequence selected from the group consisting of SEQ ID NOs: 1 70- 
338, 456-560, 785-91 8 and the polypeptides encoded by human cDNAs contained in the deposited 
clones. Other objects of the invention are polypeptides encoded by the polynucleotides of the 
20 invention as well as fusion polypeptides comprising such polypeptides. 

Polypeptide variants 

The present invention further provides for GENSET polypeptides encoded by allelic and 
splice variants, orthologs, and/or species homologues. Procedures known in the art can be used to 
obtain, allelic variants, splice variants, orthologs, and/or species homologues of polynucleotides 

25 encoding by polypeptides of the group consisting of SEQ ID NOs: 1 70-338, 456-560, 785-9 1 8 and 
polypeptides encoded by the clone inserts of the deposited clone pool, using information from the 
sequences disclosed herein or the clones deposited with the ATCC. 

The polypeptides of the present invention also include polypeptides having an amino acid 
sequence at least 50% identical, more preferably at least 60% identical, and still more preferably 

30 70%, 80%, 90%, 95%, 96%, 97%, 98% or 99% identical to a polypeptide selected from the group 
consisting of the sequences of SEQ ID NOs: 170-338, 456-560, 785-91 8 and those encoded by the 
clone inserts of the deposited clone pool. By a polypeptide having an amino acid sequence at least, 
for example, 95% "identical" to a query amino acid sequence of the present invention, it is intended 
that the amino acid sequence of the subject polypeptide is identical to the query sequence except 

35 that the subject polypeptide sequence may include up to five amino acid alterations per each 100 
amino acids of the query amino acid sequence. In other words, to obtain a polypeptide having an 
amino acid sequence at least 95% identical to a query amino acid sequence, up to 5% (5 of 100) of 
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the amino acid residues in the subject sequence may be inserted, deleted, (indels) or substituted with 
another amino acid. 

Further polypeptides of the present invention include polypeptides which have at least 90% 
similarity, more preferably at least 95% similarity, and still more preferably at least 96%, 97%, 98% 
S or 99% similarity to those described above. By a polypeptide having an amino acid sequence at 
least, for example, 95% "similar" to a query amino acid sequence of the present invention, it is 
intended that the amino acid sequence of the subject polypeptide is similar (i.e. contains identical or 
equivalent amino acid residues) to the query sequence except that the subject polypeptide sequence 
may include up to five amino acid alterations per each 100 amino acids of the query amino acid 
10 sequence. In other words, to obtain a polypeptide having an amino acid sequence at least 95% 
similar to a query amino acid sequence, up to 5% (5 of ICQ) of the amino acid residues in the 
subject sequence may be inserted, deleted, (indels) or substituted with another non-equivalent 
amino acid. 

These alterations of the reference sequence may occur at the amino or carboxy terminal 
15 positions of the reference amino acid sequence or anywhere between those terminal positions, 
interspersed either individually among residues in the reference sequence or in one or more 
contiguous groups within the reference sequence. The query sequence may be an entire amino acid 
sequence selected from the group consisting of sequences of SEQ ID NOs: 170-338, 456-560, 785- 
9 1 8 and those encoded by the clone inserts of the deposited clone pool or any fragment specified as 
20 described herein. 

The variant polypeptides described herein are included in the present invention regardless 
of whether they have their normal biological activity. This is because even where a particular 
polypeptide molecule does not have biological activity, one of skill in the art would still know how 
to use the polypeptide, for instance, as a vaccine or to generate antibodies. Other uses of the 

25 polypeptides of the present invention that do not have GENSET biological activity include, inter 
alia, as epitope tags, in epitope mapping, and as molecular weight markers on SDS-PAGE gels or 
on molecular sieve gel filtration columns using methods known to those of skill in the art. As 
described below, the polypeptides of the present invention can also be used to raise polyclonal and 
monoclonal antibodies, which are useful in assays for detecting GENSET protein expression or as 

30 agonists and antagonists capable of enhancing or inhibiting GENSET protein function. Further, 
such polypeptides can be used in the yeast two-hybrid system to "capture" GENSET protein binding 
proteins, which are also candidate agonists and antagonists according to the present invention {see, 
eg.. Fields et al, 1989, which disclosure is hereby incorporated by reference in its entirety). 

Preparation of the polypeptides of the invention 

35 The polypeptides of the present invention can be prepared in any suitable manner. Such 

polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a combination of 
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these methods. The polypeptides of the present invention are preferably provided in an isolated 
form, and may be partially or preferably substantially purified. 

Consequently, the present invention also comprises methods of making the polypeptides of 
the invention, particularly polypeptides encoded by the cDNAs of SEQ ID NOs: 1-169, 339-455, 
5 561-784 or by the clone inserts of the deposited clone pool, genomic DN A obtainable therefrom, or 
fragments thereof and methods of making the polypeptides of SEQ ID NOs: 170-338, 456-560, 785- 
918 or fragments thereof. The methods comprise sequentially linking together amino acids to 
produce the nucleic polypeptides having the preceding sequences. In some embodiments, the 
polypeptides made by these methods are 150 amino acids or less in length. In other embodiments, 
10 the polypeptides made by these methods are 120 amino acids or less in length. 

Isolation 

From natural sources 

The GENSET proteins of the invention may be isolated from natural sources, including 
bodily fluids, tissues and cells, whether directly isolated or cultured cells, of humans or non-human 

1 5 animals. Methods for extracting and purifying natural proteins are known in the art, and include the 
use of detergents or chaotropic agents to disrupt particles followed by differential extraction and 
separation of the polypeptides by ion exchange chromatography, affinity chromatography, 
sedimentation according to density, and gel electrophoresis. See, for example, "Methods in 
Enzymology, Academic Press, 1993" for a variety of methods for purifying proteins, which 

20 , disclosure is hereby incorporated by reference in its entirety. Polypeptides of the invention also can 
be purified from natural sources using antibodies directed against the polypeptides of the invention, 
such as those described herein, in methods which are well known in the art of protein purification. 

From recombinant sources 

Preferably, the GENSET polypeptides of the invention are recombinantly produced using 
25 routine expression methods known in the art. The polynucleotide encoding the desired polypeptide 
is operably linked to a promoter into an expression vector suitable for any convenient host. Both 
eukaryotic and prokaryotic host systems are used in forming recombinant polypeptides. The 
polypeptide is then isolated from lysed cells or from the culture medium and purified to the extent 
needed for its intended use. 
30 Any GENSET polynucleotide, including those described in SEQ ID NOs: 1-169, 339-455, 

561-784, those of clone inserts of the deposited clone pool, and allelic variants thereof may be used 
to express GENSET polypeptides. The nucleic acid encoding the GENSET polypeptide to be 
expressed is operably linked to a promoter in an expression vector using conventional cloning 
technology. The GENSET insert in the expression vector may comprise the full coding sequence 
35 for the GENSET protein or a portion thereof. For example, the GENSET derived insert may encode 
a polypeptide comprising at least 6, 8, 10, 12, 15,20,25. 30,35,40, 50, 60, 75, 100, 150 or 200 
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consecutive amino acids of a GENSET protein selected from the group consisting of sequences of 
SEQ ID NOs: 170-338, 456-560, 785-918 and polypeptides encoded by the clone inserts of the 
deposited clone pool. 

Consequently, a further embodiment of the present invention is a method of making a 
5 polypeptide comprising a protein selected from the group consisting of sequences of SEQ ID 
NOs: 1 70-338, 456-560, 785-91 8 and polypeptides encoded by the clone inserts of the deposited 
clone pool, said method comprising the steps of: 

a) obtaining a cDNA comprising a sequence selected from the group consisting of i) the 
sequences SEQ ID NOs: 1-169, 339-455, 561-784, ii) the sequences of clone inserts of the deposited 

10 clone pool one, iii) sequences encoding one of the polypeptide of SEQ ID NOs:170-338, 456-560, 
785-91 8, and iv) sequences of polynucleotides encoding a polypeptide which is encoded by one of 
the clone insert of the deposited clone pool; 

b) inserting said cDNA in an expression vector such that the cDNA is operably linked to a 
promoter; and 

15 c) introducing said expression vector into a host cell v^hereby said host cell produces said 

polypeptide. 

In one aspect of this embodiment, the method further comprises the step of isolating the 
polypeptide. Another embodiment of the present invention is a polypeptide obtainable by the 
method described in the preceding paragraph. 

20 The expression vector is any of the mammalian, yeast, insect or bacterial expression 

systems known in the art. Commercially available vectors and expression systems are available 
from a variety of suppliers including Genetics Institute (Cambridge, MA), Stratagene (La Jolla, 
California), Promega (Madison, Wisconsin), and Invitrogen (San Diego, California). If desired, to 
enhance expression and facilitate proper protein folding, the codon context and codon pairing of the 

25 sequence is optimized for the particular expression organism in which the expression vector is 
introduced, as explained in U.S. Patent No. 5,082,767, which disclosure is hereby incorporated by 
reference in its entirety. 

In one embodiment, the entire coding sequence of a GENSET cDNA and the 3*UTR 
through the poly A signal of the cDNA is operably linked to a promoter in the expression vector. 

30 Alternatively, if the nucleic acid encoding a portion of the GENSET protein lacks a methionine to 
serve as the initiation site, an initiating methionine can be introduced next to the first codon of the 
nucleic acid using conventional techniques. Similarly, if the insert from the GENSET cDNA lacks 
a poly A signal, this sequence can be added to the construct by, for example, splicing out the Poly A 
signal from pSG5 (Stratagene) using Bgll and Sail restriction endonuclease enzymes and 

35 incorporating it into the mammalian expression vector pXTl (Stratagene). pXTl contains the LTRs 
and a portion of the gag gene from Moloney Murine Leukemia Virus. The position of the LTRs in 
the construct allow efficient stable transfection. The vector includes the Herpes Simplex Thymidine 
Kinase promoter and the selectable neomycin gene. The nucleic acid encoding the GENSET 
protein or a portion thereof is obtained by PGR from a vector containing a GENSET cDNA selected 
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from the group consisting of the sequences of SEQ ID NOs: 1-169, 339-455, 561-784 and the clone 
inserts of the deposited clone pool using oligonucleotide primers complementary to the GENSET 
cDNA or portion thereof and containing restriction endonuclease sequences for Pst 1 incorporated 
into the 5' primer and Bglll at the 5' end of the corresponding cDNA 3' primer, taking care to ensure 
5 that the sequence encoding the GENSET protein or a portion thereof is positioned properly with 
respect to the poly A signal. The purified fragment obtained from the resulting PCR reaction is 
digested with PstI, blunt ended with an exonuclease, digested with Bgl 11, purified and ligated to 
pXTl, now containing a poly A signal and digested with Bglll. 

In another embodiment, it is often advantageous to add to the recombinant polynucleotide 
1 0 additional nucleotide sequence which codes for secretory or leader sequences, pro-sequences, 
sequences which aid in purification, such as multiple histidine residues, or an additional sequence 
for stability during recombinant production. 

As a control, the expression vector lacking a cDNA insert is introduced into host cells or 
organisms. 

15 Transfection of a GENSET expression vector into mouse NTH 3T3 cells is but one 

embodiment of introducing polynucleotides into host cells. Introduction of a polynucleotide 
encoding a polypeptide into a host cell can be effected by calcium phosphate transfection, 
DEAE-dextran mediated transfection, cationic lipid-mediated transfection, electroporation, 
transduction, infection, or other methods. Such methods are described in many standard laboratory 

20 manuals, such as Davis et al, (1 986), which disclosure is hereby incorporated by reference in its 
entirety. It is specifically contemplated that the polypeptides of the present invention may in fact be 
expressed by a host cell lacking a recombinant vector. 

Recombinant cell extracts, or proteins from the culture medium if the expressed polypeptide 
is secreted, are then prepared and proteins separated by gel electrophoresis. If desired, the proteins 

25 may be ammonium sulfate precipitated or separated based on size or charge prior to electrophoresis. 
The proteins present are detected using techniques such as Coomassie or silver staining or using 
antibodies against the protein encoded by the GENSET cDNA of interest. Coomassie and silver 
staining techniques are familiar to those skilled in the art. 

Proteins fi'om the host cells or organisms containing an expression vector which contains 

30 the GENSET cDNA or a fragment thereof are compared to those from the control cells or organism. 
The presence of a band from the cells containing the expression vector which is absent in control 
cells indicates that the GENSET cDNA is expressed. Generally, the band corresponding to the 
protein encoded by the GENSET cDNA will have a mobility near that expected based on the 
number of amino acids in the open reading frame of the cDNA. However, the band may have a 

35 mobility different than that expected as a result of modifications such as glycosylation, 
ubiquitination, or enzymatic cleavage. 

Alternatively, the GENSET polypeptide to be expressed may also be a product of 
transgenic animals, i.e., as a component of the milk of transgenic cows, goats, pigs or sheep which 
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are characterized by somatic or germ cells containing a nucleotide sequence encoding the protein of 
interest. 

A polypeptide of this invention can be recovered and purified from recombinant cell 
cultures by well-known methods including differential extraction, ammonium sulfate or ethanol 
5 precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity chromatography, 
hydroxylapatite chromatography and lectin chromatography. See, for example, "Methods in 
Enzymology", supra for a variety of methods for purifying proteins. Most preferably, high 
performance liquid chromatography ("HPLC") is employed for purification. A recombinantly 

1 0 produced version of a GENSET polypeptide can be substantially purified using techniques 
described herein or otherwise known in the art, such as, for example, by the one-step method 
described in Smith and Johnson (1988), which disclosure is hereby incorporated by reference in its 
entirety. Polypeptides of the invention also can be purified from recombinant sources using 
antibodies directed against the polypeptides of the invention, such as those described herein, in 

1 5 methods which are well known in the art of protein purification. 

Preferably, the recombinantly expressed GENSET polypeptide is purified using standard 
immunochromatography techniques such as the one described in the section entitled 
"Immunoaflfinity Chromatography". In such procedures, a solution containing the protein of 
interest, such as the culture medium or a cell extract, is applied to a column having antibodies 

20 against the protein attached to the chromatography matrix. The recombinant protein is allowed to 
bind the immunochromatography column. Thereafter, the column is washed to remove non- 
specificaily bound proteins. The specifically bound secreted protein is then released from the 
column and recovered using standard techniques. 

If antibody production is not possible, the GENSET cDNA sequence or fragment thereof 

25 may be incorporated into expression vectors designed for use in purification schemes employing 
chimeric polypeptides. In such strategies the coding sequence of the GENSET cDNA or fragment 
thereof is inserted in frame with the gene encoding the other half of the chimera. The other half of 
the chimera may be beta-globin or a nickel binding polypeptide encoding sequence. A 
chromatography matrix having antibody to beta-globin or nickel attached thereto is then used to 

30 purify the chimeric protein. Protease cleavage sites may be engineered between the beta-globin 
gene or the nickel binding polypeptide and the GENSET cDNA or fragment thereof. Thus, the two 
polypeptides of the chimera may be separated from one another by protease digestion. 

One useful expression vector for generating beta-globin chimerics is pSG5 (Stratagene), 
which encodes rabbit beta-globin. Intron 11 of the rabbit beta-globin gene facilitates splicing of the 

35 expressed transcript, and the polyadenylation signal incorporated into the construct increases the 
level of expression. These techniques as described are well known to those skilled in the art of 
molecular biology. Standard methods are published in methods texts such as Davis et al, (1986) 
and many of the methods are available from Stratagene, Life Technologies, Inc., or Promega. 
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Polypeptide may additionally be produced from the construct using in vitro translation systems such 
as the In vitro Express^^ Translation Kit (Stratagene). 

Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non-glycosylated. In addition, 
5 polypeptides of the invention may also include an initial modified methionine residue, in some 
cases as a result of host-mediated processes. Thus, it is well known in the art that the N-terminal 
methionine encoded by the translation initiation codon generally is removed with high efficiency 
from any protein after translation in all eukaryotic cells. While the N-terminal methionine on most 
proteins also is efficiently removed in most prokaryotes, for some proteins, this prokaryotic removal 
1 0 process is inefficient, depending on the nature of the amino acid to which the N-terminal 
methionine is covalently linked. 

From chemical synthesis 

In addition, polypeptides of the invention, especially short protein fragments, can be 
chemically synthesized using techniques known in the art (See, e.g., Creighton, 1983; and 

1 5 Hunkapiller et aL^ 1 984), which disclosures are hereby incorporated by reference in their entireties. 
For example, a polypeptide corresponding to a fragment of a polypeptide sequence of the invention 
can be synthesized by use of a peptide synthesizer. A variety of methods of making polypeptides 
are known to those skilled in the art, including methods in which the carboxyl terminal amino acid 
is bound to polyvinyl benzene or another suitable resin. The amino acid to be added possesses 

20 blocking groups on its amino moiety and any side chain reactive groups so that only its carboxyl 
moiety can react. The carboxyl group is activated with carbodiimide or another activating agent 
and allowed to couple to the immobilized amino acid. After removal of the blocking group, the 
cycle is repeated to generate a polypeptide having the desired sequence. Alternatively, the methods 
described in U.S. Patent No. 5,049,656, which disclosure is hereby incorporated by reference in its 

25 entirety, may be used. 

Furthermore, if desired, nonclassical amino acids or chemical amino acid analogs can be 
introduced as a substitution or addition into the polypeptide sequence. Non-classical amino acids 
include, but are not limited to, to the D-isomers of the common amino acids, 2,4-diaminobutyric 
acid, a-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, g-Abu, e-Ahx, 6- 

30 amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine, 
norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine, t- 
butylalanine, phenylglycine, cyclohexylalanine, b-alanine, fluoroamino acids, designer amino acids 
such as b-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, and amino acid 
analogs in general. Furthermore, the amino acid can be D (dextrorotary) or L (levorotary). 

35 Modifications 

The invention encompasses polypeptides which are differentially modified during or after 
translation, e.g., by glycosylation, acelylation, phosphorylation, amidation, derivatization by knowm 
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protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other cellular 
ligand, etc. Any of numerous chemical modifications may be carried out by known techniques, 
including but not limited, to specific chemical cleavage by cyanogen bromide, trypsin, 
chymotrypsin, papain, V8 protease, NaBH4; acetylation, formylation, oxidation, reduction; 
S metabolic synthesis in the presence of tunicamycin; etc. 

Additional post-translational modifications encompassed by the invention include, for 
example, e.g., N-linked or 0-linked carbohydrate chains, processing of N-terminal or C-terminal 
ends), attachment of chemical moieties to the amino acid backbone, chemical modifications of 
N-linked or 0-linked carbohydrate chains, and addition or deletion of an N-terminal methionine 

10 residue as a result of prokaryotic host cell expression. The polypeptides may also be modified with 
a detectable label, such as an enzymatic, fluorescent, isotopic or affinity label to allow for detection 
and isolation of the protein. 

Also provided by the invention are chemically modified derivatives of the polypeptides of 
the invention which may provide additional advantages such as increased solubility, stability and 

1 5 circulating time of the polypeptide, or decreased immunogenicity. See U.S. Patent No: 4, 1 79,337. 
The chemical moieties for derivatization may be selected See U.S. Patent No: 4,1 79,337, which 
disclosure is hereby incorporated by reference in its entirety. The chemical moieties for 
derivatization may be selected from water soluble polymers such as polyethylene glycol, ethylene 
glycol/propylene glycol copolymers, carboxymethylcellulose, dextran, polyvinyl alcohol and the 

20 like. The polypeptides may be modified at random positions within the molecule, or at 

predetermined positions within the molecule and may include one, two, three or more attached 
chemical moieties. 

The polymer may be of any molecular weight, and may be branched or unbranched. For 
polyethylene glycol, the preferred molecular weight is between about I kDa and about 100 kDa (the 

25 term "about" indicating that in preparations of polyethylene glycol, some molecules will weigh 
more, some less, than the stated molecular weight) for ease in handling and manufacturing. Other 
sizes may be used, depending on the desired therapeutic profile (e.g., the duration of sustained 
release desired, the effects, if any on biological activity, the ease in handling, the degree or lack of 
antigenicity and other known effects of the polyethylene glycol to a therapeutic protein or analog). 

30 The polyethylene glycol molecules (or other chemical moieties) should be attached to the 

protein with consideration of effects on functional or antigenic domains of the protein. There are a 
number of attachment methods available to those skilled in the art, e.g., EP 0 401 384, (coupling 
PEG to G-CSF), and Malik et al. (1992) (reporting pegylation of GM-CSF using tresyl chloride), 
which disclosures are hereby incorporated by reference in their entireties. For example, 

35 polyethylene glycol may be covalently bound through amino acid residues via a reactive group, 
such as, a free amino or carboxyl group. Reactive groups are those to which an activated 
polyethylene glycol molecule may be bound. The amino acid residues having a free amino group 
may include lysine residues and the N-terminal amino acid residues; those having a fi-ee carboxyl 
group may include aspartic acid residues glutamic acid residues and the C-terminal amino acid 
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residue. Sulfhydryl groups may also be used as a reactive group for attaching the polyethylene 
glycol molecules. Preferred for therapeutic purposes is attachment at an amino group, such as 
attachment at the N-terminus or lysine group. 

One may specifically desire proteins chemically modified at the N-terminus. Using 
5 polyethylene glycol as an illustration of the present composition, one may select from a variety of 
polyethylene glycol molecules (by molecular weight, branching, etc.), the proportion of 
polyethylene glycol molecules to protein (polypeptide) molecules in the reaction mix, the type of 
pegylation reaction to be performed, and the method of obtaining the selected N-terminally 
pegylated protein. The method of obtaining the N-terminally pegylated preparation (i.e., separating 

10 this moiety from other monopegylated moieties if necessary) may be by purification of the 
N-terminally pegylated material fi-om a population of pegylated protein molecules. Selective 
proteins chemically modified at the N-terminus modification may be accomplished by reductive 
alkylation, which exploits differential reactivity of different types of primary amino groups (lysine 
versus the N-terminal) available for derivatization in a particular protein. Under the appropriate 

15 reaction conditions, substantially selective derivatization of the protein at the N-terminus with a 
carbonyl group containing polymer is achieved. 

Multimerization 

The polypeptides of the invention may be in monomers or multimers (i.e., dimers, trimers, 
tetramers and higher multimers). Accordingly, the present invention relates to monomers and 

20 multimers of the polypeptides of the invention, their preparation, and compositions containing them. 
In specific embodiments, the polypeptides of the invention are monomers, dimers, trimers or 
tetramers. In additional embodiments, the multimers of the invention are at least dimers, at least 
trimers, or at least tetramers. 

Multimers encompassed by the invention may be homomers or heteromers. As used herein, 

25 the term "homomer", refers to a multimer containing only polypeptides corresponding to the amino 
acid sequences of SEQ ID NOs:170-338, 456-560, 785-918 or encoded by the clone inserts of the 
deposited clone pool (including fragments, variants, splice variants, and fusion proteins, 
corresponding to these polypeptides as described herein). These homomers may contain 
polypeptides having identical or different amino acid sequences. In a specific embodiment, a 

30 homomer of the invention is a multimer containing only polypeptides having an identical amino 
acid sequence. In another specific embodiment, a homomer of the invention is a multimer 
containing polypeptides having different amino acid sequences. In specific embodiments, the 
multimer of the invention is a homodimer (e.g., containing polypeptides having identical or 
difierent amino acid sequences) or a homotrimer (e.g., containing polypeptides having identical 

35 and/or different amino acid sequences). In additional embodiments, the homomenc multimer of the 
invention is at least a homodimer, at least a homotrimer, or at least a homotetramer. 

As used herein, the term "heteromer" refers to a multimer containing one or more 
heterologous polypeptides (i.e., polypeptides of different proteins) in addition to the polypeptides of 
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the invention. In a specific embodiment, the multimer of the invention is a heterodimer, a 
heterotrimer, or a heterotetramer. In additional embodiments, the heteromeric multimer of the 
invention is at least a heterodimer, at least a heterotrimer, or at least a heterotetramer. 

Multimers of the invention may be the result of hydrophobic, hydrophilic, ionic and/or 
5 covalent associations and/or may be indirectly linked, by for example, liposome formation. Thus, 
in one embodiment, multimers of the invention, such as, for example, homodimers or homotrimers, 
are formed when polypeptides of the invention contact one another in solution. In another 
embodiment, heteromultimers of the invention, such as, for example, heterotrimers or 
heterotetramers, are formed when polypeptides of the invention contact antibodies to the 

1 0 polypeptides of the invention (including antibodies to the heterologous polypeptide sequence in a 
fusion protein of the invention) in solution. In other embodiments, multimers of the invention are 
formed by covalent associations with and/or between the polypeptides of the invention. Such 
covalent associations may involve one or more amino acid residues contained in the polypeptide 
sequence (e.g., that recited in the sequence listing, or contained in the polypeptide encoded by a 

1 5 deposited clone). In one instance, the covalent associations are cross-linking between cysteine 
residues located within the polypeptide sequences, which interact in the native (i.e., naturally 
occurring) polypeptide. In another instance, the covalent associations are the consequence of 
chemical or recombinant manipulation. Alternatively, such covalent associations may involve one 
or more amino acid residues contained in the heterologous polypeptide sequence in a fusion protein 

20 of the invention. 

In one example, covalent associations are between the heterologous sequence contained in a 
fusion protein of the invention (see, e.g., US Patent Number 5,478,925, which disclosure is hereby 
incorporated by reference in its entirety). In a specific example, the covalent associations are 
between the heterologous sequence contained in an Fc fusion protein of the invention (as described 

25 herein). In another specific example, covalent associations of fusion proteins of the invention are 
between heterologous polypeptide sequence from another protein that is capable of forming 
covalently associated multimers, such as for example, oseteoprotegerin (see, e.g., International 
Publication No: WO 98/49305, the contents of which are herein incorporated by reference in its 
entirety). In another embodiment, two or more polypeptides of the invention are joined through 

30 peptide linkers. Examples include those peptide linkers described in U.S. Pat. No. 5,073,627 
(hereby incorporated by reference). Proteins comprising multiple polypeptides of the invention 
separated by peptide linkers may be produced using conventional recombinant DNA technology. 

Another method for preparing multimer polypeptides of the invention involves the use of 
polypeptides of the invention fused to a leucine zipper or isoleucine zipper polypeptide sequence. 

35 Leucine zipper and isoleucine zipper domains are polypeptides that promote multimerization of the 
proteins in which they are found. Leucine zippers were originally identified in several 
DNA-binding proteins, and have since been found in a variety of different proteins. (Landschuiz et 
al., 1 988). Among the known leucine zippers are naturally occurring peptides and derivatives 
thereof that dimerize or trimerize. Examples of leucine zipper domains suitable for producing 
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soluble multimeric proteins of the invention are those described in PCT application WO 94/10308, 
hereby incorporated by reference. Recombinant fusion proteins comprising a polypeptide of the 
invention fused to a polypeptide sequence that dimerizes or trimerizes in solution are expressed in 
suitable host cells, and the resulting soluble multimeric fusion protein is recovered from the culture 
5 supernatant using techniques known in the art. 

Trimeric polypeptides of the invention may offer the advantage of enhanced biological 
activity. Preferred leucine zipper moieties and isoleucine moieties are those that preferentially form 
trimers. One example is a leucine zipper derived from lung surfactant protein D (SPD), as 
described in Hoppe et ah (1994) and in U.S. patent application Ser, No. 08/446,922, which 

10 disclosure is hereby incorporated by reference in its entirety. Other peptides derived from naturally 
occurring trimeric proteins may be employed in preparing trimeric polypeptides of the invention. In 
another example, proteins of the invention are associated by interactions between Flag® 
polypeptide sequence contained in fusion proteins of the invention containing Flag® polypeptide 
sequence. In a further embodiment, associations proteins of the invention are associated by 

1 5 interactions between heterologous polypeptide sequence contained in Flag® fusion proteins of the 
invention and anti Flag® antibody. 

The multimers of the invention may be generated using chemical techniques known in the 
art. For example, polypeptides desired to be contained in the multimers of the invention may be 
chemically cross-linked using linker molecules and linker molecule length optimization techniques 

20 known in the art (see, e.g., US Patent Number 5,478,925, which is herein incorporated by reference 
in its entirety). Additionally, multimers of the invention may be generated using techniques known 
in the art to form one or more inter-molecule cross-links between the cysteine residues located 
within the sequence of the polypeptides desired to be contained in the multimer (see, e.g., US Patent 
Number 5,478,925, which is herein incorporated by reference in its entirety). Further, polypeptides 

25 of the invention may be routinely modified by the addition of cysteine or biotin to the C terminus or 
N-terminus of the polypeptide and techniques known in the art may be applied to generate 
multimers containing one or more of these modified polypeptides (see, e.g., US Patent Number 
5,478,925, which is herein incorporated by reference in its entirety). Additionally, other techniques 
known in the art may be applied to generate liposomes containing the polypeptide components 

30 desired to be contained in the multimer of the invention (see, e.g., US Patent Number 5,478,925, 
which is herein incorporated by reference in its entirety). 

Alternatively, multimers of the invention may be generated using genetic engineering 
techniques known in the art. In one embodiment, polypeptides contained in multimers of the 
invention are produced recombinantly using fusion protein technology described herein or 

35 otherwise known in the art (see, e.g., US Patent Number 5,478,925, which is herein incorporated by 
reference in its entirety). In a specific embodiment, polynucleotides coding for a homodimer of the 
invention are generated by ligating a polynucleotide sequence encoding a polypeptide of the 
invention to a sequence encoding a linker polypeptide and then further to a synthetic polynucleotide 
encoding the translated product of the polypeptide in the reverse orientation from the original 
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C-terminus to the N-terminus (lacking the leader sequence) (see, e.g., US Patent Number 5,478,925, 
which is herein incorporated by reference in its entirety). In another embodiment, recombinant 
techniques described herein or otherwise known in the art are applied to generate recombinant 
polypeptides of the invention which contain a transmembrane domain (or hydrophobic or signal 
5 peptide) and which can be incorporated by membrane reconstitution techniques into liposomes (see, 
e.g., US Patent Number 5,478,925, which is herein incorporated by reference in its entirety). 

Mutated polypeptides 

To improve or alter the characteristics of GENSET polypeptides of the present invention, 
protein engineering may be employed. Recombinant DNA technology known to those skilled in the 

10 art can be used to create novel mutant proteins or muteins including single or multiple amino acid 
substitutions, deletions, additions, or fusion proteins. Such modified polypeptides can show, e.g., 
increased/decreased biological activity or increased/decreased stability. In addition, they may be 
purified in higher yields and show better solubility than the corresponding natural polypeptide, at 
least under certain purification and storage conditions. Further, the polypeptides of the present 

15 invention may be produced as multimers including dimers, trimers and tetramers. Multimerization 
may be facilitated by linkers or recombinantly though heterologous polypeptides such as Fc regions. 

N- and C-terminal deletions 

It is known in the art that one or more amino acids may be deleted from the N-terminus or 
C-terminus without substantial loss of biological function. For instance, Ron etal, (1993), reported 

20 modified KGF proteins that had heparin binding activity even if 3, 8, or 27 N-terminal amino acid 
residues were missing. Accordingly, the present invention provides polypeptides having one or 
more residues deleted from the amino terminus of the polypeptides of SEQ ID NOs: 1 70-338, 456- 
560, 785-91 8 or that encoded by the clone inserts of the deposited clone pool. Similarly, many 
examples of biologically functional C-terminal deletion mutants are known. For instance, 

25 Interferon gamma shows up to ten times higher activities by deleting 810 amino acid residues from 
the C-terminus of the protein {See, e.g., Dobeli, et al 1988), which disclosure is hereby 
incorporated by reference in its entirety. Accordingly, the present invention provides polypeptides 
having one or more residues deleted from the carboxy terminus of the polypeptides shown of SEQ 
ID NOs:170-338, 456-560. 785-91 8 or encoded by the clone inserts of the deposited clone pool. 

30 The invention also provides polypeptides having one or more amino acids deleted from both the 
amino and the carboxyl termini as described below. 

Other mutations 

Other mutants in addition to N- and C-terminal deletion forms of the protein discussed 
above are included in the present invention. It also will be recognized by one of ordinary skill in 
35 the art that some amino acid sequences of the GENSET polypeptides of the present invention can be 
varied without significant effect of the structure or function of the protein. If such differences in 
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sequence are contemplated, it should be remembered that there will be critical areas on the protein 
which determine activity. Thus, the invention further includes variations of the GENSET 
polypeptides which show substantial GENSET polypeptide activity. Such mutants include 
deletions, insertions, inversions, repeats, and substitutions selected according to general rules 
S known in the art so as to have little effect on activity. For example, guidance concerning how to 
make phenotypically silent amino acid substitutions is provided. 

There are two main approaches for studying the tolerance of an amino acid sequence to 
change (see, Bowie ei al 1994, which disclosure is hereby incorporated by reference in its entirety). 
The first method relies on the process of evolution, in which mutations are either accepted or 

1 0 rejected by natural selection. 

The second approach uses genetic engineering to introduce amino acid changes at specific 
positions of a cloned gene and selections or screens to identify sequences that maintain 
functionality. These studies have revealed that proteins are surprisingly tolerant of amino acid 
substitutions. The studies indicate which amino acid changes are likely to be permissive at a certain 

1 5 position of the protein. For example, most buried amino acid residues require nonpolar side chains, 
whereas few features of surface side chains are generally conserved. Other such phenotypically 
silent substitutions are described by Bowie et al. (supra) and the references cited therein. 

Typically seen as conservative substitutions are the replacements, one for another, among 
the aliphatic amino acids Ala, Val, Leu and Phe; interchange of the hydroxy! residues Ser and Thr, 

20 exchange of the acidic residues Asp and Glu, substitution between the amide residues Asn and Gin, 
exchange of the basic residues Lys and Arg and replacements among the aromatic residues Phe, 
Tyr. Thus, the fragment, derivative, analog, or homologue of the polypeptide of the present 
invention may be, for example: (i) one in which one or more of the amino acid residues are 
substituted with a conserved or non-conserved amino acid residue (preferably a conserved amino 

25 acid residue) and such substituted amino acid residue may or may not be one encoded by the genetic 
code: or (ii) one in which one or more of the amino acid residues includes a substituent group: or 
(iii) one in which the GENSET polypeptide is fused with another compound, such as a compound to 
increase the half-life of the polypeptide (for example, polyethylene glycol): or (iv) one in which the 
additional amino acids are fused to the above form of the polypeptide, such as an IgG Fc fusion 

30 region peptide or leader or secretory sequence or a sequence which is employed for purification of 
the above form of the polypeptide or a pro-protein sequence. Such fragments, derivatives and 
analogs are deemed to be within the scope of those skilled in the art from the teachings herein. 

Thus, the GENSET polypeptides of the present invention may include one or more amino 
acid substitutions, deletions, or additions, either from natural mutations or human manipulation. As 

35 indicated, changes are preferably of a minor nature, such as conservative amino acid substitutions 
that do not significantly affect the folding or activity of the protein. The following groups of amino 
acids generally represent equivalent changes: (1) Ala, Pro, Gly, Glu, Asp, Gin, Asn, Ser, Thr; (2) 
Cys, Ser, Tyr, Thr; (3) Val, He, Uu, Met, Ala, Phe; (4) Lys, Arg, His; (5) Phe, Tyr, Trp, His. 
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A specific embodiment of a modified GENSET peptide molecule of interest according to 
the present invention, includes, but is not limited to, a peptide molecule which is resistant to 
proteolysis, is a peptide in which the -CONH- peptide bond is modified and replaced by a (CH2NH) 
reduced bond, a (NHCO) retro inverso bond, a (CH2-0) methylene-oxy bond, a (CH2-S) 
5 thiomethylene bond, a (CH2CH2) carba bond, a (C0-CH2) cetomethylene bond, a (CH0H-CH2) 
hydroxyethylene bond), a (N-N) bound, a E-alcene bond or also a -CH-CH- bond. The invention 
also encompasses a human GENSET polypeptide or a fragment or a variant thereof in which at least 
one peptide bond has been modified as described above. 

Amino acids in the GENSET proteins of the present invention that are essential for function 

10 can be identified by methods known in the art, such as site-directed mutagenesis or 

alanine-scanning mutagenesis (see, e.g., Cunningham e( aL 1989, which disclosure is hereby 
incorporated by reference in its entirety). The latter procedure introduces single alanine mutations 
at every residue in the molecule. The resuhing mutant molecules are then tested for biological 
activity using assays appropriate for measuring the function of the particular protein. Of special 

1 5 interest are substitutions of charged amino acids with other charged or neutral amino acids which 
may produce proteins with highly desirable improved characteristics, such as less aggregation. 
Aggregation may not only reduce activity but also be problematic when preparing pharmaceutical 
formulations, because aggregates can be immunogenic, (see, e.g., Pinckard et aL, 1967; Robbins, et 
aL, 1987; and Cleland, et aL, 1993). 

20 A further embodiment of the invention relates to a polypeptide which comprises the amino 

acid sequence of a GENSET polypeptide having an amino acid sequence which contains at least one 
conservative amino acid substitution, but not more than 50 conservative amino acid substitutions, 
not more than 40 conservative amino acid substitutions, not more than 30 conservative amino acid 
substitutions, and not more than 20 conservative amino acid substitutions. Also provided are 

25 polypeptides which comprise the amino acid sequence of a GENSET polypeptide, having at least 
one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or I conservative amino acid substitutions. 

Polypeptide fragments 

Structural definition 

The present invention is further directed to fragments of the amino acid sequences 
30 described herein such as the polypeptides of SEQ ID NOs: 1 70-338, 456-560, 785-918 or those 
encoded by the clone inserts of the deposited clone pool. More specifically, the present invention 
embodies purified, isolated, and recombinant polypeptides comprising at least 6, preferably at least 
8 to 10, more preferably 12, 15, 20, 25, 30, 35, 40, 50, 60, 75, 100, 125, 150, 175, 200, 225, 250, 
275, or 300 consecutive amino acids of a polypeptide selected from the group consisting of the 
35 sequences of SEQ ID NOs: 1 70-338, 456-560, 785-918, the polypeptides encoded by the clone 
inserts of the deposited clone pool, and other polypeptides of the present invention. 
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In addition to the above polypeptide fragments, further preferred sub-genuses of 
polypeptides comprise at least 6 amino acids, wherein "at least 6" is defined as any integer between 
6 and the integer representing the C-terminal amino acid of the polypeptide of the present invention 
including the polypeptide sequences of the sequence listing below. Further included are species of 
S polypeptide fragments at least 6 amino acids in length, as described above, that are further specified 
in terms of their N-terminal and C-terminal positions. However, included in the present invention 
as individual species are all polypeptide fragments, at least 6 amino acids in length, as described 
above, and may be particularly specified by a N-terminal and C4erminal position. That is, every 
combination of a N-terminal and C-terminal position that a fragment at least 6 contiguous amino 

10 acid residues in length could occupy, on any given amino acid sequence of the sequence listing or 
of the present invention is included in the present invention 

The present invention also provides for the exclusion of any fragment species specified by 
N-terminal and C-terminal positions or of any fragment sub-genus specified by size in amino acid 
residues as described above. Any number of fragments specified by N-terminal and C-terminal 

1 S positions or by size in amino acid residues as described above may be excluded as individual 
species. 

The above polypeptide fragments of the present invention can be immediately envisaged 
using the above description and are therefore not individually listed solely for the purpose of not 
unnecessarily lengthening the specification. Moreover, the above fragments need not have a 

20 GENSET biological activity, although polypeptides having these activities are preferred 
embodiments of the invention, since they would be useful, for example, in immunoassays, in 
epitope mapping, epitope tagging, as vaccines, and as molecular weight markers. The above 
fragments may also be used to generate antibodies to a particular portion of the polypeptide. These 
antibodies can then be used in immunoassays well known in the art to distinguish between human 

25 and non-human cells and tissues or to determine whether cells or tissues in a biological sample are 
or are not of the same type which express the polypeptides of the present invention. 

It is noted that the above species of polypeptide fragments of the present invention may 
alternatively be described by the formula ''a to b"; where "a" equals the N-terminal most amino acid 
position and "b" equals the C-terminal most amino acid position of the polynucleotide; and further 

30 where "a" equals an integer between 1 and the number of amino acids of the polypeptide sequence 
of the present invention minus 6, and where "b" equals an integer between 7 and the number of 
amino acids of the polypeptide sequence of the present invention; and where "a" is an integer 
smaller then "b" by at least 6. 

The present invention also provides for the exclusion of any species of polypeptide 

35 fragments of the present invention specified by 5' and 3' positions or sub-genuses of polypeptides 
specified by size in amino acids as described above. Any number of fragments specified by 5' and 
3' positions or by size in amino acids, as described above, may be excluded. 
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Domains 

Preferred polynucleotide fragments of the invention are domains of polypeptides of the 
invention. Such domains may eventually comprise linear or structural motifs and signatures 
5 including, but not limited to, leucine zippers, helix-tum-helix motifs, post-translational modification 
sites such as glycosylation sites, ubiquitination sites, alpha helices, and beta sheets, signal sequences 
encoding signal peptides which direct the secretion of the encoded proteins, sequences implicated in 
transcription regulation such as homeoboxes, acidic stretches, enzymatic active sites, substrate 
binding sites, and enzymatic cleavage sites. Such domains may present a particular biological 

10 activity such as DNA or RNA-binding, secretion of proteins, transcription regulation, enzymatic 
activity, substrate binding activity, etc. 

A domain has a size generally comprised between 3 and 1 000 amino acids. In a preferred 
embodiment, domains comprise a number of amino acids that is any integer between 6 and 200. 
Domains may be synthesized using any methods known to those skilled in the art, including those 

1 5 disclosed herein, particularly in the section entitled "Preparation of the polypeptides of the 

invention". Methods for determining the amino acids which make up a domain with a particular 
biological activity include mutagenesis studies and assays to determine the biological activity to be 
tested. 

Alternatively, the polypeptides of the invention may be scanned for motifs, domains and/or 

20 signatures in databases using any computer method known to those skilled in the art. Searchable 
databases include Prosite (Hofmann et ah, 1999; Bucher and Bairoch 1994), Pfam (Sonnhammer et 
al, 1997; Henikoff al, 2000; Bateman et al, 2000), Blocks (HenikofFe/ al, 2000), Print 
(Attwood et al, 1996), Prodom (Sonnhammer and Kahn, 1994; Corpet et al 2000), Sbase (Pongor 
et al. 1993; Murvai et al, 2000), Smart (Schultz et al, 1998), Dali/FSSP (Holm and Sander, 1996, 

25 1997 and 1999), HSSP (Sander and Schneider 1991), CATH (Orengo et al, 1997; Pearl etal, 
2000). SCOP (Murzin et al, 1995; Lo Conte et al, 2000), COG (Tatusov et al, 1997 and 2000), 
specific family databases and derivatives thereof (Nevill-Manning et al, 1998; Yona et al, 1999; 
Attwood et al, 2000), each of which disclosures are hereby incorporated by reference in their 
entireties. For a review on available databases, see issue 1 of volume 28 of Nucleic Acid Research 

30 (2000), which disclosure is hereby incorporated by reference in its entirety. 

The domains of the present invention preferably comprises 6 to 200 amino acids (i.e. any 
integer between 6 and 200, inclusive) of a polypeptide of the present invention. Also, included in 
the present invention are domain fragments between the integers of 6 and the full length GENSET 
sequence of the sequence listing. All combinations of sequences between the integers of 6 and the 

35 full-length sequence of a GENSET polypeptide are included. The domain fragments may be 

specified by either the number of contiguous amino acid residues (as a sub-genus) or by specific N- 
terminal and C-terminal positions (as species) as described above for the polypeptide fragments of 
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the present invention. Any number of domain fragments of the present invention may also be 
excluded in the same manner. 

Epitopes and Antibody Fusions: 

A preferred embodiment of the present invention is directed to epitope-bearing polypeptides 
S and epitope-bearing polypeptide fragments. These epitopes may be "antigenic epitopes" or both an 
"antigenic epitope" and an "immunogenic epitope". An "immunogenic epitope" is defined as a part 
of a protein that elicits an antibody response in vivo when the polypeptide is the immunogen. On 
the other hand, a region of polypeptide to which an antibody binds is defined as an "antigenic 
determinant" or "antigenic epitope." The number of immunogenic epitopes of a protein generally is 

10 less than the number of antigenic epitopes {See, e.g., Geysen, e( al., 1984), which disclosure is 
hereby incorporated by reference in its entirety. It is particularly noted that although a particular 
epitope may not be immunogenic, it is nonetheless useful since antibodies can be made to both 
immunogenic and antigenic epitopes. 

An epitope can comprise as few as 3 amino acids in a spatial conformation, which is unique 

1 5 to the epitope. Generally an epitope consists of at least 6 such amino acids, and more often at least 
8-10 such amino acids. In preferred embodiment, antigenic epitopes comprise a number of amino 
acids that is any integer between 3 and 50. Fragments which function as epitopes may be produced 
by any conventional means {See, e,g„ Houghten, 1985), also fijrther described in U.S. Patent No. 
4,63 1,2 1, which disclosures are hereby incorporated by reference in their entireties. Methods for 

20 determining the amino acids which make up an epitope include x-ray crystallography, 2- 

dimensional nuclear magnetic resonance, and epitope mapping, e.g., the Pepscan method described 
by Geysen et al. (1984); PCT Publication No. WO 84/03564; and PCT Publication No. WO 
84/03506, which disclosures are hereby incorporated by reference in their entireties. Another 
example is the algorithm of Jameson and Wolf, (1988) (said reference incorporated by reference in 

25 its entirety). The Jameson- Wolf antigenic analysis, for example, may be performed using the 
computer program PROTEAN, using default parameters (Version 4.0 Windows, DNASTAR, Inc., 
1 228 South Park Street Madison, WI. 

The epitope-bearing fragments of the present invention preferably comprise 6 to 50 amino 
acids (i.e. any integer between 6 and 50, inclusive) of a polypeptide of the present invention. Also, 

30 included in the present invention are antigenic fragments between the integers of 6 and the full 
length GENSET sequence of the sequence listing. All combinations of sequences between the 
integers of 6 and the full-length sequence of a GENSET polypeptide are included. The epitope- 
bearing fragments may be specified by either the number of contiguous amino acid residues (as a 
sub-genus) or by specific N-terminal and C-terminal positions (as species) as described above for 

35 the polypeptide fragments of the present invention. Any number of epitope-bearing fragments of 
the present invention may also be excluded in the same manner. 

Antigenic epitopes are useful, for example, to raise antibodies, including monoclonal 
antibodies that specifically bind the epitope (see, Wilson et al, 1984; and SutclifiFe, et at., 1983, 
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which disclosures are hereby incorporated by reference in their entireties). The antibodies are then 
used in various techniques such as diagnostic and tissue/cell identification techniques, as described 
herein, and in purification methods such as immunoaffinity chromatography. 

Similarly, immunogenic epitopes can be used to induce antibodies according to methods 
5 well known in the art (See, Sutcliffe et al., supra; Wilson et aL, supra; Chow et flr/.(1985); and 
Bittle, et a/., (1985), which disclosures are hereby incorporated by reference in their entireties). A 
preferred immunogenic epitope includes the natural GENSET protein. The immunogenic epitopes 
may be presented together with a carrier protein, such as an albumin, to an animal system (such as 
rabbit or mouse) or, if it is long enough (at least about 25 amino acids), without a carrier. However, 

1 0 immunogenic epitopes comprising as few as 8 to 1 0 amino acids have been shown to be sufficient 
to raise antibodies capable of binding to, at the very least, linear epitopes in a denatured polypeptide 
(e.g., in Western blotting.). 

Epitope-bearing polypeptides of the present invention are used to induce antibodies 
according to methods well known in the art including, but not limited to, in vivo immunization, in 

1 5 vitro immunization, and phage display methods (See, e.g., Sutcliffe, et al., supra, Wilson, et a/., 
supra, and Bittle, etal, supra). If in vivo immunization is used, animals may be immunized with 
free peptide; however, anti-peptide antibody titer may be boosted by coupling of the peptide to a 
macromolecular carrier, such as keyhole limpet hemacyanin (KLH) or tetanus toxoid. For instance, 
peptides containing cysteine residues may be coupled to a carrier using a linker such as 

20 -maleimidobenzoyl- N-hydroxysuccinimide ester (MBS), while other peptides may be coupled to 
carriers using a more general linking agent such as glutaraldehyde. Animals such as rabbits, rats 
and mice are immunized with either free or carrier-coupled peptides, for instance, by intraperitoneal 
and/or intradermal injection of emulsions containing about 100 ^gs of peptide or carrier protein and 
Freund's adjuvant. Several booster injections may be needed, for instance, at intervals of about two 

25 weeks, to provide a useful titer of anti-peptide antibody, which can be detected, for example, by 
ELISA assay using free peptide adsorbed to a solid surface. The titer of anti-peptide antibodies in 
serum from an immunized animal may be increased by selection of anti-peptide antibodies, for 
instance, by adsorption to the peptide on a solid support and elution of the selected antibodies 
according to methods well known in the art. 

30 As one of skill in the art will appreciate, and discussed above, the polypeptides of the 

present invention comprising an immunogenic or antigenic epitope can be fused to heterologous 
polypeptide sequences. For example, the polypeptides of the present invention may be fused with 
the constant domain of immunoglobulins (IgA, IgE, IgG, IgM), or portions thereof (CHI, CH2, 
CH3, any combination thereof including both entire domains and portions thereof) resulting in 

35 chimeric polypeptides. These fusion proteins facilitate purification, and show an increased half-life 
in vivo. This has been shown, e.g., for chimeric proteins consisting of the first two domains of the 
human CD4-polypeptide and various domains of the constant regions of the heavy or light chains of 
mammalian immunoglobulins {See, e.g., EPA 0,394,827; and Traunecker e/ a/., 1988, which 
disclosures are hereby incorporated by reference in their entireties). Fusion proteins that have a 
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disulfide-linked dimeric structure due to the IgG portion can also be more efficient in binding and 
neutralizing other molecules than monomeric polypeptides or fragments thereof alone (See, e.g., 
Fountoulakis el al., 1995, which disclosure is hereby incorporated by reference in its entirety). 
Nucleic acids encoding the above epitopes can also be recombined with a gene of interest as an 
5 epitope tag to aid in detection and purification of the expressed polypeptide. 

Additional fusion proteins of the invention may be generated through the techniques of 
gene-shuffling, motif-shuffling, exon-shuffling, or codon-shuffling (collectively refenred to as 
"DNA shuffling"). DNA shuffling may be employed to modulate the activities of polypeptides of 
the present invention thereby effectively generating agonists and antagonists of the polypeptides. 

10 See, for example, U.S. Patent Nos.: 5,605,793; 5,81 1,238; 5,834,252; 5,837,458; and Patten, e( al, 
(1997); Harayama, (1998); Hansson, et a/ (1999); and Lorenzo and Blasco, (1998). (Each of these 
documents are hereby incorporated by reference). In one embodiment, one or more components, 
motifs, sections, parts, domains, fragments, etc., of coding polynucleotides of the invention, or the 
polypeptides encoded thereby may be recombined with one or more components, motifs, sections, 

15 parts, domains, fragments, etc. of one or more heterologous molecules. 

The present invention further encompasses any combination of the polypeptide fragments 
listed in this section. 



Antibodies 
20 Definitions 

The present invention further relates to antibodies and T-cell antigen receptors (TCR), 
which specifically bind the polypeptides, and more specifically, the epitopes of the polypeptides of 
the present invention. The antibodies of the present invention include IgG (including IgGl, IgG2, 
IgG3, and IgG4), IgA (including IgAl and IgA2), IgD, IgE, or IgM, and IgY. The term "antibody" 

25 (Ab) refers to a polypeptide or group of polypeptides which are comprised of at least one binding 
domain, where a binding domain is formed from the folding of variable domains of an antibody 
molecule to form three-dimensional binding spaces with an internal surface shape and charge 
distribution complementary to the features of an antigenic determinant of an antigen, which allows 
an immunological reaction with the antigen. As used herein, the term "antibody" is meant to 

30 include whole antibodies, including single-chain whole antibodies, and antigen binding fragments 
thereof In a preferred embodiment the antibodies are human antigen binding antibody fragments of 
the present invention include, but are not limited to. Fab, Fab* F(ab)2 and F(ab')2, Fd, single-chain 
Fvs (scFv), single-chain antibodies, disulfide-linked Fvs (sdFv) and fragments comprising either a 
Vl or Vh domain. The antibodies may be from any animal origin including birds and mammals. 

35 Preferably, the antibodies are human, murine, rabbit, goat, guinea pig, camel, horse, or chicken. 

Antigen-binding antibody fragments, including single-chain antibodies, may comprise the 
variable region(s) alone or in combination with the entire or partial of the following: hinge region. 
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CHI , CH2, and CH3 domains. Also included in the invention are any combinations of variable 
region(s) and hinge region, CHI, CH2, and CHS domains. The present invention further includes 
chimeric, humanized, and human monoclonal and polyclonal antibodies, which specifically bind the 
polypeptides of the present invention. The present invention further includes antibodies that are 
S anti-id ioty pic to the antibodies of the present invention. 

The antibodies of the present invention may be monospeciflc, bispeciflc, and trispecific or 
have greater multispecificity. Multispecific antibodies may be specific for different epitopes of a 
polypeptide of the present invention or may be specific for both a polypeptide of the present 
invention as well as for heterologous compositions, such as a heterologous polypeptide or solid 

10 support material. See, e.g., WO 93/1 7715; WO 92/08802; WO 91/00360; WO 92/05793; Tutt, et al 
(1991); US Patents 5,573,920. 4,474,893, 5,601,819, 4,714,681, 4,925,648; Kostelny etal (1992), 
which disclosures are hereby incorporated by reference in their entireties. 

Antibodies of the present invention may be described or specified in terms of the epitope(s) 
or epitope-bearing portion(s) of a polypeptide of the present invention, which are recognized or 

1 5 specifically bound by the antibody. The antibodies may specifically bind a complete protein encoded 
by a nucleic acid of the present invention, or a fragment thereof. Therefore, the epitope(s) or epitope 
bearing polypeptide portion(s) may be specified as described herein, e.g., by N-terminal and C- 
terminal positions, by size in contiguous amino acid residues, or otherwise described herein 
(including the sequence listing). Antibodies which specifically bind any epitope or polypeptide of 

20 the present invention may also be excluded as individual species. Therefore, the present invention 
includes antibodies that specifically bind specified polypeptides of the present invention, and allows 
for the exclusion of the same. 

Thus, another embodiment of the present invention is a purified or isolated antibody 
capable of specifically binding to a polypeptide comprising a sequence selected from the group 

25 consisting of the sequences of SEQ ID NOs: 1 70-338, 456-560, 785-91 8 and the sequences of the 
clone inserts of the deposited clone pool. In one aspect of this embodiment, the antibody is capable 
of binding to an epitope-containing polypeptide comprising at least 6 consecutive amino acids, 
preferably at least 8 to 10 consecutive amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 
SO, or 100 consecutive amino acids of a sequence selected from the group consisting of SEQ ID 

30 NOs: 1 70-338, 456-560, 785-91 8 and sequences of the clone inserts of the deposited clone pool. 

Antibodies of the present invention may also be described or specified in terms of their 
cross-reactivity. Antibodies that do not specifically bind any other analog, ortholog, or homologue 
of the polypeptides of the present invention are included. Antibodies that do not bind polypeptides 
with less than 95%, less than 90%, less than 85%, less than 80%, less than 75%, less than 70%, less 

35 than 65%, less than 60%, less than 55%, and less than 50% identity (as calculated using methods 
known in the art and described herein, e.g., using FASTDB and the parameters set forth herein) to a 
polypeptide of the present invention are also included in the present invention. Further included in 
the present invention are antibodies, which only bind polypeptides encoded by polynucleotides, 
which hybridize to a polynucleotide of the present invention under stringent hybridization 
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conditions (as described herein). Antibodies of the present invention may also be described or 
specified in terms of their binding affinity. Preferred binding affinities include those with a 
dissociation constant or Kd less than 5X10*^M, IQ-^M, SXIO'^M, lO'^M, 5X10**M, lO'^M, SXIO'X 
lO'M, SXIQ-^H lO 'V, 5X10 ''M, lO '^M, SXIO 'X lO'^^M, SXIO'^^M, 10''H SXIO 'X 10' 
5 'XsXlO'Hand 10'''M. 

The invention also concerns a purified or isolated antibody capable of specifically binding 
to a mutated GENSET protein or to a fragment or variant thereof comprising an epitope of the 
mutated GENSET protein. 



10 Preparation of antibodies 

The antibodies of the present invention may be prepared by any suitable method known in 
the art. Some of these methods are described in more detail in the example entitled "Preparation of 
Antibody Compositions to the GENSET protein". For example, a polypeptide of the present 
invention or an antigenic fragment thereof can be administered to an animal in order to induce the 

1 5 production of sera containing "polyclonal antibodies". As used herein, the term "monoclonal 

antibody" is not limited to antibodies produced through hybridoma technology but it rather refers to 
an antibody that is derived from a single clone, including eukaryotic, prokaryotic, or phage clone, 
and not the method by which it is produced. Monoclonal antibodies can be prepared using a wide 
variety of techniques known in the art including the use of hybridoma, recombinant, and phage 

20 display technology. 

Hybridoma techniques include those known in the art (See, e.g., Harlow et al. 1988; 
Hammerling, et al, 1981). (Said references incorporated by reference in their entireties). Fab and 
F(ab')2 fragments may be produced, for example, from hybridoma-produced antibodies by 
proteolytic cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin (to 

25 produce F(ab')2 fragments). 

Alternatively, antibodies of the present invention can be produced through the application 
of recombinant DNA technology or through synthetic chemistry using methods known in the art. 
For example, the. antibodies of the present invention can be prepared using various phage display 
methods known in the art. In phage display methods, functional antibody domains are displayed on 

30 the surface of a phage particle, which carries polynucleotide sequences encoding them. Phage with 
a desired binding property are selected from a repertoire or combinatorial antibody library (e.g. 
human or murine) by selecting directly with antigen, typically antigen bound or captured to a solid 
surface or bead. Phage used in these methods are typically filamentous phage including fd and M 13 
with Fab, Fv or disulfide stabilized Fv antibody domains recombinantly fused to either the phage 

33 gene III or gene Vlll protein. Examples of phage display methods that can be used to make the 
antibodies of the present invention include those disclosed in Brinkman et al. (1995); Ames, ei aL 
(1995); Kettleborough, et aL (1994); Persic, et al (1997); Burton et al (1994); PCT/GB91/01 134; 
WO 90/02809; WO 91/10737; WO 92/01047; WO 92/18619; WO 93/1 1236; WO 95/15982; WO 
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95/20401; and US Patents 5,698,426, 5,223,409, 5,403,484, 5,580,717, 5,427,908, 5,750,753, 
5,821,047, 5,571,698, 5,427,908, 5,516,637, 5,780,225, 5,658,727 and 5,733,743 (said references 
incorporated by reference in their entireties). 

As described in the above references, after phage selection, the antibody coding regions 
5 from the phage can be isolated and used to generate whole antibodies, including human antibodies, 
or any other desired antigen binding fragment, and expressed in any desired host including 
mammalian cells, insect cells, plant cells, yeast, and bacteria. For example, techniques to 
recombinantly produce Fab, Fab' F(ab)2 and F(ab')2 fragments can also be employed using 
methods known in the art such as those disclosed in WO 92/22324; Mullinax et al (1992); and 
1 0 Sawai et al, ( 1 995); and Better et al, ( 1 988) (said references incorporated by reference in their 
entireties). 

Examples of techniques which can be used to produce single-chain Fvs and antibodies 
include those described in U.S. Patents 4,946,778 and 5,258,498; Huston etal. (1991); Shu et al 
(1993); and Skerra etal. (1988), which disclosures are hereby incorporated by reference in their 

15 entireties. For some uses, including in vivo use of antibodies in humans and in vitro detection 
assays, it may be preferable to use chimeric, humanized, or human antibodies. Methods for 
producing chimeric antibodies are known in the art. See e.g., Morrison, (198S); Oi et aL^ (1986); 
Gillies et o/. (1989); and US Patent 5,807,715, which disclosures are hereby incorporated by 
reference in their entireties. Antibodies can be humanized using a variety of techniques including 

20 CDR-grafting (EP 0 239 400; WO 91/09967; US Patent 5,530,101; and 5,585,089), veneering or 
resurfacing, (EP 0 592 106; EP 0 519 596; Padlan, 1991 ; Studnicka et al, 1994; Roguska et al, 
1994), and chain shuffling (US Patent 5,565,332), which disclosures are hereby incorporated by 
reference in their entireties. Human antibodies can be made by a variety of methods known in the 
art including phage display methods described above. See also, US Patents 4,444,887, 4,716,1 1 1, 

25 5,545,806, and 5,814,318; WO 98/46645; WO 98/50433; WO 98/24893; WO 96/34096; WO 
96/33735; and WO 91/10741 (said references incorporated by reference in their entireties). 

Further included in the present invention are antibodies recombinantly fused or chemically 
conjugated (including both covalently and non-covalently conjugations) to a polypeptide of the 
present invention. The antibodies may be specific for antigens other than polypeptides of the 

30 present invention. For example, antibodies of the present invention may be recombinantly fiised or 
conjugated to molecules useful as labels in detection assays and effector molecules such as 
heterologous polypeptides, drugs, or toxins. See, e.g., WO 92/08495; WO 91/14438; WO 89/1 2624; 
US Patent 5,314,995; and EP 0 396 387, which disclosures are hereby incorporated by reference in 
their entireties. Fused antibodies may also be used to target the polypeptides of the present 

35 invention to particular cell types, either in vitro or in vivo, by fusing or conjugating the polypeptides 
of the present invention to antibodies specific for particular cell surface receptors. Antibodies fused 
or conjugated to the polypeptides of the present invention may also be used in vitro immunoassays 
and purification methods using methods known in the art {See eg.. Harbor et al supra; WO 
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93/21232; BP 0 439 095; Naramura, M. et al. 1994; US Patent 5,474,981 ; Gillies ei al., 1992; Fell et 
aL, 1991) (said references incorporated by reference in their entireties). 

The present invention further includes compositions comprising the polypeptides of the 
present invention fused or conjugated to antibody domains other than the variable regions. For 
5 example, the polypeptides of the present invention may be fused or conjugated to an antibody Fc 
region, or portion thereof. The antibody portion fused to a polypeptide of the present invention may 
comprise the hinge region, CHI domain, CH2 domain, and CH3 domain or any combination of 
whole domains or portions thereof. The polypeptides of the present invention may be fused or 
conjugated to the above antibody portions to increase the in vivo half-life of the polypeptides or for 

10 use in immunoassays using methods known in the art. The polypeptides may also be fused or 

conjugated to the above antibody portions to form multimers. For example, Fc portions fused to the 
polypeptides of the present invention can form dimers through disulfide bonding between the Fc 
portions. Higher multimeric forms can be made by fusing the polypeptides to portions of IgA and 
IgM. Methods for fusing or conjugating the polypeptides of the present invention to antibody 

1 5 portions are known in the art. See e.g., US Patents 5,336,603, 5,622,929, 5.359,046, 5,349,053, 
5,447,851, 5,1 12,946; EP 0 307 434, EP 0 367 166; WO 96/04388, WO 91/06570; Ashkenazi etaL 
(1991); Zheng et al. (1995); and Vil et al, (1992) (said references incorporated by reference in their 
entireties). 

Non-human animals or mammals, whether wild-type or transgenic, which express a 
20 different species of GENSET than the one to which antibody binding is desired, and animals which 
do not express GENSET (i.e. a GENSET knock out animal as described herein) are particularly 
useful for preparing antibodies. GENSET knock out animals will recognize all or most of the 
exposed regions of a GENSET protein as foreign antigens, and therefore produce antibodies with a 
wider array of GENSET epitopes. Moreover, smaller polypeptides with only 10 to 30 amino acids 
25 may be useful in obtaining specific binding to any one of the GENSET proteins. In addition, the 
humoral immune system of animals which produce a species of GENSET that resembles the 
antigenic sequence will preferentially recognize the differences between the animal's native 
GENSET species and the antigen sequence, and produce antibodies to these unique sites in the 
antigen sequence. Such a technique will be particularly useful in obtaining antibodies that 
30 specifically bind to any one of the GENSET proteins. 

The antibodies of the invention may be labeled by, e.g., any one of the radioactive, 
fluorescent or enzymatic labels known in the art. 

Uses of pol ynucleotides 

35 Uses of polynucleotides as reagents 

The polynucleotides of the present invention, particularly those described in the 
''Oligonucleotide primers and probes" section, may be used as reagents in isolation procedures. 
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diagnostic assays, and forensic procedures. For example, sequences from the GENSET 
polynucleotides of the invention may be detectably labeled and used as probes to isolate other 
sequences capable of hybridizing to them. In addition, sequences from the GENSET 
polynucleotides of the invention may be used to design PGR primers to be used in isolation, 
5 diagnostic, or forensic procedures. 

In forensic analyses 

PGR primers may be used in forensic analyses, such as the DNA fingerprinting techniques 
described below. Such analyses may utilize detectable probes or primers based on the sequences of 
the polynucleotides of the invention. Gonsequently, the present invention encompasses methods of 
10 identification of an individual using the polynucleotides of the invention in forensic analyses, 
wherein said method includes the steps of: 

a) obtaining a biological sample containing nucleic acid material from an individual; 

b) obtaining an identification pattern for this individual using the polynucleotides of the 
invention, particularly using GENSET primers and probes; 

1 5 c) comparing said identification pattern with a reference identification pattern; and 

d) determining whether said identification pattern is identical to said reference identification 

pattern. 

In one embodiment of this method, the identification pattern consists in sequences of 
ampHcons obtained using GENSET primers as explained in the sections entitled "Forensic 
20 Matching by DNA Sequencing" and "Positive Identification by DNA Sequencing". 

In another embodiment, the identification pattern consists in unique band or dot patterns 
obtained using any method described in the sections entitled "Southern Blot Forensic 
Identification", "Dot Blot Identification Procedure" and "Alternative "Fingerprint" Identification 
Technique". 

25 Table I provides sets of related cDNAs of the invention, e.g. sequences that represent allelic 

variants of a single sequence. Such variants are especially useful for the herein-described forensic 
analyses, and are also useful as polymorphic markers to examine, e.g. associations between the 
herein-discussed GENSET genes and various diseases or conditions. 

Forensic Matching by DNA Sequencing 

30 In one exemplary method, DNA samples are isolated from forensic specimens of, for 

example, hair, semen, blood or skin ceils by conventional methods. A panel of PGR primers 
designed from different polynucleotides of the invention using any technique known to those skilled 
in the art including those described herein, is then utilized to amplify DNA of approximately 100- 
200 bases in length from the forensic specimen. Corresponding sequences are obtained from a test 

3S subject. Each of these identification DNAs is then sequenced using standard techniques, and a 
simple database comparison determines the differences, if any, between the sequences from the 
subject and those from the sample. Statistically significant differences between the suspect's DNA 
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sequences and those from the sample conclusively prove a lack of identity. This lack of identity can 
be proven, for example, with only one sequence. Identity, on the other hand, should be 
demonstrated with a large number of sequences, all matching. Preferably, a minimum of 50 
statistically identical sequences of 100 bases in length are used to prove identity between the 
5 suspect and the sample. 

Positive Identification by DNA Sequencing 

The "Forensic Matching by DNA Sequencing" technique described herein may also be used 
on a larger scale to provide a unique flngerprint-type identification of any individual. In this 
technique, primers are prepared from a large number of polynucleotides of the invention. 

10 Preferably, 20 to 50 different primers are used. These primers are used to obtain a corresponding 
number of PCR-generated DNA segments from the individual in question. Each of these DNA 
segments is sequenced. The database of sequences generated through this procedure uniquely 
identifies the individual from whom the sequences were obtained. The same panel of primers may 
then be used at any later time to absolutely correlate tissue or other biological specimen with that 

15 individual. 

Southern Blot Forensic Identification 

The "Positive Identification by DNA Sequencing" procedure described herein is repeated to 
obtain a panel of at least 10 amplified sequences from an individual and a specimen. Preferably, the 
panel contains at least 50 amplified sequences. More preferably, the panel contains 100 amplified 

20 sequences. In some embodiments, the panel contains 200 amplified sequences. This PCR- 
generated DNA is then digested with one or a combination of, preferably, four base specific 
restriction enzymes. Such enzymes are commercially available and known to those of skill in the 
art. After digestion, the resultant gene fragments are size separated in multiple duplicate wells on 
an agarose gel and transferred to nitrocellulose using Southern blotting techniques well known to 

25 those with skill in the art. For a review of Southern blotting see Davis etal. (1986), which 
disclosure is hereby incorporated by reference in its entirety. 

A panel of probes based on the sequences of the polynucleotides of the invention, or 
fragments thereof of at least 10 bases, are radioactively or colorimetrically labeled using methods 
known in the art, such as nick translation or end labeling, and hybridized to the Southern blot using 

30 techniques known in the art. Preferably, the probe comprises at least 12, 1 5, or 1 7 consecutive 
nucleotides from the polynucleotide of the invention. More preferably, the probe comprises at least 
20-30 consecutive nucleotides from the polynucleotide of the invention. In some embodiments, the 
probe comprises more than 30 nucleotides from the polynucleotide of the invention. In other 
embodiments, the probe comprises at least 40, at least 50, at least 75, at least 100, at least 1 SO, or at 

35 least 200 consecutive nucleotides from the polynucleotide of the invention. 

Preferably, at least 5 to 10 of these labeled probes are used, and more preferably at least 
about 20 or 30 are used to provide a unique pattern. The resultant bands appearing from the 
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hybridization of a large sample of polynucleotide of the invention will be a unique identifier. Since 
the restriction enzyme cleavage will be different for every individual, the band pattern on the 
Southern blot will also be unique. Increasing the number of cDNA probes will provide a 
statistically higher level of confidence in the identification since there will be an increased number 
S of sets of bands used for identification. 

Dot Blot Identification Procedure 

Another technique for identifying individuals using the polynucleotide sequences disclosed 
herein utilizes a dot blot hybridization technique. 

Genomic DNA is isolated from nuclei of subject to be identified. Oligonucleotide probes of 

10 approximately 30 bp in length are synthesized that correspond to at least 10, preferably 50 
sequences from the polynucleotide of the invention. The probes are used to hybridize to the 
genomic DNA through conditions known to those in the art. The oligonucleotides are end labeled 
with P^^ using polynucleotide kinase (Pharmacia). Dot Blots are created by spotting the genomic 
DNA onto nitrocellulose or the like using a vacuum dot blot manifold (BioRad, Richmond 

15 California). The nitrocellulose filter containing the genomic sequences is baked or UV linked to the 
filter, prehybridized and hybridized with labeled probe using techniques known in the art (Davis et 
al, 1986). The ^^P labeled DNA fragments are sequentially hybridized with successively stringent 
conditions to detect minimal differences between the 30 bp sequence and the DNA. 
Tetramethylammonium chloride is useflil for identifying clones containing small numbers of 

20 nucleotide mismatches (Wood et al, 1985). A unique pattern of dots distinguishes one individual 
from another individual. 

Alternative *Tingerprint" Identification Technique 

In a representative alternative fingerprinting procedure, the probes are derived from 
cDNAs. Preferably, a plurality of probes having sequences from different genes are used as 

25 follows. Polynucleotides containing at least 10 consecutive bases from these sequences can be used 
as probes. Preferably, the probe comprises at least 12, IS, or 17 consecutive nucleotides from the 
polynucleotide of the invention. More preferably, the probe comprises at least 20-30 consecutive 
nucleotides from the polynucleotide of the invention. In some embodiments, the probe comprises 
more than 30 nucleotides from the polynucleotide of the invention. In other embodiments, the 

30 probe comprises at least 40, at least 50, at least 75, at least 100, at least 150, or at least 200 
consecutive nucleotides from the polynucleotide of the invention. 

Oligonucleotides, generally 20-mers, are prepared from a large number, e.g. 50, 100, or 
200, of polynucleotides of the invention using commercially available oligonucleotide services such 
as Genset (Paris, France). Cell samples from the test subject are processed for DNA using 

35 techniques well known to those with skill in the art. The nucleic acid is digested with restriction 
enzymes such as EcoRI and Xbal. Following digestion, samples are applied to wells for 
electrophoresis. The procedure, as known in the art, may be modified to accommodate 
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polyacrylamide electrophoresis, however in this example, samples containing 5 ug of DNA are 
loaded into wells and separated on 0.8% agarose gels. The gels are transferred onto nitrocellulose 
using standard Southern blotting techniques. 

10 ng of each of the oligonucleotides are pooled and end-labeled with P^^. The 
S nitrocellulose is prehybridized with blocking solution and hybridized with the labeled probes. 
Following hybridization and washing, the nitrocellulose filter is exposed to X-Omat AR X-ray film. 
The resulting hybridization pattern will be unique for each individual. 

It is additionally contemplated within this example that the number of probe sequences used 
can be varied for additional accuracy or clarity. 

10 To find corresponding genomic DNA sequences 

The GENSET cDNAs of the invention may also be used to clone sequences located 
upstream of the cDNAs of the invention on the corresponding genomic DNA. Such upstream 
sequences may be capable of regulating gene expression, including promoter sequences, enhancer 
sequences, and other upstream sequences which influence transcription or translation levels. Once 
1 5 identified and cloned, these upstream regulatory sequences may be used in expression vectors 
designed to direct the expression of an inserted gene in a desired spatial, temporal, developmental, 
or quantitative fashion. 

Use of cDNAs or Fragments thereof to Clone Upstream Sequences from Genomic DNA 

Sequences derived from polynucleotides of the inventions may be used to isolate the 

20 promoters of the corresponding genes using chromosome walking techniques. In one chromosome 
walking technique, which utilizes the Genome Walker'"''^ kit available from Clontech, five complete 
genomic DNA samples are each digested with a different restriction enzyme which has a 6 base 
recognition site and leaves a blunt end. Following digestion, oligonucleotide adapters are ligated to 
each end of the resulting genomic DNA fragments. 

25 For each of the five genomic DNA libraries, a first PGR reaction is performed according to 

the manufacturer's instructions (which are incorporated herein by reference) using an outer adaptor 
primer provided in the kit and an outer gene specific primer. The gene specific primer should be 
selected to be specific for the polynucleotide of the invention of interest and should have a melting 
temperature, length, and location in the polynucleotide of the invention which is consistent with its 

30 use in PGR reactions. Each first PGR reaction contains 5ng of genomic DNA, 5 |il of lOX Tth 

reaction buffer, 0.2 mM of each dNTP, 0.2 each of outer adaptor primer and outer gene specific 
primer, 1.1 mM of Mg(0Ac)2, and 1 \i\ of the Tth polymerase SOX mix in a total volume of 50 |il. 
The reaction cycle for the first PGR reaction is as follows: 1 min at 94 degrees Celsius / 2 sec at 94 
degree Gelsius, 3 min at 72 degrees Gelsius (7 cycles) / 2 sec at 94 degrees Gelsius, 3 min at 67 

35 degrees Gelsius (32 cycles) / 5 min at 67 degrees Gelsius. 

The product of the first PGR reaction is diluted and used as a template for a second PGR 
reaction according to the manufacturer's instructions using a pair of nested primers which are 
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located internally on the amplicon resulting from the first PCR reaction. For example, 5 [i\ of the 
reaction product of the first PCR reaction mixture may be diluted ISO times. Reactions are made in 
a 50 ^1 volume having a composition identical to that of the first PCR reaction except the nested 
primers are used. The first nested primer is specific for the adaptor, and is provided with the 
5 GenomeWalker™ kit. The second nested primer is specific for the particular polynucleotide of the 
invention for which the promoter is to be cloned and should have a melting temperature, length, and 
location in the polynucleotide of the invention which is consistent with its use in PCR reactions. 
The reaction parameters of the second PCR reaction are as follows: 1 min at 94 degrees Celsius / 2 
sec at 94 degrees Celsius, 3 min at 72 degrees Celsius (6 cycles) / 2 sec at 94 degrees Celsius, 3 min 

10 at 67 degrees Celsius (25 cycles) / 5 min at 67 degrees Celsius 

The product of the second PCR reaction is purified, cloned, and sequenced using standard 
techniques. Alternatively, two or more human genomic DNA libraries can be constructed by using 
two or more restriction enzymes. The digested genomic DNA is cloned into vectors which can be 
converted into single stranded, circular, or linear DNA. A biotinylated oligonucleotide comprising 

15 at least 15 nucleotides fi*om the polynucleotide of the invention sequence is hybridized to the single 
stranded DNA. Hybrids between the biotinylated oligonucleotide and the single stranded DNA 
containing the polynucleotide of the invention sequence are isolated as described herein. 
Thereafter, the single stranded DNA containing the polynucleotide of the invention sequence is 
released from the beads and converted into double stranded DNA using a primer specific for the 

20 polynucleotide of the invention sequence or a primer corresponding to a sequence included in the 
cloning vector. The resulting double stranded DNA is transformed into bacteria. DNAs containing 
the GENSET polynucleotide sequences are identified by colony PCR or colony hybridization. 

Identification of Promoters in Cloned Upstream Sequences 

Once the upstream genomic sequences have been cloned and sequenced as described above, 
25 prospective promoters and transcription start sites within the upstream sequences may be identified 
by comparing the sequences upstream of the polynucleotides of the inventions with databases 
containing known transcription start sites, transcription factor binding sites, or promoter sequences. 

In addition, promoters in the upstream sequences may be identified using promoter reporter 
vectors as follows. The expression of the reporter gene will be detected when placed under the 
30 control of regulatory active polynucleotide fragments or variants of the GENSET promoter region 
located upstream of the first exon of the GENSET gene. Suitable promoter reporter vectors, into 
which the GENSET promoter sequences may be cloned include pSEAP-Basic, pSEAP-Enhancer, 
pPgal-Basic, ppgal-Enhancer, or pEGFP-1 Promoter Reporter vectors available from Clontech, or 
pGL2-basic or pGL3-basic promoterless luciferase reporter gene vector from Promega. Briefly, 
35 each of these promoter reporter vectors include multiple cloning sites positioned upstream of a 
reporter gene encoding a readily assayable protein such as secreted alkaline phosphatase, luciferase, 
beta-galactosidase, or green fluorescent protein. The sequences upstream the GENSET coding 
region are inserted into the cloning sites upstream of the reporter gene in both orientations and 
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introduced into an appropriate host cell. The level of reporter protein is assayed and compared to 
the level obtained from a vector which lacks an insert in the cloning site. The presence of an 
elevated expression level in the vector containing the insert with respect to the control vector 
indicates the presence of a promoter in the insert, if necessary, the upstream sequences can be 
5 cloned into vectors which contain an enhancer for increasing transcription levels from weak 
promoter sequences. A significant level of expression above that observed with the vector lacking 
an insert indicates that a promoter sequence is present in the inserted upstream sequence. 

Promoter sequence within the upstream genomic DNA may be further defined by site 
directed mutagenesis, linker scanning analysis, or other techniques familiar to those skilled in the 

10 art. For example, the boundaries of promoters may be further investigated by constructing nested S' 
and/or 3' deletions in the upstream DNA using conventional techniques such as Exonuclease III or 
appropriate restriction endonuclease digestion. The resulting deletion fragments can be inserted 
into the promoter reporter vector to determine whether the deletion has increased, reduced or 
illuminated promoter activity, such as described, for example, by Coles et al. (1998), the disclosure 

15 of which is incorporated herein by reference in its entirety. In this way, the boundaries of the 

promoters may be defined. If desired, potential individual regulatory sites within the promoter may 
be identified using site directed mutagenesis or linker scanning to obliterate potential transcription 
factor binding sites within the promoter individually or in combination. The effects of these 
mutations on transcription levels may be determined by inserting the mutations into cloning sites in 

20 promoter reporter vectors. This type of assay is well known to those skilled in the art and is 
described in WO 97/17359, US Patent No. 5,374,544; EP 582 796; US Patent No. 5,698,389; US 
5,643,746; US Patent No. 5,502,1 76; and US Patent 5,266,488; the disclosures of which are 
incorporated by reference herein in their entirety. 

The strength and the specificity of the promoter of each GENSET gene can be assessed 

25 through the expression levels of a detectable polynucleotide operably linked to the GENSET 
promoter in different types of cells and tissues. The detectable polynucleotide may be either a 
polynucleotide that specifically hybridizes with a predefined oligonucleotide probe, or a 
polynucleotide encoding a detectable protein, including a GENSET polypeptide or a fragment or a 
variant thereof. This type of assay is well known to those skilled in the art and is described in US 

30 Patent No. 5,502,1 76; and US Patent No. 5,266,488; the disclosures of which are incorporated by 
reference herein in their entirety. Some of the methods are discussed in more detail elsewhere in 
the application. 

The promoters and other regulatory sequences located upstream of the polynucleotides of 
the inventions may be used to design expression vectors capable of directing the expression of an 
35 inserted gene in a desired spatial, temporal, developmental, or quantitative manner. A promoter 
capable of directing the desired spatial, temporal, developmental, and quantitative patterns may be 
selected using the results of the expression analysis described herein. For example, if a promoter 
which confers a high level of expression in muscle is desired, the promoter sequence upstream of a 
polynucleotide of the invention derived from an mRNA which is expressed at a high level in muscle 
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may be used in the expression vector. Such vectors are described in more detail elsewhere in the 
application. 

Preferably, the desired promoter is placed near multiple restriction sites to facilitate the 
cloning of the desired insert downstream of the promoter, such that the promoter is able to drive 
5 expression of the inserted gene. The promoter may be inserted in conventional nucleic acid 
backbones designed for extrachromosomal replication, integration into the host chromosomes or 
transient expression. Suitable backbones for the present expression vectors include retroviral 
backbones, backbones from eukaryotic episomes such as SV40 or Bovine Papilloma Virus, 
backbones from bacterial episomes, or artificial chromosomes. 
1 0 Preferably, the expression vectors also include a polyA signal downstream of the multiple 

restriction sites for directing the polyadenylation of mRNA transcribed from the gene inserted into 
the expression vector. 

To find similar sequences 

Polynucleotides of the invention may be used to isolate and/or purify nucleic acids similar 
1 5 thereto using any methods well known to those skilled in the art including the techniques based on 
hybridization or on amplification described in this section. These methods may be used to obtain the 
genomic DNAs which encode the mRNAs from which the GENSET cDNAs are derived, mRNAs 
corresponding to GENSET cDNAs, or nucleic acids which are homologous to GENSET cDNAs or 
fragments thereof, such as variants, species homologues or orthologs. Thus, a plurality of cDNAs 
20 similar to GENSET polynucleotides may be provided as cDNA libraries for subsequent evaluation 
of the encoded proteins or used in diagnostic assays as described herein. cDNAs prepared by any 
method described therein may be subsequently engineered to obtain nucleic acids which include 
desired fragments of the cDNA using conventional techniques such as subcloning, PGR, or in vitro 
oligonucleotide synthesis. For example, nucleic acids which include only the coding sequences 
2S may be obtained using techniques known to those skilled in the art. Similarly, nucleic acids 
containing any other desired fragment of the coding sequences for the encoded protein may be 
obtained. 

Indeed, cDNAs of the present invention or fragments thereof may be used to isolate nucleic 
acids similar to cDNAs from a cDNA library or a genomic DN A library. Such cDNA libraries or 
30 genomic DNA libraries may be obtained from a commercial source or made using techniques 
familiar to those skilled in the art such as those described in PCT publication WO 00/37491, which 
disclosure is hereby incorporated by reference in its entirety. Examples of methods for obtaining 
nucleic acids similar to GENSET polynucleotides are described below. 

Hybridization-based methods 

35 Techniques for identifying cDNA clones in a cDNA library which hybridize to a given 

probe sequence are disclosed in Sambrook etaL, (1989) and in Hames and Higgins (1985), the 
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disclosures of which are incorporated herein by reference in their entireties. The same techniques 

may be used to isolate genomic DNAs. 

Briefly, cDNA or genomic DNA clones which hybridize to the detectable probe are 

identified and isolated for further manipulation as follows. Any polynucleotide fragment of the 
5 invention may be used as a probe, in particular those defined in the "Oligonucleotide primers and 

probes" section, A probe comprising at least 10 consecutive nucleotides from a GENSET cDNA or 

fragment thereof is labeled with a detectable label such as a radioisotope or a fluorescent molecule. 

Preferably, the probe comprises at least 12, 1 S, or 1 7 consecutive nucleotides from the cDN A or 

fragment thereof. More preferably, the probe comprises 20 to 30 consecutive nucleotides from the 
10 cDNA or fragment thereof. In some embodiments, the probe comprises more than 30 nucleotides 

from the cDN A or fragment thereof. 

Techniques for labeling the probe are well known and include phosphorylation with 

polynucleotide kinase, nick translation, in vitro transcription, and non radioactive techniques. The 

cDNAs or genomic DNAs in the library are transferred to a nitrocellulose or nylon filter and 
15 denatured. After blocking of non specific sites, the filter is incubated with the labeled probe for an 

amount of time sufficient to allow binding of the probe to cDNAs or genomic DNAs containing a 

sequence capable of hybridizing thereto. 

By varying the stringency of the hybridization conditions used to identify cDNAs or 

genomic DNAs which hybridize to the detectable probe, cDNAs or genomic DNAs having different 
20 levels of identity to the probe can be identified and isolated as described below. 

Stringent conditions 

"Stringent hybridization conditions" are defined as conditions in which only nucleic acids 
having a high level of identity to the probe are able to hybridize to said probe. These conditions may 
25 be calculated as follows: 

For probes between 14 and 70 nucleotides in length the melting temperature (Tm) is 
calculated using the formula: Tm=81 .5+16.6(log (Na+))+0.41 (fraction G+CH600/N) where N is 
the length of the probe. 

If the hybridization is carried out in a solution containing formamide, the melting 
30 temperature may be calculated using the equation: Tm=81 .5+16.6(log (Na+))+0.41 (fraction G+C)- 
(0.63% formamideH600/N) where N is the length of the probe. 

Prehybridization may be carried out in 6X SSC, 5X Denhardt's reagent, 0.5% SDS, 100 ^g 
denatured fragmented salmon sperm DNA or 6X SSC, 5X Denhardt's reagent, 0.5% SDS, 100 ^g 
denatured fragmented salmon sperm DNA, 50% formamide. The formulas for SSC and Denhardt's 
35 solutions are listed in Sambrook et al,, 1986. 

Hybridization is conducted by adding the detectable probe to the prehybridization solutions 
listed above. Where the probe comprises double stranded DNA, it is denatured before addition to 
the hybridization solution. The filter is contacted with the hybridization solution for a sufficient 
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period of time to allow the probe to hybridize to nucleic acids containing sequences complementary 
thereto or homologous thereto. For probes over 200 nucleotides in length, the hybridization may be 
carried out at 15-25°C below the Tm. For shorter probes, such as oligonucleotide probes, the 
hybridization may be conducted at 15-25''C below the Tm. Preferably, for hybridizations in 6X 

5 SSC, the hybridization is conducted at approximately 68°C. Preferably, for hybridizations in 50% 
formamide containing solutions, the hybridization is conducted at approximately 42^C. 

Following hybridization, the filter is washed in 2X SSC, 0.1% SDS at room temperature for 
15 minutes. The filter is then washed with O.IX SSC, 0.5% SDS at room temperature for 30 
minutes to 1 hour. Thereafter, the solution is washed at the hybridization temperature in O.IX SSC, 

10 0.5% SDS. A final wash is conducted in O.IX SSC at room temperature. 

Nucleic acids which have hybridized to the probe are identified by autoradiography or other 
conventional techniques. 

Low and moderate conditions 

Changes in the stringency of hybridization and signal detection are primarily accomplished 

1 5 through the manipulation of formamide concentration (lower percentages of formamide result in 
lowered stringency); salt conditions, or temperature. The above procedure may thus be modified to 
identify nucleic acids having decreasing levels of identity to the probe sequence. For example, the 
hybridization temperature may be decreased in increments of 5°C from 68**C to 42°C in a 
hybridization buffer having a sodium concentration of approximately IM. Following hybridization, 

20 the filter may be washed with 2X SSC, 0.5% SDS at the temperature of hybridization. These 
conditions are considered to be "moderate" conditions above SO^'C and "low" conditions below 
50^C. Alternatively, the hybridization may be carried out in buffers, such as 6X SSC, containing 
formamide at a temperature of 42°C. In this case, the concentration of formamide in the 
hybridization buffer may be reduced in 5% increments from 50% to 0% to identify clones having 

25 decreasing levels of identity to the probe. Following hybridization, the filter may be washed with 
6X SSC, 0.5% SDS at 50^C. These conditions are considered to be "moderate" conditions above 
25% formamide and "low** conditions below 25% formamide. cDNAs or genomic DNAs which 
have hybridized to the probe are identified by autoradiography or other conventional techniques. 
Note that variations in the above conditions may be accomplished through the inclusion 

30 and/or substitution of alternate blocking reagents used to suppress background in hybridization 
experiments. Typical blocking reagents include Denhardt's reagent, BLOTTO, heparin, denatured 
salmon 'sperm DNA, and commercially available proprietary formulations. The inclusion of specific 
blocking reagents may require modification of the hybridization conditions described above, due to 
problems with compatibility. 

35 Consequently, the present invention encompasses methods of isolating nucleic acids similar 

to the polynucleotides of the invention, comprising the steps of: 

a) contacting a collection of cDNA or genomic DNA molecules with a detectable probe 
comprising at least 1 2, 1 5, 1 8, 20, 23, 25, 28, 30, 35, 40 or 50 consecutive nucleotides of a sequence 
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selected from the group consisting of the sequences of SEQ ID NOs: M69, 339-455, 561-784, the 
sequences of clones inserts of the deposited clone pool and sequences complementary thereto under 
stringent, moderate or low conditions which permit said probe to hybridize to at least a cDNA or 
genomic DNA molecule in said collection; 
5 b) identifying said cDNA or genomic DNA molecule which hybridizes to said detectable 

probe; and 

c) isolating said cDNA or genomic DNA molecule which hybridized to said probe. 

PCR'based methods 

In addition to the above described methods, other protocols are available to obtain 
10 homologous cDNAs using GENSET cDNA of the present invention or fragment thereof as outlined 
in the following paragraphs. 

cDNAs may be prepared by obtaining mRNA from the tissue, cell, or organism of interest 
using mRNA preparation procedures utilizing polyA selection procedures or other techniques 
known to those skilled in the art. A first primer capable of hybridizing to the polyA tail of the 
1 5 mRNA is hybridized to the mRNA and a reverse transcription reaction is performed to generate a 
first cDNA strand. 

The term "capable of hybridizing to the polyA tail of said mRNA" refers to and embraces 
all primers containing stretches of thymidine residues, so-called oligo(dT) primers, that hybridize to 
the 3' end of eukaryotic poly(A)+ mRNAs to prime the synthesis of a first cDNA strand. 

20 Techniques for generating said oligo (dT) primers and hybridizing them to mRNA to subsequently 
prime the reverse transcription of said hybridized mRNA to generate a first cDNA strand are well 
known to those skilled in the art and are described in Current Protocols in Molecular Biology, John 
Wiley and Sons, Inc. 1 997 and Sambrook et al., 1989, Preferably, said oligo (dT) primers are 
present in a large excess in order to allow the hybridization of all mRNA 3'ends to at least one oligo 

25 (dT) molecule. The priming and reverse transcription steps are preferably performed between 37^C 
and 55°C depending on the type of reverse transcriptase used. Preferred oligo(dT) primers for 
priming reverse transcription of mRNAs are oligonucleotides containing a stretch of thymidine 
residues of sufficient length to hybridize specifically to the polyA tail of mRNAs, preferably of 12 
to 18 thymidine residues in length. More preferably, such oligo(T) primers comprise an additional 

30 sequence upstream of the poly(dT) stretch in order to allow the addition of a given sequence to the 
5'end of all first cDNA strands which may then be used to facilitate subsequent manipulation of the 
cDNA. Preferably, this added sequence is 8 to 60 residues in length. For instance, the addition of a 
restriction site in 5* of cDN As facilitates subcloning of the obtained cDNA. Ahematively, such an 
added 5' end may also be used to design primers of PCR to specifically amplify cDNA clones of 

35 interest. 

The first cDNA strand is then hybridized to a second primer. Any polynucleotide fragment 
of the invention may be used, and in particular those described in the "Oligonucleotide primers and 
probes" section. This second primer contains at least 1 0 consecutive nucleotides of a polynucleotide 
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of the invention. Preferably, the primer comprises at least 10, 12, 15, 17, 18, 20, 23, 25, or 28 
consecutive nucleotides of a polynucleotide of the invention. In some embodiments, the primer 
comprises more than 30 nucleotides of a polynucleotide of the invention. If it is desired to obtain 
cDNAs containing the full protein coding sequence, including the authentic translation initiation 
S site, the second primer used contains sequences located upstream of the translation initiation site. 
The second primer is extended to generate a second cONA strand complementary to the first cDNA 
strand. Alternatively, RT-PCR may be performed as described above using primers from both ends 
of the cDNA to be obtained. 

The double stranded cDNAs made using the methods described above are isolated and 
10 cloned. The cDNAs may be cloned into vectors such as plasmids or viral vectors capable of 
replicating in an appropriate host cell. For example, the host cell may be a bacterial, mammalian, 
avian, or insect cell. 

Techniques for isolating mRNA, reverse transcribing a primer hybridized to mRNA to 
generate a first cDNA strand, extending a primer to make a second cDNA strand complementary to 
1 5 the first cDN A strand, isolating the double stranded cDNA and cloning the double stranded cDN A 
are well known to those skilled in the art and are described in Current Protocols in Molecular 
Biology, John Wiley & Sons, Inc. 1997 and Sambrook et al, 1989. 

Consequently, the present invention encompasses methods of making cDNAs. In a first 
embodiment, the method of making a cDNA comprises the steps of 
20 ^ a) contacting a collection of mRNA molecules from human cells with a primer comprising 
at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of a sequence selected 
from the group consisting of the sequences complementary to SEQ ID NOs:l-169, 339-455, 561- 
784 and sequences complementary to a clone insert of the deposited clone pool; 
b) hybridizing said primer to an mRNA in said collection; 
25 c) reverse transcribing said hybridized primer to make a first cDNA strand from said 

mRNA; 

d) making a second cDNA strand complementary to said first cDNA strand; and 

e) isolating the resulting cDNA comprising said first cDNA strand and said second cDNA 

strand. 

30 Another embodiment of the present invention is a purified cDNA obtainable by the method 

of the preceding paragraph. In one aspect of this embodiment, the cDNA encodes at least a portion 
of a human polypeptide. 

In a second embodiment, the method of making a cDNA comprises the steps of 

a) contacting a collection of mRNA molecules from human cells with a first primer capable 
35 of hybridizing to the poly A tail of said mRNA; 

b) hybridizing said first primer to said polyA tail; 

c) reverse transcribing said mRNA to make a first cDNA strand; 

d) making a second cDNA strand complementary to said first cDNA strand using at least 
one primer comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of 
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a sequence selected from the group consisting of SEQ IDNOs: 1-169, 339-455, 561-784 and 
sequences of clone inserts of the deposited clone pool; and 

e) isolating the resulting cDNA comprising said first cDNA strand and said second cDN A 

strand. 

5 In another aspect of this method the second cDNA strand is made by 

a) contacting said first cDNA strand with a second primer comprising at least 12, 15, 18, 20, 
23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of a sequence selected from the group consisting 
of SEQ ID NOs: 1-169, 339-455, 561-784 and sequences of clone inserts of the deposited clone 
pool, and a third primer which sequence is fully included within the sequence of said first primer; 
10 b) performing a first polymerase chain reaction with said second and third primers to 

generate a first PCR product; 

c) contacting said first PCR product with a fourth primer, comprising at least 12, 15, 18, 20, 
23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of said sequence selected from the group 
consisting of SEQ ID NOs: 1-1 69, 339-455, 561-784 and sequences of clone inserts of the deposited 

15 clone pool, and a fifth primer, which sequence is fully included within the sequence of said third 
primer, wherein said fourth and fifth hybridize to sequences within said first PCR product; and 

d) performing a second polymerase chain reaction, thereby generating a second PCR 

product. 

Alternatively, the second cDNA strand may be made by contacting said first cDNA strand 
20 with a second primer comprising at least 12, 15, 18, 20, 23, 25, 28, 30, 35, 40, or 50 consecutive 
nucleotides of a sequence selected from the group consisting of SEQ ID NOs: 1-169, 339-455, 561- 
784 and sequences of clone inserts of the deposited clone pool, and a third primer which sequence is 
fully included within the sequence of said first primer and performing a polymerase chain reaction 
with said second and third primers to generate said second cDNA strand. 
25 Alternatively, the second cDNA strand may be made by: 

a) contacting said first cDNA strand with a second primer comprising at least 12, 15, 18, 20, 
23, 25, 28, 30, 35, 40, or 50 consecutive nucleotides of a sequence selected from the group consisting 
of SEQ ID NOs:l-169, 339-455, 561-784 and sequences of clone inserts of the deposited clone 
pool; 

30 b) hybridizing said second primer to said first strand cDNA; and 

c) extending said hybridized second primer to generate said second cDNA strand. 

Another embodiment of the present invention is a purified cDN A obtainable by a method of 

making a cDNA of the invention. In one aspect of this embodiment, said cDNA encodes at least a 

portion of a human polypeptide. 

3 5 Other protocols 

Alternatively, other procedures may be used for obtaining homologous cDNAs. In one 
approach, cDNAs are prepared from mRNA and cloned into double stranded phagemids as follows. 
The cDNA library in the double stranded phagemids is then rendered single stranded by treatment 
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with an endonuclease, such as the Gene II product of the phage Fl and an exonuclease (Chang et 
ai, 1993, which disclosure is hereby incorporated by reference in its entirety). A biotinylated 
oligonucleotide comprising the sequence of a fragment of a known GENSET cDNA, genomic DNA 
or fragment thereof is hybridized to the single stranded phagemids. Preferably, the fragment 
5 comprises at least 10, 12, 1 5, 1 7, 1 8, 20, 23, 25, or 28 consecutive nucleotides of a sequence 
selected from the group consisting of the sequences of SEQ ID NOs:M69, 339-455, 561-784 and 
sequences of clone inserts of the deposited clone pool. 

Hybrids between the biotinylated oligonucleotide and phagemids are isolated by incubating 
the hybrids with streptavidin coated paramagnetic beads and retrieving the beads with a magnet 

10 (Fry et al 1992, which disclosure is hereby incorporated by reference in its entirety). Thereafter, 
the resulting phagemids are released from the beads and converted into double stranded DNA using 
a primer specific for the GENSET cDNA or fragment used to design the biotinylated 
oligonucleotide. Alternatively, protocols such as the Gene Trapper kit (Gibco BRL), which 
disclosure is which disclosure is hereby incorporated by reference in its entirety, may be used. The 

15 resulting double stranded DNA is transformed into bacteria. Homologous cDNAs to the GENSET 
cDNA or fragment thereof sequence are identified by colony PGR or colony hybridization. 

As a chromosome marker 

Chromosomal localization of the cDNA of the present invention were determined using 
information from public and proprietary databases. Table II lists the putative chromosomal location 

20 of the polynucleotides of the present invention. Column one lists the sequence identification number 
with the corresponding chromosomal location listed in column two. Thus, the present invention 
also relates to methods and compositions using the chromosomal location of the polynucleotides of 
the invention to construct a human high resolution map or to identify a given chromosome in a 
sample using any techniques known to those skilled in the art including those disclosed below. 

25 GENSET polynucleotides may also be mapped to their chromosomal locations using any 

methods or techniques known to those skilled in the art including radiation hybrid (RH) mapping, 
PCR-based mapping and Fluorescence in situ hybridization (FISH) mapping described below. 

Radiation hybrid mapping 

Radiation hybrid (RH) mapping is a somatic cell genetic approach that can be used for high 
30 resolution mapping of the human genome. In this approach, cell lines containing one or more 
human chromosomes are lethally irradiated, breaking each chromosome into fragments whose size 
depends on the radiation dose. These fragments are rescued by fusion with cultured rodent cells, 
yielding subclones containing different fragments of the human genome. This technique is 
described by Benham et al (1989) and Cox etal, (1990), which disclosures are hereby 
35 incorporated by reference in their entireties. The random and independent nature of the subclones 
permits efficient mapping of any human genome marker. Human DNA isolated from a panel of 80- 
100 cell lines provides a mapping reagent for ordering GENSET cDNAs or genomic DNAs. In this 
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approach, the frequency of breakage between markers is used to measure distance, allowing 
construction of fine resolution maps as has been done using conventional ESTs (Schuler e/a/., 
1996), which disclosure is hereby incorporated by reference in its entirety. 

RH mapping has been used to generate a high-resolution whole genome radiation hybrid 
5 map of human chromosome 17q22-q25.3 across the genes for growth hormone (GH) and thymidine 
kinase (TK) (Foster et al., 1996), the region surrounding the Gorlin syndrome gene (Obermayr ei 
aLy 1996), 60 loci covering the entire short arm of chromosome 12 (Raeymaekers e/a/., 1995), the 
region of human chromosome 22 containing the neurofibromatosis type 2 locus (Frazer et aL, 1992) 
and 13 loci on the long arm of chromosome 5 (Warrington ei ai^ 1991), which disclosures are 
10 hereby incorporated by reference in their entireties. 

Mapping of cDNAs to Human Chromosomes using PCR techniques 

GENSET cDNAs and genomic DNAs may be assigned to human chromosomes using PCR 
based methodologies. In such approaches, oligonucleotide primer pairs are designed from the 
cDNA sequence to minimize the chance of amplifying through an intron. Preferably, the 
1 5 oligonucleotide primers are 1 8-23 bp in length and are designed for PCR amplification. The 
creation of PCR primers from known sequences is well known to those with skill in the art. For a 
review of PCR technology see Eriich (1992), which disclosure is hereby incorporated by reference 
in its entirety. 

The primers are used in polymerase chain reactions (PCR) to amplily templates from total 

20 human genomic DNA. PCR conditions are as follows: 60 ng of genomic DNA is used as a template 
for PCR with 80 ng of each oligonucleotide primer, 0.6 unit of Taq polymerase, and I uCu of a 
labeled deoxycytidine triphosphate. The PCR is performed in a microplate thermocycler (Techne) 
under the following conditions: 30 cycles of 94 degrees Celsius, 1 .4 min; 55 degrees Celsius, 2 min; 
and 72 degrees Celsius, 2 min; with a final extension at 72 degrees Celsius for 10 min. The 

25 amplified products are analyzed on a 6% polyacrylamide sequencing gel and visualized by 

autoradiography. If the length of the resulting PCR product is identical to the distance between the 
ends of the primer sequences in the cDN A from which the primers are derived, then the PCR 
reaction is repeated with DNA templates from two panels of human-rodent somatic cell hybrids, 
BIOS PCRable DNA (BIOS Corporation) and NIGMS Human-Rodent Somatic Cell Hybrid 

30 Mapping Panel Number 1 (NIGMS, Camden, NJ). 

PCR is used to screen a series of somatic cell hybrid cell lines containing defined sets of 
human chromosomes for the presence of a given cDNA or genomic DNA, DNA is isolated from 
the somatic hybrids and used as starting templates for PCR reactions using the primer pairs from the 
GENSET cDNAs or genomic DNAs. Only those somatic cell hybrids with chromosomes 

35 containing the human gene corresponding to the GENSET cDNA or genomic DNA will yield an 
amplified fragment. The GENSET cDNAs or genomic DNAs are assigned to a chromosome by 
analysis of the segregation pattern of PCR products from the somatic hybrid DNA templates. The 
single human chromosome present in all cell hybrids that give rise to an amplified fragment is the 
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chromosome containing that GENSET cDNA or genomic DNA. For a review of techniques and 
analysis of results from somatic cell gene mapping experiments, see Ledbetter et aL, (1990), which 
disclosure is hereby incorporated by reference in its entirety. 

Mapping of cDNAs to Chromosomes Using Fluorescence in situ Hybridization 

5 Fluorescence in situ hybridization (FISH) allows the GENSET cDNA or genomic DNA to 

be mapped to a particular location on a given chromosome. The chromosomes to be used for 
fluorescence in situ hybridization techniques may be obtained from a variety of sources including 
cell cuhures, tissues, or whole blood. 

In a preferred embodiment, chromosomal localization of a GENSET cDNA or genomic 

10 DNA is obtained by FISH as described by Cherif et al (1990), which disclosure is hereby 
incorporated by reference in its entirety. Metaphase chromosomes are prepared from 
phytohemagglutinin (PHA)-stimulated blood cell donors. PHA-stimulated lymphocytes from 
healthy males are cultured for 72 h in RPMI-1640 medium. For synchronization, methotrexate (10 
uM) is added for 17 h, followed by addition of 5-bromodeoxyuridine (5-BudR, 0.1 mM) for 6 h. 

1 S Colcemid ( I ug/ml) is added for the last 1 5 min before harvesting the cells. Cells are collected, 
washed in RPMI, incubated with a hypotonic solution of KCI (75 mM) at 37 degrees Celsius for 15 
min and fixed in three changes of methanol :acetic acid (3:1). The cell suspension is dropped onto a 
glass slide and air dried. The GENSET cDNA or genomic DNA is labeled with biotin-16 dUTP by 
nick translation according to the manufacturer's instructions (Bethesda Research Laboratories, 

20 Bethesda, MD), purified using a Sephadex G-50 column (Pharmacia, Upssala, Sweden) and 

precipitated. Just prior to hybridization, the DNA pellet is dissolved in hybridization buffer (50% 
formamide, 2 X SSC, 10% dextran sulfate, 1 mg/ml sonicated salmon sperm DNA, pH 7) and the 
probe is denatured at 70 degrees Celsius for 5-10 min. 

Slides kept at -20 degrees Celsius are treated for 1 h at 37 degrees Celsius with RNase A 

25 (100 ug/ml), rinsed three times in 2 X SSC and dehydrated in an ethanol series. Chromosome 
preparations are denatured in 70% formamide, 2 X SSC for 2 min at 70 degrees Celsius, then 
dehydrated at 4 degrees Celsius. The slides are treated with proteinase K (10 ug/100 ml in 20 mM 
Tris-HCl, 2 mM CaCb) at 37 degrees Celsius for 8 min and dehydrated. The hybridization mixture 
containing the probe is placed on the slide, covered with a coverslip, sealed with rubber cement and 

30 incubated overnight in a humid chamber at 37 degrees Celsius. After hybridization and post- 
hybridization washes, the biotinylated probe is detected by avidin-FITC and amplified with 
additional layers of biotinylated goat anti-avidin and avidin-FITC. For chromosomal localization, 
fluorescent R-bands are obtained as previously described (Cherif et aL, 1990). The slides are 
observed under a LEICA fluorescence microscope (DMRXA). Chromosomes are counterstained 

35 with propidium iodide and the fluorescent signal of the probe appears as two symmetrical yellow- 
green spots on both chromatids of the fluorescent R-band chromosome (red). Thus, a particular 
GENSET cDNA or genomic DNA may be localized to a particular cytogenetic R-band on a given 
chromosome. 
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Use of cDNAs to Construct or Expand Chromosome Maps 

Once the GENSET cDNAs or genomic DNAs have been assigned to particular 
chromosomes using any technique known to those skilled in the art those skilled in the art, 
S particularly those described herein, they may be utilized to construct a high resolution map of the 
chromosomes on which they are located or to identify the chromosomes in a sample. 

Chromosome mapping involves assigning a given unique sequence to a particular 
chromosome as described above. Once the unique sequence has been mapped to a given 
chromosome, it is ordered relative to other unique sequences located on the same chromosome. 

10 One approach to chromosome mapping utilizes a series of yeast artificial chromosomes (YACs) 
bearing several thousand long inserts derived from the chromosomes of the organism from which 
the GENSET cDNAs or genomic DNAs are obtained. This approach is described in Nagaraja et al. 
^997), which disclosure is hereby incorporated by reference in its entirety. Briefly, in this 
approach each chromosome is broken into overlapping pieces which are inserted into the YAC 

1 5 vector. The YAC inserts are screened using PCR or other methods to determine whether they 
include the GENSET cDN A or genomic DNA whose position is to be determined. Once an insert 
has been found which includes the GENSET cDNA or genomic DNA, the insert can be analyzed by 
PCR or other methods to determine whether the insert also contains other sequences known to be on 
the chromosome or in the region from which the GENSET cDNA or genomic DNA was derived. 

20 This process can be repeated for each insert in the YAC library to determine the location of each of 
the GENSET cDNA or genomic DNA relative to one another and to other known chromosomal 
markers. In this way, a high resolution map of the distribution of numerous unique markers along 
each of the organisms chromosomes may be obtained. 

Identification of genes associated with hereditary diseases or drug response 

25 This example illustrates an approach useful for the association of GENSET cDNAs or 

genomic DNAs with particular phenotypic characteristics. In this example, a particular GENSET 
cDNA or genomic DNA is used as a test probe to associate that GENSET cDNA or genomic DNA 
with a particular phenotypic characteristic. 

GENSET cDNAs or genomic DNAs are mapped to a particular location on a human 

30 chromosome using techniques such as those described herein or other techniques known in the art. 
A search of Mendelian Inheritance in Man (V. McKusick, Mendeiian Inheritance in Man; available 
on line through Johns Hopkins University Welch Medical Library) reveals the region of the human 
chromosome which contains the GENSET cDNA or genomic DNA to be a very gene rich region 
containing several known genes and several diseases or phenotypes for which genes have not been 

35 identified. The gene corresponding to this GENSET cDNA or genomic DNA thus becomes an 
immediate candidate for each of these genetic diseases. 
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Cells from patients with these diseases or phenotypes are isolated and expanded in culture. 
PCR primers from the GENSET cDNA or genomic DNA are used to screen genomic DNA, mRNA 
or cDNA obtained from the patients. GENSET cDNAs or genomic DNAs that are not amplified in 
the patients can be positively associated with a particular disease by further analysis. Alternatively, 
5 the PCR analysis may yield fragments of diflFerent lengths when the samples are derived from an 
individual having the phenotype associated with the disease than when the sample is derived from a 
healthy individual, indicating that the gene containing the cDNA may be responsible for the genetic 
disease. 

Uses of polynucleotides in recombinant vectors 

10 The present invention also relates to recombinant vectors including the isolated 

polynucleotides of the present invention, and to host cells recombinant for a polynucleotide of the 
invention, such as the above vectors, as well as to methods of making such vectors and host cells 
and for using them for production of GENSET polypeptides by recombinant techniques. 

Recombinant Vectors 

1 5 The term "vector" is used herein to designate either a circular or a linear DNA or RNA 

molecule, which is either double-stranded or single-stranded, and which comprise at least one 
polynucleotide of interest that is sought to be transferred in a cell host or in a unicellular or 
multicellular host organism. The present invention encompasses a family of recombinant vectors 
that comprise a regulatory polynucleotide and/or a coding polynucleotide derived from either the 

20 GENSET genomic sequence or the cDNA sequence. Generally, a recombinant vector of the 
invention may comprise any of the polynucleotides described herein, including regulatory 
sequences, coding sequences and polynucleotide constructs, as well as any GENSET primer or 
probe as deFmed herein. 

In a first preferred embodiment, a recombinant vector of the invention is used to amplify 

25 the inserted polynucleotide derived from a GENSET genomic sequence or a GENSET cDNA, for 
example any cDNA selected from the group consisting of sequences 'of SEQ ID NOs: 1-169, 339- 
455, 561-784, sequences of clone inserts of the deposited clone pool, variants and fragments thereof 
in a suitable cell host, this polynucleotide being amplified at every time that the recombinant vector 
replicates. 

30 A second preferred embodiment of the recombinant vectors according to the invention 

comprises expression vectors comprising either a regulatory polynucleotide or a coding nucleic acid 
of the invention, or both. Within certain embodiments, expression vectors are employed to express 
a GENSET polypeptide which can be then purified and, for example be used in ligand screening 
assays or as an immunogen in order to raise specific antibodies directed against the GENSET 

35 protein. In other embodiments, the expression vectors are used for constructing transgenic animals 
and also for gene therapy. Expression requires that appropriate signals are provided in the vectors, 
said signals including various regulatory elements, such as enhancers/promoters from both viral and 
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mammalian sources that drive expression of the genes of interest in host cells. Dominant drug 
selection markers for establishing permanent, stable cell clones expressing the products are 
generally included in the expression vectors of the invention, as they are elements that link 
expression of the drug selection markers to expression of the polypeptide. 
5 More particularly, the present invention relates to expression vectors which include nucleic 

acids encoding a GENSET protein, preferably a GENSET protein with an amino acid sequence 
selected from the group consisting of sequences of SEQ ID NOs: 170-338, 456-560, 785-918, 
sequences of polypeptides encoded by the clone inserts of the deposited clone pool, variants and 
fragments thereof. The polynucleotides of the present invention may be used to express an encoded 

10 protein in a host organism to produce a beneficial effect. In such procedures, the encoded protein 
may be transiently expressed in the host organism or stably expressed in the host organism. The 
encoded protein may have any of the activities described herein. The encoded protein may be a 
protein which the host organism lacks or, alternatively, the encoded protein may augment the 
existing levels of the protein in the host organism. 

1 5 Some of the elements which can be found in the vectors of the present invention are 

described in further detail in the following sections. 

General features of the expression vectors of the invention 

A recombinant vector according to the invention comprises, but is not limited to, a YAC 
(Yeast Artificial Chromosome), a BAG (Bacterial Artificial Chromosome), a phage, a phagemid, a 
20 cosmid, a plasmid or even a linear DNA molecule which may comprise a chromosomal, non- 
chromosomal, semi-synthetic and synthetic DNA. Such a recombinant vector can comprise a 
transcriptional unit comprising an assembly of: 

(1) a genetic element or elements having a regulatory role in gene expression, for example 
promoters or enhancers. Enhancers are cis-acting elements of DNA, usually from about 10 to 300 

25 bp in length that act on the promoter to increase the transcription. 

(2) a structural or coding sequence which is transcribed into mRNA and eventually 
translated into a polypeptide, said structural or coding sequence being operably linked to the 
regulatory elements described in (1); and 

(3) appropriate transcription initiation and termination sequences. Structural units intended 
30 for use in yeast or eukaryotic expression systems preferably include a leader sequence enabling 

extracellular secretion of translated protein by a host cell. Alternatively, when a recombinant 
protein is expressed without a leader or transport sequence, it may include a N-terminal residue. 
This residue may or may not be subsequently cleaved from the expressed recombinant protein to 
provide a fmal product. 

35 Generally, recombinant expression vectors will include origins of replication, selectable 

markers permitting transformation of the host cell, and a promoter derived from a highly expressed 
gene to direct transcription of a downstream structural sequence. The heterologous structural 
sequence is assembled in appropriate phase with translation initiation and termination sequences. 
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and preferably a leader sequence capable of directing secretion of the translated protein into the 
periplasmic space or the extracellular medium. In a specific embodiment wherein the vector is 
adapted for transfecting and expressing desired sequences in mammalian host cells, preferred 
vectors will comprise an origin of replication in the desired host, a suitable promoter and enhancer, 
S and also any necessary ribosome binding sites, polyadenylation signals, splice donor and acceptor 
sites, transcriptional termination sequences, and S'-flanking non-transcribed sequences. DNA 
sequences derived firom the SV40 viral genome, for example SV40 origin, early promoter, 
enhancer, splice and polyadenylation signals may be used to provide the required non-transcribed 
genetic elements. 

10 The in vivo expression of a GENSET polypeptide of the present invention may be useful in 

order to correct a genetic defect related to the expression of the native gene in a host organism, for 
the treatment or prevention of any disease or condition that can be treated or prevented by 
increasing the level of GENSET polypeptide expression, or to the production of a biologically 
inactive GENSET protein. Consequently, the present invention also comprises recombinant 

1 5 expression vectors mainly designed for the in vivo production of a GENSET polypeptide the present 
invention by the introduction of the appropriate genetic material in the organism or the patient to be 
treated. This genetic material may be introduced in vitro in a cell that has been previously extracted 
from the organism, the modified cell being subsequently reintroduced in the said organism, directly 
in vivo into the appropriate tissue. 

20 Regulatory Elements 

The suitable promoter regions used in the expression vectors according to the present 
invention are chosen taking into account the cell host in which the heterologous gene has to be 
expressed. The particular promoter employed to control the expression of a nucleic acid sequence 
of interest is not believed to be important, so long as it is capable of directing the expression of the 

25 nucleic acid in the targeted cell. Thus, where a human cell is targeted, it is preferable to position the 
nucleic acid coding region adjacent to and under the control of a promoter that is capable of being 
expressed in a human cell, such as, for example, a human or a viral promoter. 

A suitable promoter may be heterologous with respect to the nucleic acid for which it 
controls the expression or alternatively can be endogenous to the native polynucleotide containing 

30 the coding sequence to be expressed. Additionally, the promoter is generally heterologous with 
respect to the recombinant vector sequences within which the construct promoter/coding sequence 
has been inserted. 

Promoter regions can be selected from any desired gene using, for example, CAT 
(chloramphenicol transferase) vectors and more preferably pKK232-8 and pCM7 vectors. 
35 Preferred bacterial promoters are the Lacl, LacZ, the T3 or T7 bacteriophage RNA 

polymerase promoters, the gpt, lambda PR, PL and trp promoters (EP 0036776), the polyhedrin 
promoter, or the plO protein promoter from baculovirus (Kit Novagen), (Smith et al.^ 1983; 



82 



wo 02/083898 PCT/IBO 1/009 14 

O'Reilly e( al., 1 992; which disclosures are hereby incorporated by reference in their entireties), the 
lambda PR promoter or also the trc promoter. 

Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, early and late 
SV40, LTRs from retrovirus, and mouse metailothionein-L. Selection of a convenient vector and 
5 promoter is well within the level of ordinary skill in the art. The choice of a promoter is well within 
the ability of a person skilled in the field of genetic engineering. For example, one may refer to the 
book of Sambrook et al, (1989) or also to the procedures described by Fuller et a/., (1996), which 
disclosures are hereby incorporated by reference in their entireties. 

Other regulatory elements 

1 0 Where a cDNA insert is employed, one will typically desire to include a polyadenylation 

signal to effect proper polyadenylation of the gene transcript. The nature of the polyadenylation 
signal is not believed to be crucial to the successful practice of the invention, and any such sequence 
may be employed such as human growth hormone and SV40 polyadenylation signals. Also 
contemplated as an element of the expression cassette is a terminator. These elements can serve to 

1 5 enhance message levels and to minimize read through from the cassette into other sequences. 

Selectable Markers 

Selectable markers confer an identifiable change to the cell permitting easy identification of 
cells containing the expression construct. The selectable marker genes for selection of transformed 
host cells are preferably dihydrofolale reductase or neomycin resistance for eukaryotic cell culture, 
20 TRPl for S. cerevisiae or tetracycline, rifampicin or ampicillin resistance in £. Coli, or levan 
saccharase for mycobacteria, this latter marker being a negative selection marker. 

Preferred Vectors 
Bacterial vectors 

As a representative but non-limiting example, useful expression vectors for bacterial use 
25 can comprise a selectable marker and a bacterial origin of replication derived from commercially 

available plasmids comprising genetic elements of pBR322 (ATCC 37017). Such commercial 

vectors include, for example, pKK223-3 (Pharmacia, Uppsala, Sweden), and pGEMl (Promega 

Biotec, Madison, WI, USA). 

Large numbers of other suitable vectors are known to those of skill in the art, and 
30 commercially available, such as the following bacterial vectors: pQE70, pQE60, pQE-9 (Qiagen), 

pbs, pDlO, phagescript, psiX174, pbluescript SK, pbsks, pNH8A, pNH16A, pNHlSA, pNH46A 

(Stratagene); ptrc99a, pKK223-3, pICK233-3, pDR540, pRITS (Pharmacia); pWLNEO, pSV2CAT. 

pOG44, pXTl, pSG (Stratagene); pSVK3, pBPV, pMSG, pSVL (Pharmacia); pQE-30 

(QIAexpress). 
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The PI bacteriophage vector may contain large inserts ranging from about 80 to about 100 
kb. The construction of PI bacteriophage vectors such as pi 58 or pl58/neo8 are notably described 
by Sternberg (1992, 1994), which disclosure is hereby incorporated by reference in its entirety. 
5 Recombinant PI clones comprising GENSET nucleotide sequences may be designed for inserting 
large polynucleotides of more than 40 kb (See Linton et al,, 1993), which disclosure is hereby 
incorporated by reference in its entirety. To generate PI DNA for transgenic experiments, a 
preferred protocol is the protocol described by McCormick et aL (1994), which disclosure is hereby 
incorporated by reference in its entirety. Briefly, E. coli (preferably strain NS3529) harboring the 

10 PI plasmid are grown overnight in a suitable broth medium containing 25 ^g/ml of kanamycin. 
The PI DNA is prepared from the E. coli by alkaline lysis using the Qiagen Plasmid Maxi kit 
(Qiagen, Chatsworth, CA, USA), according to the manufacturer's instructions. The PI DNA is 
purified from the bacterial lysate on two Qiagen-tip 500 columns, using the washing and elution 
buffers contained in the kit. A phenol/chloroform extraction is then performed before precipitating 

15 the DNA with 70% ethanol. After solubilizing the DNA in TE (10 mM Tris-HCI, pH 7.4, 1 mM 
EDTA), the concentration of the DNA is assessed by spectrophotometry. 

When the goal is to express a PI clone comprising GENSET polypeptide-encoding 
nucleotide sequences in a transgenic animal, typically in transgenic mice, it is desirable to remove 
vector sequences from the PI DNA fragment, for example by cleaving the PI DNA at rare-cutting 

20 sites within the PI polylinker (5/71, Noil or Sal\y The PI insert is then purified from vector 

sequences on a pulsed-field agarose gel, using methods similar to those originally reported for the 
isolation of DNA from YACs {See e, g., Schedl et aL, 1993a; Peterson ei aL, 1993), which 
disclosures are hereby incorporated by reference in their entireties. At this stage, the resulting 
purified insert DNA can be concentrated, if necessary, on a Millipore Ultrafree-MC Filter Unit 

25 (Millipore, Bedford, MA, USA - 30,000 molecular weight limit) and then dialyzed against 

microinjection buffer (10 mM Tris-HCI, pH 7.4; 250 ^iM EDTA) containing 100 mM NaCl, 30 nM 
spermine, 70 ^xM spermidine on a microdyalisis membrane (type VS, 0.025 ^M from Millipore). 
The intactness of the purified PI DNA insert is assessed by electrophoresis on 1% agarose (Sea 
Kem GTG; FMC Bio-products) pulse-field gel and staining with ethidium bromide. 

30 Viral vectors 

In one specific embodiment, the vector is derived from an adenovirus. Preferred adenovirus 
vectors according to the invention are those described by Feldman and Steg (1 996), or Ohno et aL, 
(1994), which disclosures are hereby incorporated by reference in their entireties. Another 
preferred recombinant adenovirus according to this specific embodiment of the present invention is 
35 the human adenovirus type 2 or 5 (Ad 2 or Ad 5) or an adenovirus of animal origin (French patent 
application No. FR-93.05954), which disclosure is hereby incorporated by reference in its entirety. 

Retrovirus vectors and adeno-associated virus vectors are generally understood to be the 
recombinant gene delivery systems of choice for the transfer of exogenous polynucleotides in vivo , 
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particularly to mammals, including humans. These vectors provide efficient delivery of genes into 
cells, and the transferred nucleic acids are stably integrated into the chromosomal DNA of the host. 
Particularly preferred retroviruses for the preparation or construction of retroviral in vitro or w vitro 
gene delivery vehicles of the present invention include retroviruses selected from the group 
S consisting of Mink<;ell Focus Inducing Virus, Murine Sarcoma Virus, Reticuloendotheliosis virus 
and Rous Sarcoma virus. Particularly preferred Murine Leukemia Viruses include the 4070A and 
the 1504A viruses, Abelson (ATCC No VR-999), Friend (ATCC No VR-245), Gross (ATCC No 
VR-590), Rauscher (ATCC No VR-998) and Moloney Murine Leukemia Virus (ATCC No VR- 
190; PCT Application No WO 94/24298). Particularly preferred Rous Sarcoma Viruses include 

10 Bryan high liter (ATCC Nos VR-334, VR-657, VR.726, VR-659 and VR-728). Other preferred 
retroviral vectors are those described in Roth et al (1996), PCT Application No WO 93/25234, 
PCT Application No WO 94/ 06920, Roux et ai, (1989), Julan et al., (1992), and Neda et a/., 
(1991), which disclosures are hereby incorporated by reference in their entireties. 

Yet another viral vector system that is contemplated by the invention comprises the adeno- 

1 5 associated virus (AAV). The adeno-associated virus is a naturally occurring defective virus that 
requires another virus, such as an adenovirus or a herpes virus, as a helper virus for efficient 
replication and a productive life cycle (Muzyczka et aL^ 1992), which disclosure is hereby 
incorporated by reference in its entirety. It is also one of the few viruses that may integrate its DNA 
into non-dividing cells, and exhibits a high frequency of stable integration (Flotte et al. 1992; 

20 Samulski et aL, 1989; McLaughlin et al, 1989), which disclosures are hereby incorporated by 
reference in their entireties. One advantageous feature of AAV derives from its reduced efficacy 
for transducing primary cells relative to transformed cells. 

BAC vectors 

The bacterial artificial chromosome (BAC) cloning system (Shizuya ei al., 1992), which 
25 disclosure is hereby incorporated by reference in its entirety, has been developed to stably maintain 
large fragments of genomic DNA (100-300 kb) in E. colL A preferred BAC vector comprises a 
pBeloBACl 1 vector that has been described by Kim et al, (1996), which disclosure is hereby 
incorporated by reference in its entirety. BAC libraries are prepared with this vector using size- 
selected genomic DNA that has been partially digested using enzymes that permit ligation into 
30 either the Bam HI or HindlU sites in the vector. Flanking these cloning sites are T7 and SP6 RNA 
polymerase transcription initiation sites that can be used to generate end probes by either RNA 
transcription or PCR methods. After the construction of a BAC library in E. coli, BAC DNA is 
purified from the host cell as a supercoiled circle. Converting these circular molecules into a linear 
form precedes both size determination and introduction of the BACs into recipient cells. The 
3S cloning site is flanked by two Not 1 sites, permitting cloned segments to be excised from the vector 
by Not I digestion. Alternatively, the DNA insert contained in the pBeloBACl 1 vector may be 
linearized by treatment of the BAC vector with the commercially available enzyme lambda 
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terminase that leads to the cleavage at the unique co^N site, but this cleavage method results in a 
full length BAC clone containing both the insert DNA and the BAG sequences. 

Baculovirus 

Another specific suitable host vector system is the pVL1392/1393 baculovirus transfer 
5 vector (Pharmingen) that is used to transfect the SF9 cell line (ATCC No. CRL 171 1) which is 
derived from Spodoptera frugiperda. Other suitable vectors for the expression of the GENSET 
polypeptide of the present invention in a baculovirus expression system include those described by 
Chai et al,, (1993), Vlasak ei aL, (1983), and Lenhard et al, (1 996), which disclosures are hereby 
incorporated by reference in their entireties. 

1 0 Delivery Of The Recombinant Vectors 

To effect expression of the polynucleotides and polynucleotide constructs of the invention, 
the constructs must be delivered into a cell. This delivery may be accomplished in vitro, as in 
laboratory procedures for transforming cell lines, or in vivo or ex vivo, as in the treatment of certain 
diseases states. One mechanism is viral infection where the expression construct is encapsulated in 

1 5 an infectious viral particle. 

Several non-viral methods for the transfer of polynucleotides into cultured mammalian cells 
are also contemplated by the present invention, and include, without being limited to, calcium 
phosphate precipitation (Graham et al, 1973; Chen et al, 1987); DEAE-dextran (Gopal, 1985); 
electroporation (Tur-Kaspa et al,, 1986; Potter et aL, 1984); direct microinjection (Harland et aL, 

20 1985); DNA-loaded liposomes (Nicolau et a!., 1982; Fraley et aL, 1979); and receptor-mediated 
transfection. (Wu and Wu, 1987, 1988), which disclosures are hereby incorporated by reference in 
their entireties. Some of these techniques may be successfully adapted for in vivo or ex vivo use. 

Once the expression polynucleotide has been delivered into the cell, it may be stably 
integrated into the genome of the recipient cell. This integration may be in the cognate location and 

25 orientation via homologous recombination (gene replacement) or it may be integrated in a random, 
non-specific location (gene augmentation). In yet further embodiments, the nucleic acid may be 
stably maintained in the cell as a separate, episomal segment of DNA. Such nucleic acid segments 
or "episomes" encode sequences sufficient to permit maintenance and replication independent of or 
in synchronization with the host cell cycle. 

30 One specific embodiment for a method for delivering a protein or peptide to the interior of a 

cell of a vertebrate in vivo comprises the step of introducing a preparation comprising a 
physiologically acceptable carrier and a naked polynucleotide operatively coding for the 
polypeptide of interest into the interstitial space of a tissue comprising the cell, whereby the naked 
polynucleotide is taken up into the interior of the cell and has a physiological effect. This is 

35 particularly applicable for transfer in vitro but it may be applied to in vivo as well. 

Compositions for use in vitro and in vivo comprising a "naked" polynucleotide are 
described in PCT application No. WO 90/1 1092 (Vical Inc.) and also in PCT application No. WO 
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95/1 1307 (Institut Pasteur, INSERM, Universite d'Ottawa) as well as in the articles of Tascon et al 
(1996) and of Huygen et a/., (1996), which disclosures are hereby incorporated by reference in their 
entireties. 

In still another embodiment of the invention, the transfer of a naked polynucleotide of the 
5 invention, including a polynucleotide construct of the invention, into cells may be accomplished 
with particle bombardment (biolistic), said particles being DNA-coated microprojectiles accelerated 
to a high velocity allowing them to pierce cell membranes and enter cells without killing them, such 
as described by Klein et aL, (1987), which disclosure is hereby incorporated by reference in its 
entirety. 

10 In a further embodiment, the polynucleotide of the invention may be entrapped in a 

liposome (Ghosh and Bacchawat, 1991; Wong a/., 1980;Nicolaue/a/., 1987, which disclosures 
are hereby incorporated by reference in their entireties). 

In a specific embodiment, the invention provides a composition for the in vivo production 
of the GENSET polypeptides described herein. It comprises a naked polynucleotide operatively 

1 5 coding for this polypeptide, in solution in a physiologically acceptable carrier, and suitable for 
introduction into a tissue to cause cells of the tissue to express the said protein or polypeptide. 

The amount of vector to be injected to the desired host organism varies according to the site 
of injection. As an indicative dose, it will be injected between 0.1 and 100 \ig of the vector in an 
animal body, preferably a mammal body, for example a mouse body. 

20 In another embodiment of the vector according to the invention, it may be introduced in 

vitro in a host cell, preferably in a host cell previously harvested from the animal to be treated and 
more preferably a somatic cell such as a muscle cell. In a subsequent step, the cell that has been 
transformed with the vector coding for the desired GENSET polypeptide or the desired fragment 
thereof is reintroduced into the animal body in order to deliver the recombinant protein within the 

25 body either locally or systemically. 

Secretion vectors 

Some of the GENSET cDNAs or genomic DNAs of the invention may also be used to 
construct secretion vectors capable of directing the secretion of the proteins encoded by genes 
30 inserted in the vectors. Such secretion vectors may facilitate the purification or enrichment of the 
proteins encoded by genes inserted therein by reducing the number of background proteins from 
which the desired protein must be purified or enriched. Exemplary secretion vectors are described 
below. 

The secretion vectors of the present invention include a promoter capable of directing gene 
35 expression in the host cell, tissue, or organism of interest. Such promoters include the Rous 
Sarcoma Virus promoter, the SV40 promoter, the human cytomegalovirus promoter, and other 
promoters familiar to those skilled in the art. 
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A signal sequence from a polynucleotide of the invention, preferably a signal sequences 
selected from the group of signal sequences of SEQ ID NOs: 1-85, 339-400, 406-407, 413-415, 
561-594, and 634-651 and signal sequences of clone inserts of the deposited clone pool is operably 
linked to the promoter such that the mRNA transcribed from the promoter will direct the translation 
5 of the signal peptide. The host cell, tissue, or organism may be any cell, tissue, or organism which 
recognizes the signal peptide encoded by the signal sequence in the GENSET cDNA or genomic 
DNA. Suitable hosts include mammalian cells, tissues or organisms, avian cells, tissues, or 
organisms, insect cells, tissues or organisms, or yeast. 

In addition, the secretion vector contains cloning sites for inserting genes encoding the 
10 proteins which are to be secreted. The cloning sites facilitate the cloning of the insert gene in frame 
with the signal sequence such that a fusion protein in which the signal peptide is fused to the protein 
encoded by the inserted gene is expressed from the mRNA transcribed from the promoter. The 
signal peptide directs the extracellular secretion of the fusion protein. 

The secretion vector may be DNA or RNA and may integrate into the chromosome of the 
1 5 host, be stably maintained as an extrachromosomal replicon in the host, be an artificial 

chromosome, or be transiently present in the host. Preferably, the secretion vector is maintained in 
multiple copies in each host cell. As used herein, multiple copies means at least 2, 5, 10, 20, 25, 50 
or more than SO copies per cell. In some embodiments, the multiple copies are maintained 
extrachromosomal ly. In other embodiments, the multiple copies result from amplification of a 
20 chromosomal sequence. 

Many nucleic acid backbones suitable for use as secretion vectors are known to those 
skilled in the art, including retroviral vectors, SV40 vectors, Bovine Papilloma Virus vectors, yeast 
integrating plasmids, yeast episomal plasmids, yeast artificial chromosomes, human artificial 
chromosomes, P element vectors, baculovirus vectors, or bacterial plasmids capable of being 
25 transiently introduced into the host. 

The secretion vector may also contain a polyA signal such that the polyA signal is located 
downstream of the gene inserted into the secretion vector. 

After the gene encoding the protein for which secretion is desired is inserted into the 
secretion vector, the secretion vector is introduced into the host cell, tissue, or organism using 
30 calcium phosphate precipitation, DEAE-Dextran, electroporation, liposome-mediated transfection, 
viral particles or as naked DNA. The protein encoded by the inserted gene is then purified or 
enriched from the supernatant using conventional techniques such as ammonium sulfate 
precipitation, immunoprecipitation, immunochromatography, size exclusion chromatography, ion 
exchange chromatography, and hplc. Alternatively, the secreted protein may be in a sufficiently 
35 enriched or pure state in the supernatant or growth media of the host to permit it to be used for its 
intended purpose without further enrichment. 

The signal sequences may also be inserted into vectors designed for gene therapy. In such 
vectors, the signal sequence is operably linked to a promoter such that mRNA transcribed from the 
promoter encodes the signal peptide. A cloning site is located downstream of the signal sequence 
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such that a gene encoding a protein whose secretion is desired may readily be inserted into the 
vector and fused to the signal sequence. The vector is introduced into an appropriate host cell. The 
protein expressed from the promoter is secreted extracelluiarly, thereby producing a therapeutic 
effect. 

5 

Cell Hosts 

Another object of the invention comprises a host cell that has been transformed or 
transfected with one of the polynucleotides described herein, and in particular a polynucleotide 
either comprising a GENSET polypeptide-encoding polynucleotide regulatory sequence or the 

10 polynucleotide coding for a GENSET polypeptide. Also included are host cells that are transformed 
(prokaryotic cells) or that are transfected (eukaryotic cells) with a recombinant vector such as one of 
those described above. However, the cell hosts of the present invention can comprise any of the 
polynucleotides of the present invention. In a preferred embodiment, host cells contain a 
polynucleotide sequence comprising a sequence selected from the group consisting of sequences of 

15 SEQ ID NOs:l-169, 339^55, 561-784, sequences of clone inserts of the deposited clone pool, 
variants and fragments thereof Preferred host cells used as recipients for the expression vectors of 
the invention are the following: 

a) Prokaryotic host cells: Escherichia coli strains (I.E.DH5-a strain). Bacillus subtilis. 
Salmonella typhimurium, and strains from species like Pseudomortasy Streptomyces and 

20 Staphylococcus, 

b) Eukaryotic host cells: HeLa cells (ATCC No.CCL2; No.CCL2.1; No.CCL2.2), Cv 1 
cells (ATCC No,CCL70), COS cells (ATCC No.CRL1650; No.CRL1651), Sf-9 cells (ATCC 
No.CRLlTl 1), CI 27 cells (ATCC No. CRL-1804), 3T3 (ATCC No. CRL-6361), CHO (ATCC No. 
CCL.6I), human kidney 293. (ATCC No. 45504; No. CRL-1573) and BHK (ECACC No. 

25 84100501; No. 841 11301). 

c) Other mammalian host cells. 

The present invention also encompasses primary, secondary, and immortalized 
homologously recombinant host cells of vertebrate origin, preferably mammalian origin and 

30 particularly human origin, that have been engineered to: a) insert exogenous (heterologous) 

polynucleotides into the endogenous chromosomal DNA of a targeted gene, b) delete endogenous 
chromosomal DNA, and/ore) replace endogenous chromosomal DNA with exogenous 
polynucleotides. Insertions, deletions, and/or replacements of polynucleotide sequences may be to 
the coding sequences of the targeted gene and/or to regulatory regions, such as promoter and 

35 enhancer sequences, operably associated with the targeted gene. 

In addition to encompassing host cells containing the vector constructs discussed herein, the 
invention also encompasses primary, secondary, and immortalized host cells of vertebrate origin, 
particularly mammalian origin, that have been engineered to delete or replace endogenous genetic 
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material (e.g., coding sequence), and/or to include genetic material (e.g., heterologous 
polynucleotide sequences) that is operably associated with the polynucleotides of the invention, and 
which activates, alters, and/or amplifies endogenous polynucleotides. For example, techniques 
known in the art may be used to operably associate heterologous control regions (e.g., promoter 
S and/or enhancer) and endogenous polynucleotide sequences via homologous recombination, see, 
e.g., U.S. Patent No. 5,641,670, issued June 24, 1997; International Publication No. WO 96/2941 1, 
published September 26, 1996; International Publication No. WO 94/12650, published August 4, 
1994; Koller el a/., (1989); and Zijlstra et al (1989) (the disclosures of each of which are 
incorporated by reference in their entireties). 

1 0 The present invention further relates to a method of making a homologously recombinant 

host cell in vitro or in vivo, wherein the expression of a targeted gene not normally expressed in the 
cell is altered. Preferably the alteration causes expression of the targeted gene under normal growth 
conditions or under conditions suitable for producing the polypeptide encoded by the targeted gene. 
The method comprises the steps of: (a) transfecting the cell in vitro or in vivo with a polynucleotide 

1 5 construct, said polynucleotide construct comprising; (i) a targeting sequence; (ii) a regulatory 
sequence and/or a coding sequence; and (iii) an unpaired splice donor site, if necessary, thereby 
producing a transfected cell; and (b) maintaining the transfected cell in vitro or in vivo under 
conditions appropriate for homologous recombination. 

The present invention further relates to a method of altering the expression of a targeted 

20 gene in a cell in vitro or in vivo wherein the gene is not normally expressed in the cell, comprising 
the steps of: (a) transfecting the cell in vitro or in vivo with a polynucleotide construct, said 
polynucleotide construct comprising: (i) a targeting sequence; (ii) a regulatory sequence and/or a 
coding sequence; and (iii) an unpaired splice donor site, if necessary, thereby producing a 
transfected cell; and (b) maintaining the transfected cell in vitro or in vivo under conditions 

25 appropriate for homologous recombination, thereby producing a homologously recombinant cell; 
and (c) maintaining the homologously recombinant cell in vitro or in vivo under conditions 
appropriate for expression of the gene. 

The present invention further relates to a method of making a polypeptide of the present 
invention by altering the expression of a targeted endogenous gene in a cell in vitro or in vivo 

30 wherein the gene is not normally expressed in the cell, comprising the steps of: a) transfecting the 
cell in vitro with a polynucleotide construct, said polynucleotide construct comprising: (i) a 
targeting sequence; (ii) a regulatory sequence and/or a coding sequence; and (iii) an unpaired splice 
donor site, if necessary, thereby producing a transfected cell; (b) maintaining the transfected cell in 
vitro or in vivo under conditions appropriate for homologous recombination, thereby producing a 

35 homologously recombinant cell; and c) maintaining the homologously recombinant cell in vitro or 
in vivo under conditions appropriate for expression of the gene thereby making the polypeptide. 

The present invention further relates to a polynucleotide construct which alters the 
expression of a targeted gene in a cell type in which the gene is not normally expressed. This 
occurs when the polynucleotide construct is inserted into the chromosomal DNA of the target cell. 
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wherein said polynucleotide construct comprises: a) a targeting sequence; b) a regulatory sequence 
and/or coding sequence; and c) an unpaired splice-donor site, if necessary. Further included are a 
polynucleotide construct, as described above, wherein said polynucleotide construct further 
comprises a polynucleotide which encodes a polypeptide and is in-frame with the targeted 
5 endogenous gene after homologous recombination with chromosomal DNA. 

The compositions may be produced, and methods performed, by techniques known in the 
art, such as those described in U.S. Patent NOs: 6,054,288; 6,048,729; 6,048,724; 6,048,524; 
5,994,127; 5,968,502; 5.965,125; 5,869,239; 5,817,789; 5,783,385; 5,733,761; 5,641,670; 
5,580,734 ; International Publication NOs:W096/2941 1, WO 94/12650; and scientific articles 
1 0 described by Koller et al.j ( 1 994). (The disclosures of each of which are incorporated by reference 
in their entireties). 

GENSET gene expression in mammalian cells, preferably human cells, may be rendered 
defective, or alternatively may be altered by replacing endogenous GENSET polypeptide-encoding 
genes in the genome of an animal cell by a GENSET polypeptide-encoding polynucleotide 

15 according to the invention. These genetic alterations may be generated by homologous 
recombination using previously described specific polynucleotide constructs. 

Mammal zygotes, such as murine zygotes may be used as cell hosts. For example, murine 
zygotes may undergo microinjection with a purified DNA molecule of interest, for example a 
purified DNA molecule that has previously been adjusted to a concentration ranging from 1 ng/ml - 

20 for BAG inserts- to 3 ng/^l -for PI bacteriophage inserts- in 10 mM Tris-HCI, pH 7.4, 250 \iM 
EDTA containing 100 mM NaCl, 30 ^M spermine, and70 \xM spermidine. When the DNA to be 
microinjected has a large size, polyamines and high salt concentrations can be used in order to avoid 
mechanical breakage of this DNA, as described by Schedl et c^/ (1993b), which disclosure is hereby 
incorporated by reference in its entirety, 

25 Any one of the polynucleotides of the invention, including the polynucleotide constructs 

described herein, may be introduced in an embryonic stem (ES) cell line, preferably a mouse ES 
cell line. ES cell lines are derived from pluripotent, uncommitted cells of the inner cell mass of pre- 
implantation blastocysts. Preferred ES cell lines are the following: ES-E14TG2a (ATCC No.CRL- 
1 821), ES-D3 (ATCC No.CRL1934 and No. CRL-1 1632), YSOOl (ATCC No. CRL-1 1776), 36.5 

30 (ATCC No. CRL-1 1116). ES cells are maintained in an uncommitted state by culture in the 
presence of growth-inhibited feeder cells which provide the appropriate signals to preserve this 
embryonic phenotype and serve as a matrix for ES cell adherence. Preferred feeder cells are 
primary embryonic fibroblasts that are established from tissue of day 13- day 14 embryos of 
virtually any mouse strain, that are maintained in culture, such as described by Abbondanzo eial. 

35 (1993) and are growth-inhibited by irradiation, such as described by Robertson (1987), or by the 
presence of an inhibitory concentration of LIF, such as described by Pease and Williams (1 990), 
which disclosures are hereby incorporated by reference in their entireties. 

The constructs in the host cells can be used in a conventional manner to produce the gene 
product encoded by the recombinant sequence. 
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Following transformation of a suitable host and growth of the host to an appropriate cell 
density, the selected promoter is induced by appropriate means, such as temperature shift or 
chemical induction, and cells are cultivated for an additional period. Cells are typically harvested 
by centrifugation, disrupted by physical or chemical means, and the resulting crude extract retained 
5 for further purification. Microbial cells employed in the expression of proteins can be disrupted by 
any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of 
ceil lysing agents. Such methods are well known by the skilled artisan. 

Transgenic Animals 

The terms "transgenic animals" or "host animals" are used herein to designate animals that 
1 0 have their genome genetically and artificially manipulated so as to include one of the nucleic acids 
according to the invention. Preferred animals are non-human mammals and include those 
belonging to a genus selected from Mus (e.g. mice), Rauus (e.g. rats) and Oryctogalus (e.g. rabbits) 
which have their genome artificially and genetically altered by the insertion of a nucleic acid 
according to the invention. In one embodiment, the invention encompasses non-human host 
1 S mammals and animals comprising a recombinant vector of the invention or a GENSET gene 
disrupted by homologous recombination with a knock out vector. 

The transgenic animals of the invention all include within a plurality of their cells a cloned 
recombinant or synthetic DN A sequence, more specifically one of the purified or isolated nucleic 
acids comprising a GENSET polypeptide coding sequence, a GENSET polynucleotide regulatory 
20 sequence, a polynucleotide construct, or a DNA sequence encoding an antisense polynucleotide 
such as described in the present specification. 

Generally, a transgenic animal according the present invention comprises any of the 
polynucleotides, the recombinant vectors and the cell hosts described in the present invention. In a 
first preferred embodiment, these transgenic animals may be good experimental models in order to 
2S study the diverse pathologies related to the dysregulation of the expression of a given GENSET 
gene, in particular the transgenic animals containing within their genome one or several copies of an 
inserted polynucleotide encoding a native GENSET polypeptide, or alternatively a mutant GENSET 
polypeptide. 

In a second preferred embodiment, these transgenic animals may express a desired 
30 polypeptide of interest under the control of the regulatory polynucleotides of the GENSET gene, 

leading to high yields in the synthesis of this protein of interest, and eventually to tissue specific 

expression of the protein of interest. 

The design of the transgenic animals of the invention may be made according to the 

conventional techniques well known from the one skilled in the art. For more details regarding the 
35 production of transgenic animals, and specifically transgenic mice, it may be referred to US Patents 

Nos 4,873,191, issued Oct. 10, 1989; 5,464,764 issued Nov 7, 1995; and 5,789,215, issued Aug 4, 

1998; these documents being herein incorporated by reference to disclose methods producing 

transgenic mice. 
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Transgenic animals of the present invention are produced by the application of procedures 
which result in an animal with a genome that has incorporated exogenous genetic material. The 
procedure involves obtaining the genetic material which encodes either a GENSET polypeptide 
coding sequence, a GENSET polynucleotide regulatory sequence, or a DNA sequence encoding a 
5 GENSET polynucleotide antisense sequence, or a portion thereof, such as described in the present 
specification. A recombinant polynucleotide of the invention is inserted into an embryonic or ES 
stem cell line. The insertion is preferably made using electroporation, such as described by Thomas 
et al (1987), which disclosure is hereby incorporated by reference in its entirety. The cells 
subjected to electroporation are screened (e.g. by selection via selectable markers, by PGR or by 
10 Southern blot analysis) to find positive cells which have integrated the exogenous recombinant 
polynucleotide into their genome, preferably via an homologous recombination event. An 
illustrative positive-negative selection procedure that may be used according to the invention is 
described by Mansour et al. (1988), which disclosure is hereby incorporated by reference in its 
entirety. 

15 The positive cells are then isolated, cloned and injected into 3,5 days old blastocysts from 

mice, such as described by Bradley (1987), which disclosure is hereby incorporated by reference in 
its entirety. The blastocysts are then inserted into a female host animal and allowed to grow to 
term. Alternatively, the positive ES cells are brought into contact with embryos at the 2.5 days old 
8-16 cell stage (morulae) such as described by Wood et al, (1993), or by Nagy et al. (1993), which 

20 disclosures are hereby incorporated by reference in their entireties, the ES cells being internalized to 
colonize extensively the blastocyst including the cells which will give rise to the germ line. 

The offspring of the female host are tested to determine which animals are transgenic e.g. 
include the inserted exogenous DNA sequence and which ones are wild type. 

Thus, the present invention also concerns a transgenic animal containing a nucleic acid, a 

25 recombinant expression vector or a recombinant host cell according to the invention. 

In another embodiment, transgenic animals are produced by microinjecting polynucleotides 
ares microinjected into a fertilized oocyte. Typically, fertilized oocytes are microinjected using 
standard techniques, and then cultured in vitro until a "pre-implantation embryo" is obtained. Such 
pre-implantation embryos preferably contain approximately 16 to 150 cells. Methods for culturing 

30 fertilized oocytes to the pre-implantation stage are described, e.g., by Gordon et al. ((1984) Methods 
in Enzymology, 101,414); Hogan et al. ((1986) in Manipulating the mouse embryo. A Laboratory 
Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y) (for the mouse embryo); 
Hammer et al. ((1985) Nature, 31 5, 680) (for rabbit and porcine embryos); Gandolfi et al. ((1987) J. 
Reprod. Pert. 81, 23-28); Rexroad et al. ((1988) J. Anim. Sci. 66, 947-953) (for ovine embryos); and 

35 Eyestone et al. ((1989) J. Reprod. Pert. 85, 715-720); Camous et al. ((1984) J. Reprod. Pert. 72, 
779-785); and Heyman et al. ((1987) Theriogenology 27, 5968) (for bovine embryos); the 
disclosures of each of which are incorporated herein in their entireties. Pre-implantation embryos 
are then transferred to an appropriate female by standard methods to permit the birth of a transgenic 
or chimeric animal, depending upon the stage of development when the transgene is introduced. 
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As the frequency of transgene incorporation is often low, the detection of transgene 
integration in pre-implantation embryos is often desirable using any of the herein-described 
methods. Any of a number of methods can be used to detect the presence of a transgene in a pre- 
implantation embryo. For example, one or more cells may be removed from the pre-implantation 
5 embryo, and the presence or absence of the transgene in the removed cell or cells can be detected 
using any standard method e.g. PGR. Alternatively, the presence of a transgene can be detected in 
utero or post partum using standard methods. 

In a particularly preferred embodiment of the present invention, transgenic mammals are 
generated that secrete recombinant GENSET polypeptides in their milk. As the mammary gland is 

10 a highly efficient protein-producing organ, such methods can be used to produce protein 

concentrations in the gram per liter range, and often significantly more. Preferably, expression in 
the mammary gland is accomplished by operably linking the polynucleotide encoding the GENSET 
polypeptide to a mammary gland specific promoter and, optionally, other regulatory elements. 
Suitable promoters and other elements include, but are not limited to, those derived from 

15 mammalian short and long WAP, alpha, beta, and kappa, casein, alpha and beta lactoglobulin, beta- 
CN 5' genes, as well as the the mouse mammary tumor virus (MMTV) promoter. Such promoters 
and other elements may be derived from any mammal, including, but not limited to, cows, goats, 
sheep, pigs, mice, rabbits, and guinea pigs. Promoter and other regulatory sequences, vectors, and 
other relevant teachings are provided, e.g., by Clark (1998) J Mammary Gland Biol Neoplasia 

20 3:337-50; Jost et al. (1999) Nat. Biotechnol 17:160-4; U.S. Patent Nos. 5,994,616; 6,140,552; 
6,013,857; Sohn et al. (1999) DNA Cell Biol. 1 8:845-52; Kim et al. (1999) J. Biochem. (Japan) 
126:320-5; Soulier et al. (1999) Euro. J. Biochem. 260:533-9; Zhang et al. (1997) Chin, J. Biotech. 
13:271-6; Rijnkels et al. (1998) Transgen, Res. 7:5-14; Korhonen et al. (1997) Euro. J. Biochem. 
245:482-9; Uusi-Oukari et al. (1997) Transgen. Res. 6:75-84; Hitchin et al. (1996) Prot. Expr. Purif. 

25 7:247-52; Platenburg et al. (1994) Transgen. Res. 3:99-108; Heng-Cherl et al. (1993) Animal 
Biotech. 4:89-107; and Christa et al. (2000) Euro. J. Biochem. 267:1665-71; the entire disclosures 
of each of which is herein incorporated by reference. 

In another embodiment, the polypeptides of the invention can be produced in milk by 
introducing polynucleotides encoding the polypeptides into somatic cells of the mammary gland in 

30 vivo, e.g. mammary secreting epithelial cells. For example, plasmid DNA can be infused through 
the nipple canal, e.g. in association with DEAE-dextran (see, e.g.. Hens et al. (2000) Biochim. 
Biophys. Acta 1523:161-171), in association with a ligand that can lead to receptor-mediated 
endocytosis of the construct (see, e.g., Sobolev et al. (1998) 273:7928-33), or in a viral vector such 
as a retroviral vector, e.g. the Gibbon ape leukemia virus (see, e.g.. Archer et al. (1994) PNAS 

35 91 :6840-6844). In any of these embodiments, the polynucleotide may be operably linked to a 
mammary gland specific promoter, as described above, or, alternatively, any strongly expressing 
promoter such as CMV or MoMLV LTR. 

The suitability of any vector, promoter, regulatory element, etc. for use in the present 
invention can be assessed beforehand by transfecting cells such as mammary epithelial cells, e.g. 
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MacT cells (bovine mammary epithelial cells) or GME cells (goat mammary epithelial cells), in 
vitro and assessing the efficiency of transfection and expression of the transgene in the cells. 

For in vivo administration, the polynucleotides can be administered in any suitable 
formulation, at any of a range of concentrations (e.g. 1-500 ^g/ml, preferably 50-100 fig/ml), at any 
5 volume (e.g. 1 -1 00 ml, preferably 1 to 20 ml), and can be administered any number of times (e.g. 1 , 
2, 3, 5, or 1 0 times), at any frequency (e.g. every 1 , 2, 3, 5, 1 0, or any number of days). Suitable 
concentrations, frequencies, modes of administration, etc. will depend upon the particular 
polynucleotide, vector, animal, etc., and can readily be determined by one of skill in the art. 

In a preferred embodiment, a retroviral vector such as as Gibbon ape leukemia viral vector 

10 is used, as described in Archer et al. ((1994) PNAS 91:6840-6844). As retroviral infection typically 
requires cell division, cell division in the mammary glands can be stimulated in conjunction with 
the administration of the vector, e.g. using a factor such as estrodiol benzoate, progesterone, 
reserpine, or dexamethasone. Further, retroviral and other methods of infection can be facilitated 
using accessory compounds such as polybrene, 

15 In any of the herein-described methods for obtaining GENSET polypeptides from milk, the 

quantity of milk obtained, and thus the quantity of GENSET polypeptides produced, can be 
enhanced using any standard method of lacation induction, e.g. using hexestrol, estrogen, and/or 
progesterone. 

The polynucleotides used in such embodiments can either encode a full-length GENSET 
20 protein or a GENSET fragment. Typically, the encoded polypeptide will include a signal sequence 
to ensure the secretion of the protein into the milk. 

Recombinant Cell Lines Derived From The Transgenic Animals Of The Invention: 

A further object of the invention comprises recombinant host cells obtained from a 
25 transgenic animal described herein. In one embodiment the invention encompasses cells derived 
from non-human host mammals and animals comprising a recombinant vector of the invention or a 
GENSET gene disrupted by homologous recombination with a knock out vector. 

Recombinant cell lines may be established in vitro from cells obtained from any tissue of a 
transgenic animal according to the invention, for example by transfection of primary cell cultures 
30 with vectors expressing owc-genes such as SV40 large T antigen, as described by Chou (1989), and 
Shay et oL (1991), which disclosures are hereby incorporated by reference in their entireties. 

Uses of pol ypeptides of the invention 

The polypeptides and polynucleotides of the present invention can be used in any of a large 
35 number of ways, including numerous in vitro and in vivo uses. Specific uses for many of the 
herein-described polypeptides and polynucleotides are described in detail below. 
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Protein of SEP lDNO:255 (internal designation S00762786 2S5-24-S-Q-A2-R 104^ 

The cDNA of clone 500762786 J55-24-5-0-A2-R_l 04 (SEQ ID NO:86) encodes the 
human EDR4 protein 

5 LFPAPAPPPAPAFAPPPKVPSPERSAPRVPLPSPQPSYPFRPAASGGTPPPACLPPAQPCQGSP 
AMNLFRFLGDLSHLLAIILLLLKIWKSRSCAAHPQLPLSFCLSVCLSVSLSLSXSLSLSFSVSK 
KKKK (SEQ ID NO:255). It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID NO:86 and polypeptides of SEQ ID NO:355, described throughout the 
present application also pertain to the human cDNA of clone 500762786_255-24-5-0-A2-R_104 
10 and polypeptides encoded thereby. Polypeptide fragments having a biological activity described 
herein and polynucleotides encoding the same are included in the present invention. Related 
polynucleotide and polypeptide sequences included in the present invention are SEQ ID NOs:406 
and 520. 

The normal functioning of the eukaryotic cell requires that all newly synthesized proteins 

1 5 be correctly folded, modified, and delivered to specific inter and extracellular sites. Newly 
synthesized membrane and secretory proteins enter a cellular sorting and distribution network 
during or immediately after synthesis (cotranslationally or posttranslationally) and are routed to 
specific locations inside and outside of the cell. The initial compartment in this process is the 
endoplasmic reticulum (ER) where proteins undergo modifications such as giycosylation, disulfide 

20 bond formation, and assembly into oligomers. The proteins are then transported through an 
additional series of membrane-bound compartments which include the various cistemae of the 
Golgi complex, where further carbohydrate modifications occur. Transport between compartments 
occurs by means of vesicles that bud and fuse in a specific manner; once within the secretory 
pathway, proteins do not have to cross a membrane to reach the cell surface. 

25 The complexity of this system has advantages for the cell because it allows proteins to fold 

and mature in closed compartments that contain the appropriate enzyme catalysts. It is, however, 
dependent on sorting mechanisms that position the enzymes correctly and maintain them in place. 

The first organelle in this system, the ER, contains multiple enzymes involved in protein 
structure modifications. Among these are BiP (binding protein) which directs the correct folding of 

30 proteins and, PDl (protein disulfide isomerase) and a homologue of the 90 kDa heat-shock protein, 
both of which catalyze the formation and rearrangement of disulfide bonds (Gething, M. J. and 
Sambrook, J. (1992) Nature 355:33-45). These abundant soluble proteins must be retained in the ER 
and must be distinguished from the newly synthesized secretory proteins which are rapidly 
transported to the Golgi apparatus. The signal for retention in the ER in mammalian cells consists of 

35 the tetrapeptide sequence, KDEL, located at the carboxy terminus of proteins. This sequence was 
first identified when the sequences of rat BiP and PDI were compared and it was subsequently 
found at the carboxy terminus of other luminal ER proteins from a number of species (Munro, S. 
(1986) Cell 46:291-300; Pelham, H. R. (1989) Ann. Rev. Cell. Biol. 5:1-23). Proteins containing 
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this sequence leave the ER but are quickly retrieved from the early Golgi compartment and returned 
to the ER, while proteins without this signal continue through the distribution pathway. 

Two endoplasmic retrieval receptors were first identified in S. cerevesiae; two human 
endoplasmic retrieval receptors were subsequently isolated by the use of degenerate PCR primers 
5 based on the S. cerevesiae sequences (Hardwick, K. G. (1990) EMBO J. 9:623-630; Semenza, J. C. 
(1990) Cell 61:1349-1357; Lewis, M, J. and Pelham, H. R. (1990) Nature 348:162-163; Lewis, M. 
J. and Pelham, H. R. (1992) J. Mol. Biol. 226:913-916). Comparisons of these sequences shows that 
they consist of a conserved 7-transmembrane domain structure with only short loops in the cell 
cytoplasm and the ER lumen. Studies with these endoplasmic retrieval receptors show that ligand 

10 binding controls the movement of the receptor; when expressed in COS cells, the human receptor is 
normally concentrated in the Golgi, but moves to the ER when bound to a ligand such as KDEL- 
tagged hen lysozyme (Lewis, M. J. and Pelham, H. R. (1992) Cell 68:353-364). 

The ER retrieval function of these molecules serves to maintain the pool of enzymes in the 
ER that are necessary to perform protein structure modifications, retains newly synthesized proteins 

1 5 in the ER until they have been correctly modified, and regulates the structure of the Golgi 

apparatus. Saccharomyces cerevisiae cells that lack an ER retrieval receptor (Erd2) have a defective 
Golgi apparatus and fail to grow. Analysis of yeast Erd2 mutants suggests that their growth requires 
both the retention of multiple proteins in the ER and the selective removal of specific proteins from 
the Golgi (Townsley, F. M. (1994) J. Cell Biol. 127:21-28). Overexpression of a human ER 

20 retrieval receptor in COS cells results in hyperactive retrograde traffic from the Golgi to the ER 
leading to a loss of the Golgi structure and the breakdown of the secretory pathway (Hsu V. W. 
(1992) Cell 69:625-635). 

Disruptions in the cellular secretory pathway have been implicated in several human 
diseases. In familial hypercholesterolemia the low density lipoprotein receptors remain in the ER, 

25 rather than moving to the cell surface (Pathak, R. K. (1988) J. Cell Biol. 106:1831-1841). A form of 
congenital hypothyroidism is produced by a deficiency of thyroglobulin, the thyroid prohormone. In 
this disease the thyroglobulin is incorrectly folded and is therefore retained in the ER (Kim, P. S. 
(1996) J.Cell Biol. 133:5 17-527). Mutant forms of proteolipid protein (PLP) have been examined as 
they play a role in generating dysmyelinating or hypomyelinating diseases. In this case, the 

30 mutations that result in disease are mutations that arrest transport of PLP in the ER and the early 
Golgi; the subsequent accumulation of PLP in the ER results in rapid oligodendrocyte death (Gow, 
A. (1994) J.Neurosci. Res. 37:574-583). 

The human ER retrieval receptor function is necessary for processing and presentation of 
specific antigens to T cells. Many antigens must be processed intracellularly before they can be 

35 presented, in association with major histocompatability complex (MHC) molecules at the cell 
surface, for recognition by the antigen-specific receptor of T cells. Disruption of the ER retrieval 
receptor function with an antibiotic, Brefeldin A, abolishes the ability of a cell to present these 
specific antigen complexes to T cells. These antigenic proteins must be retained in the ER for 
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cleavage to smaller peptides which can then bind to MHC molecules and be released for 
presentation at the cell surface. (Kakiuchi, T. (1991) J. Immunol. 147:3289-3295). 

The discovery of polynucleotides encoding a novel human KDEL receptor, and the 
molecules themselves, provides the means to further investigate the regulation of the cellular 
5 protein secretory pathway. Discovery of molecules related to a novel human KDEL receptor 
satisfies a need in the art by providing a means or a tool for the study of this pathway and the 
diseases that involve the dysfunction of this pathway. 

In an embodiment of the present invention, ERD4 polypeptides of the present invention are 
used to purify KDEL containing proteins and other homologous proteins with similar signals for ER 

10 retention such as "HDEL", "DDEL", "ADEL", »*SDEL", "RDEL", "KEEL", "QEDL", "HIEL", 
"HTEL" and "KQDL". This may be carried out by covalently or non-covalently attached the EDR4 
polypeptides of the present invention to a column or other solid support using techniques well 
known in the art (e.g., affinity chromatography, panning, etc.). Once bound to the ERD4 
polypeptide, the complex is washed to remove contaminants. The target protein is released using 

15 increasing salt concentrations either in a gradient or step type purification. The bound target protein 
may also be released from the ERD4 polypeptide by a single step up in salt concentration. 

In another embodiment of the present invention, the EDR4 polypeptides of the present 
invention are used to detect KDEL containing proteins and other homologous sequences as 
described above by methods comprising the steps of contacting KDEL or other homologous 

20 sequences with an EDR4 polypeptide under conditions that allow binding to said sequence, and 
detecting the presence of bound EDR4. The presence of bound EDR4 can be detected using 
methods known in the art, such as by labeling EDR4 directly or indirectly. Bound EDR4 can be 
detected, for example, by using an antibody that specifically binds to EDR4 or another EDR4 - 
binding compound that is detectable directly or indirectly. 

25 Preferred ERD4 polypeptides for binding KDEL containing proteins and other homologous 

sequences described above comprise the amino acid sequence -KIWK- or - 
MNLFRFLGDLSHLLAIILLLLKIWKSRSCA-. 

The present invention is further directed to a transformant comprising the following 
expression units in a co-expressible state: an expression unit containing a gene coding for an ERD4 

30 polypeptide which is capable of binding to a protein localizing in the endoplasmic reticulum and 
having a signal for staying therein; an expression unit containing a gene coding for said protein 
localizing in endoplasmic reticulum; and an expression unit containing a foreign gene coding for a 
polypeptide which is a subject of function of said protein localizing in endoplasmic reticulum, and 
to a transformant comprising, in a co-expressible state, a fusion gene which is composed of a DNA 

35 fragment coding for a human serum albumin prepro-sequence and a foreign gene coding for a useful 
polypeptide. The present invention is also directed to a process for producing said polypeptide by 
co-expressing said genes in said transformant such that the polypeptide is predominantly secreted 
out of the transformant cell. Consequently, the invention has an advantage of improving the 
productivity of said polypeptide. 
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More particularly, the invention relates to: A transformed yeast cell comprising the 
following expression units integrated on a yeast chromosome in a co-expressible state: a first 
expression unit containing a gene coding for a receptor for an endoplasmic reticulum retention 
signal, wherein the receptor is the receptor protein ERD4 or a fragment thereof which is capable of 
5 binding to a retention signal selected from the group consisting of "KDEL", "HDEL", "DDEL", 
"ADEL". "SDEL", "RDEL", "KEEL", "QEDL", "HIEL", "HTEL" and "KQDL". and a second 
expression unit containing a gene encoding a protein disulfide isomerase, wherein said isomerase 
comprises an endoplasmic reticulum retention signal, or a gene encoding a fusion protein 
comprising the amino acid sequence of said isomerase and a human serum albumin prepro- 
10 sequence. These methods can be carried out using methods known in the art or described in U.S. 
Patent 5,578,466, incorporated herein by reference in its entirety. 



Proteins of SEP !DNO:193 and 194 fintemal designation 585770 215-16-5-0-E8-F and 
123996 140-002-5-0-B4-F^ 

15 The cDNA of clones 585770^21 5- 16-5-0-E8-F (SEQ ID NO:24) and 123996^140-002-5-0- 

B4-F (SEQ ID NO:25) encode the human Smooth Muscle and Pain Effector (SMPE) proteins: 
MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGRXGEEC 
HPGSHKIPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF (SEQ ID NO: 193) and 
MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEEC 

20 HPGSHKIPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF (SEQ ID NO: 1 94), 

respectively. It will be appreciated that all characteristics and uses of the polynucleotides of SEQ 
ID NOs:24 and 25 and polypeptides of SEQ ID NO; 193 and 194, described throughout the present 
application also pertain to the human cDNA of clones 585770^21 5- 16-5-0-E8-F and 123996^1 40- 
002-5-0-B4-F, and the polypeptides encoded thereby. Polypeptide fragments having a biological 

25 activity described herein and polynucleotides encoding the same are also included in the present 
invention. Related polynucleotide and polypeptide sequences included in the present invention are 
SEQ IDNOs:360 and 447. 

SMPE contracts longitudinal ileal muscle and distal colon, and relaxes the proximal colon. 
SMPE binds with a high affinity to both ileum and brain membranes. Therefore, included as 

30 embodiments of the present invention is a method of causing gastrointestinal smooth muscle cells to 
contract, in vitro or in vivo, comprising the steps of contacting said cells with a contracting effective 
amount of an SMPE polypeptide. Preferrably, the gastrointestinal smooth muscle cells are those of 
the longitudinal ileal or distal colon. A further embodiments of the present invention is a method of 
causing gastrointestinal smooth muscle cells to relax comprising the steps of contacting said cells 

35 with a relaxing effective amount of an SMPE polypeptide. Preferrably, the gastrointestinal smooth 
muscle cells are proximal colon cells. SMPE can also be used in the same manner to contract 
uterine cells. Therefore, included in the present invention is a method of causing uterine smooth 
muscle cells to contract comprising contacting said cells with a contracting effective amount of an 
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SMPE polypeptide. Further included in the present invention is a method of causing smooth muscle 
cells (e.g., bladder, vascular) to contract comprising contacting said cells with a contracting 
effective amount of an SMPE polypeptide. Further included in the present invention is a method of 
inhibiting angiogenesis comprising contacting vascular endothelial cells with an angiogenesis 
5 inhibiting effective amount of an SMPE polypeptide. The SMPE anti-angiogenic affect can be 
measured using assays known in the art. For example, the anti-angiogenic effect in vivo can be 
assayed by using the lO-day-old embryo chick chorioallantoic membrane model 

SMPE binds with a high affmity to both ileum and brain membranes. Thefore, as a further 
embodiment of the present invention is a method of binding an SMPE polypeptide to ileum or brain 

10 membranes. The method can be further used as a method of detecting ileum or brain membranes 
comprising the steps of contacting ileum or brain membranes with an SMPE polypeptide under 
conditions that allow binding to said membranes, and detecting the presence of SMPE. The 
presence of SMPE can be detected using methods known in the art, such as by labeling SMPE 
directly or indirectly. Bound SMPE can be detected, for example, by using an antibody that 

1 5 specifically binds to SMPE or another SMPE-binding compound that is detectable directly or 
indirectly. 

SMPE is also expressed in spermatocytes. Therefore, a further embodiment of the present 
invention is a method of detecting testes or spermatocytes by detecting an SMPE polypeptide or 
nucleic acid. An SMPE polypeptide can be detected using anti-SMPE antibodies or other SMPE- 

20 binding compounds. SMPE polynucleotides, such as mRNA, can be detected using methods known 
in the art such as PCR (RT*PCR), hybridization (Northern blot analysis), etc. 

SMPE elicits hyperalgesia when it contacts the GNS, e.g., the brain. Therefore, the present 
invention includes a method of causing hyperalgesia comprising contacting the CNS with a 
hyperalgesia effecting amount of an SMPE polypeptide, SMPE can be delivered to the CNS using 

25 methods well known in the art including those described in PCT application WO9906060, 

incorporated herein by reference in its entirety. Using the methods of WO9906060, the TGF-alpha 
or other polypeptide that binds the epidermal growth factor (EGF) receptor, is substituted with an 
SMPE polypeptide of the present invention. 

Further included in the present invention are methods of inhibiting the above SMPE 

30 activities using an inhibitor of SMPE. A preferred inhibitor of SMPE is an anti-SMPE antibody. 
Thus, an embodiment of the present invention is a method of inhibiting smooth muscle contraction 
(bladder, gastrointestional cells, uterine) or pain comprising the step of contacting said cells with an 
effective contractive or pain inhibiting amount of an anti-SMPE antibody or other SMPE inhibitor. 
The invention further relates to a method of screening for test compounds that bind and/or 

35 inhibit an SMPE activity above comprising the steps of contacting an SMPE polypeptide with said 
test compound and detecting or measuring whether said test compound binds said SMPE 
polypeptide. Alternatively, the method comprises the steps of contacting an SMPE polypeptide 
with a binding target (e.g., smooth muscle cells or brain cells) of said SMPE polypeptide in the 
presence of a test compound, and detecting or measuring the binding of the SMPE polypeptide to 
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said binding target, wherein a difference in the amount of said binding in the presence of said test 
compound relative to the amount of binding in the absence of the test compound indicates that the 
test compound modulates, preferably inhibits, the binding of said polypeptide to said binding target. 
The method may alternatively comprise the steps of contacting an SMPE polypeptide with a 
5 binding target in the presence of a test compound, wherein the binding of said SMPE polypeptide 
with said binding target elicits or causes a biological activity (e.g., activities described above) which 
is detected or measured, and further wherein a difTerence in the level of said biological activity in 
the presence of the test compound relative to the amount of biological activity in the absence of the 
test compound indicates that the test compound modulates, preferably inhibits or activates, the 
1 0 biological activity of said SMPE polypeptide. 

Preferred SMPE polypeptides for use in the methods described herein include the amino 
acid sequences - 

AVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEECHPGSHKIPFFRKRKHH- or - 
TGACERDVQCGAGTCCAISLWLRGLRMCT- of SEQ ID NO: 193 or 194. 

15 

Protein of SEQ IDNO:30S (internal designation 500691428 255-2-5-Q-D4-R 104) 

The human cDNA of clone 50069 1428_255-2-5-0-D4-R_l 04 (SEQ ID NO:136) encodes 
the human VESICLE-ASSOCIATED MEMBRANE PROTEIN 10 or VAMP-10 protein: 
MSATAATAPPAAPAGEGGPPAPPPNLTSNRRLQQTQAQVDEWDIMRVNVDKVLERDQKL 

20 SELDDRADALQAGPSQFETSAAKLKRKY WWKNLKMMIILGVICAULIIIIVYFST (SEQ ID 
NO:305). It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO: 136 and polypeptides of SEQ ID NO:305 described throughout the present application also 
pertain to the human cDNA of clone 50069 1428_255-2-5-0-D4-R_ 104 and the polypeptides 
encoded thereby. Polypeptide fragments having a biological activity described herein and 

25 polynucleotides encoding the same are also included in the present invention. Related 

polynucleotide and polypeptide sequences included in the present invention are SEQ ID NOs:432 
and 546. 

VAMP-10 is an integral membrane protein involved in the movement of vesicles from the 
plasmalemma of one ceil, across the synapse, to the plasma membrane of the receptive neuron. This 

30 regulated vesicle trafficking pathway and the endocytotic process may be blocked by the highly 
specific action of clostridial, tetanus toxin (TeTx) and botulinum toxin (BoNT) and other 
metalloendoprotease neurotoxins which prevents neurotransmitter release by cleaving VAMPs. 
VAMP-10 is important in membrane trafficking. It participates in axon extension via exocytosis 
during development, in the release of neurotransmitters and modulatory peptides, and in 

35 endocytosis. The tightly-regulated synaptic vesicle cycle at the nerve terminal consists of the 
formation of synaptic vesicles, the docking of vesicles comprising VAMP-10 to the presynaptic 
plasma membrane, the fusion of these membranes and consequent neurotransmitter release, 
endocytosis of the empty vesicles and the regeneration of fresh vesicles. Endocytotic vesicular 
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transport includes such intracellular events as the fusions and fissions of the nuclear membrane, 
endoplasmic reticulum, Golgi apparatus, and various inclusion bodies such as peroxisomes or 
lysosomes. 

VAMP- 10, like other VAMPs, has a three domain organization. The domains include a 
5 variable proline-rich, N-terminal sequence, a highly conserved central hydrophilic core of amino 
acids, and a hydrophobic sequence of amino acids presumed to be the membrane anchor. 

In one aspect, the invention includes a VAMP- 10 polypeptide composition for use in 
delivering a second composition, preferably nucleic acids, polypeptides, or small molecules such as 
therapeutic drugs, to target biological cells either in vitro or in vivo. The composition comprises a 

10 VAMP-10 polypeptide as a first molecule and a second molecule. The second molecule may, if 
desirable, be covalently or non-covalently attached or fused to the VAMP- 1 0 polypeptide. The 
VAMP-10 polypeptide composition may further comprise artificial lipids to facilitate delivery of 
the second molecule by lipisomes or lipid vesicles. Methods for using VAMP-10 polypeptides in 
these methods are known in the art and include U.S. Patents 6,074,844, 6,203,794 and 6,099,857, 

1 5 incorporated by reference in their entireties. In a preferred embodiment, VAMP- 1 0 polypeptides 
are used to faciliate delivery of a second composition, e.g., lipisome mediated DNA transfection, to 
cells in culture, preferably neuronal cells, and fiirther preferably to the presynaptic membrane. 

VAMP-10 polypeptides are also useful in methods of inhibiting the release of 
neurotransmitters by preventing the docking and/or fusing of a presynaptic vesicle to the 

20 presynaptic membrane. These polypeptides may be referred to as excitation-secretion uncoupling 
peptides (ESUPs). Fragments of VAMP-10 having this blocking activity can be identified using 
methods knov^n in the art (See e.g., U.S. Patents 6,090,63 1 and 6,169,074 incorporated by reference 
in their entireties). ESUPs of the present invention comprise synthetic and purified VAMP-10 
peptide fragments which correspond in primary structure to peptides which serve as binding 

25 domains for the assembly of a ternary protein complex ("docking complex") which is critical to 
neuronal vesicle docking with the cellular plasma membrane prior to neurotransmitter secretion. 
Preferably, the primary sequence of the ESUPs of the invention also includes amino acids which are 
identical in sequence to the VAMP-10 peptide products of BoTx and TeTx proteolytic cleavage in 
neuronal cells, or fragments thereof ("proteolytic products"). For optimal activity, ESUPs of the 

30 invention have a minimum length of about 20 amino acids and a maximal length of about 28 amino 
acids, although they may be larger or smaller. Preferably, the ESUPs correspond in primary 
structure to binding domains in the docking complex, most preferably the region of such binding 
domains that are involved in the formation of a coiled-coil structure in the native docking complex ' 
proteins. ESUPs may also be used as pharmaceutical carriers as part of fusion proteins to deliver 

35 substances of interest into neural cells in a targeted manner. Preferred VAMP-10, or ESUP, 
polypeptides for use in inhibiting the release of neurotransmitters include those comprising - 
NRRLQQTQAQVDEWDIMRVNVDKVLERDQKLSELDDRADALQAGPSQFETSAAKLKRK 
- of SEQ ID NO:305. More preferred ESUP polypeptides comprise an amino acid sequence portion 
of SEQ ID NO:305 selected from the group consisting of: RVNVDKVLERDQKLSELDD; 
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KVLERDQKLSELDDRA; VNVDKVLERDQKLSELDDRA; 

DIMRV>JVDKVLERDQKLSELDDRADAL; DEVVDIMRVNVD; QAQVDEVVDIMRVNVD; 
LQQTQAQVDEVVDIMRVNVD; QQTQAQVDEVVD; NRRLQQTQAQVDEVVD; and 
NLTSNRRLQQTQAQVDEVVD. 
5 The ESUPs above may be used to inhibit or treat pain according to U.S. Patent 6,1 13,915 or 

5,989,545 (incorporated by reference herein in their entireties) by substituting the polypeptides of 
the present invention for BoTx type A. 

Because VAMP- 10 is a component of vesicles, antibodies to VAMP- 10 are useful in the 
detection of vesicles, preferably neuronal vesicles transporting neurotransmitters. VAMP-10 can be 

10 used during purification of vesicles as a marker for vesicles or vesicles can be detected using 

antibodies to VAMP-10 in assays such as immunohistochemistry. Following exocytosis of vesicles, 
a portion of the VAMP-10 inserted in the vesicle appears on the surface of the axon, thus making 
VAMP-10 useful for the detection and monitoring of exocytosis of synaptic vesicles. 

Detection of VAMP-10 expression (mRNA or protein) levels or mutated forms of VAMP- 

15 10 is further useful in the determination or diagnosing of whether someone is at risk of developing 
or has a neurological disorder, such as mood disorders selected from depression, bipolar disorder, 
schizophrenia, etc.), wherein a decreased level in expression of VAMP-10, mRNA or protein, as 
compared to an individual without a neurological disorder indicates the individual has the disorder 
or is as risk of having the disorder in the future. 

20 The present invention furdier includes a novel assay system for toxins, such as clostridial, 

tetanus toxin (TeTx) and botulinum toxin (BoNT), using novel reagents. Preferably, methods of 
U.S. Patent 6,043,042, incorporated by reference in its entirety, are used to perform the assay, 
wherein a VAMP-10 polypeptide is the substrate cleaved by the test compound. More specifically, 
the assay comprises the steps of: 

25 The invention relates to an assay for botulinum toxin or tetanus toxin comprising the steps 

of: 

(a) combining a test compound with a substrate and with antibody, wherein the substrate 
has a cleavage site for the toxin and when cleaved by toxin forms a product, and wherein the 
antibody binds to the product but not to the substrate; and wherein the substrate is a VAMP- 1 0 

30 polypeptide; and 

(b) testing for the presence of antibody bound to the product, which product is attached to a 
solid phase assay component. 

Preferably, in the practice of this invention, the VAMP-10 polypeptide is cleaved by the 
toxin to generate new peptides having N- and C-terminal ends. In addition, the peptide substrate is 
35 attached to a solid phase component of the assay. 

The assay according to the invention may utilize assay components (a) and (b): 
(a) a peptide linked to a solid-phase, the peptide being cleavable by the toxin to generate a 
cleavage product. 
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(b) an antibody that binds to the cleavage product but not to the uncleaved polypeptide or 
an antibody that binds a cleavage product that is either the N-terminal or C-terminal portion of the 
VAMP-10, and the assay may comprise the steps of: 

(i) combining a test compound that may contain or consist of the toxin w^ith the solid-phase 
5 peptide to form an assay mixture, 

(ii) subsequently or simultaneously combining the assay mixture with the antibody, and 

(iii) subsequently or simultaneously determining whether there has been formed any 
conjugate between the antibody and the cleavage product. 

Preferably, the step (i) of the assay is carried out in the presence of a zinc compound and a 
10 VAMP-10 polypeptide. 

In this embodiment, the assay comprises: 

(i) combining the test compound with a solid phase comprising a VAMP-10 polypeptide, 

(ii) washing the test compound from the solid phase, 

(iii) combining the solid phase with an antibody adapted for binding selectively with 
1 5 peptide cleaved by toxin, and 

(iv) detecting a conjugate of the antibody with cleaved peptide. 
In another embodiment, the assay comprises: 

(1) adding a test solution to an assay plate comprising immobilized peptide, the peptide 
being a VAMP-10 polypeptide; 

20 (ii) incubating the assay plate, 

(iii) washing the plate with a buffer, 

(iv) adding to the plate an antibody solution, said solution comprising an antibody adapted 
selectively to bind to a peptide selected from the group consisting of (I) the 50 C-terminal amino 
acid residues SEQ ID NO;305, the 30 C-terminal amino acid residues SEQ ID NO:305, and the 20 

25 C-terminal amino acid residues SEQ ID NO:305 (any other VAMP-10 polypeptide of the present 
invention may also be selected). 

(2) a peptide the N-terminal end of which is selected from the group consisting of: (1) the 50 
N-terminal amino acid residues SEQ ID NO:305, the 30 N-terminal amino acid residues SEQ ID 
NO:305, and the 25 N-terminal amino acid residues SEQ ID NO:305 (any other VAMP-10 

30 polypeptide of the present invention may also be selected). 

(v) incubating the assay plate, 

(vi) washing the plate with a buffer, and 

(vii) measuring the presence of antibody on the assay plate. 

In this embodiment, the antibody may be linked to an enzyme and the presence of antibody 
35 on the plate is measured by adding an enzyme substrate and measuring the conversion of the 
substrate into detectable product. The detectable product may be colored and measured by 
absorbance at a selected wavelength. 

In the practice of the invention, the inactive toxin present in the test compound may be 
converted to active toxin. This may be accomplished by adding a protease to the test compound. 
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The antibody-peptide conjugate may be detected using a further antibody specific to the 
first antibody and linked to an enzyme. 



5 Proteins of SEP ID NO: 171 (internal designation Clone 10:5891 15) and related protein of SEP ID 
NP:457. 

The polynucleotides of SEQ ID N0:2 and SEQ ID NP:340 and polypeptides of SEQ ID 
NO: 171 and 457 encode a C-terminal variant of Apolipoprotein Al, herein referred to as ApoAI- 
CTV. An embodiment of the invention includes compositions of SEQ ID NP:2, 340, 171, and 457 
10 which encode for this novel variant of the apolipoprotein family of lipid transporting proteins. 
Specifically, ApoAJ-CTV is a component of high density lipoprotein which functions to remove 
cholesterol from circulation and thus providing protection against the development of 
atherosclerosis, coronary atherosclerotic lesions and subsequent microvascular and cardiovascular 
disease. 

1 5 Preferred polynucleotides of the invention are compositions of the novel portion of the 

cDNA from bases 465 to 521 of SEQ ID NP:2 including the nucleic acids comprising the 
sequences - 

GCAGCTTTCTTAACTATCCTAACAAGCCTTGGACCAAATGGAAATAAAGCTTTTTGA-,- 
GAAGGCAGCTTTCTTAACTATCCTAACAAGCCTTGGACCAAATGGAAATAAAGCTrTTT 
20 GA-,or- 

AGCTCTACCGCCAGAAGGCAGCTTTCTTAACTATCCTAACAAGCCTTGGACCAAATGG 
AAATAAAGCTTTTTGATGAAAAAA- of SEQ ID NP:2 and 340. 

Preferred polypeptides of the invention are compositions of the novel C-terminal portion 
comprising the amino acid sequence -AAFLTILTSLGPNGNKAF, - 
25 MELYRQKAAFLTILTSLGPNGNKAF, or-QKKWQEEMELYRQKAAFLTILTSLGPNGNKAF 
of SEQ ID NG: 1 7 1 and SEQ ID NP:457. 

Further preferred polypeptides of the invention include the compositions comprising the 
apolipoprotein domain 

KAAVLTLAVLFLTGSQARHFWQQDEPPQSPWDRVKDLATVYVDVLKDSGRDYVS 
30 QFEGSALGKQLNLKLLDNWDSVTSAFSNLREQLGPVTQEXWDNLEKETEGLRQEMSKDLE 
EVKAKVQPYLDDFQKKWQEEMELYRQKAAFLTILTSLGPNGNKA of SEQ ID NO: 1 71 and 
457, or the amino acid residue positions -1 7 to +1 4 1 of SEQ ID NP: 171. 

An embodiment of the invention includes a method for treatment of atherosclerosis or 
cardiovascular diseases, comprising administering to an individual a therapeutically effective 
35 amount of apoAI-CTV or variants or mixtures thereof to lower total plasma cholesterol at least 5% 
of pretreatment levels. 

Further utility of the polypeptides of the present invention may be further confirmed by 
methods of production and use of other apolipoproteins by those skilled in the art or as described by 



105 



wo 02/083898 PCT/IBOl/00914 

Age land et al in US Patent 5990081, which disclosure is hereby incorporated by reference in its 
entirety. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
N0:2 and SEQ ID NO:340 and polypeptides of SEQ ID NO: 1 7 1 and 457 described throughout the 
5 present application also pertain to the human cDNA of clone 5891 15. 

Proteins of SEP ID NO:302 (internal designation Clone ID: 1000853793^ and related protein of 
SEP ID NQ:543. 

MRLFLSLPVLVWLSIVLEGPAPAQGTPDVSSALDKLKEFGNTLEDKARELISRIKQ 
10 SELSAKMREWFSETFQKVKDKLKJDS 

The polynucleotides of SEQ ID NG: 133 and SEQ ID NO:429 encode human apolipoprotein 
CI (ApoCI) polypeptide of SEQ ID NO:302 and SEQ ID NP:543, respectively. The ApoCI of the 
invention differs by 1 amino acid comprising the amino acid sequence FQKVKDKLKI, where 
aspartate (D at position 77 of SEQ ID NP:302) replaces a glutamate (E) of the ApoCI of GENPEP 
15 accession X00570, AF050154, and M20902. ApoCI is a member of the apolipoprotein family of 
lipid binding and transporting proteins specifically functioning to transport cholesterol esters. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO: 133 and SEQ ID NO:429 and polypeptides of SEQ ID NO:302 and SEQ ID NO:543, described 
throughout the present application also pertain to the human cDNA of clone 1000853793, and the 
20 polypeptides encoded thereby. 

Proteins of SEP ID NO:295 (internal designation Clone ID:642948y SEP IP NP:296 (internal 
designation Clone 1D:638743\ and SEP ID NP:S39. 

MEASALTSSAVTSVAKVVRVASGSAVVLPLARIATVVIGGVVAMAAVPMVLSAMGFTAA 
25 GIASSSIAAKMMSAAAIANGGGVASGSLVATLQSLGATGLSGLTKFILGSIGSAIAAVIARFY 

The polynucleotides of SEQ ID NG:126, 127, and 425 and the polypeptides of SEQ ID 
NP:295, 296 and 539 encode human transmembrane, alpha-interferon-inducible polypeptides, 
aINFIP-1, aINFIP-l, and arNFIP-3, respectively. Preferred polynucleotides and polypeptides of the 
invention comprise the nucleic acid sequences of SEQ ID NP:126, 127, and 425. and amino acid 

30 sequences of SEQ ID NO:295, 296 and 539. 

Preferred polypeptides of SEQ ID NP:295 and SEQ ID NP:539 for use in the methods 
described herein include the amino acid sequences comprising - 
VLSAMGFTAAGIASSSIAAKMMSAAAIANGGGVASG-,- 
SSIAAKMMSAAAIANGGGVASGSLVATLQSLGAT-, or - 

35 VIGGVVAMAAVPMVLSAMGFTAAGIASSSIAAKMMSAAAIANGGGVASGSLVATLQSLG 
ATGLSGLTK-. 
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Preferred polypeptides of SEQ ID NO: 296 for use in the methods described herein include 
the amino acid sequence -AAAIANXGGVASGSLVATLQSLGATGLSGLTKF- or - 
LSAMGFTAAG1ASSSIAAKMMSAAAIANXGGVASGSLVATLQSLGATGLS-. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 

5 NOs: 1 26, 1 27, and 425 and the polypeptides of SEQ ID NOs:295, 296 and 539, described 

throughout the present application also pertain to the human cDNA of Clone ID:642948 and Clone 
ID:638743, and the polypeptides encoded thereby. 

Sites of glycine myristylation within a polypeptide function to modulate the activity and 
compartmentalization of the protein (Resh, M. D. Biochim Biophys Acta 1451:1-16 (1999)). 

10 Preferred polypeptides of the invention include fragments comprising the sites of N- 

myristylation. Preferred amino acids of said sites within SEQ ID NO:295 and SEQ ID NO:539 
include GGVVAM (positions 39-44), GIASSS (positions 60-65), GGGVAS (positions 79-84), 
GSLVAT (positions 85-90), and GSIGSX (positions 108-1 13). Further preferred are amino acids 
within 6 residues preceding or 6 residues following said amino acid sequences. Further preferred 

1 5 amino acids include sequences comprising the sites of N-myristylation in the polypeptides of SEQ 
ID NO:296. Preferred amino acids of said sites within SEQ ID NO:296 include 
lATVVIGGVVAMAAVPMV, MGFTAAGIASSSIAAKMM, AAIANXGGVASGSLVATL, 
NXGGVASGSLVATLQSLGA, and LTKFILGSIGSAIAAVIAR. 

Interferons (IFNs) are a part of the group of intercellular messenger proteins known as 

20 cytokines and are part of the body's natural defense to viruses and tumors. Type 1 IFNs (alpha and 
beta interferons) are produced in a variety of cells types and their biosynthesis is stimulated by 
viruses and other pathogens, and by various cytokines and growth factors. Both a- and y-IFNs are 
immunomodulators and anti-inflammatory agents, activating macrophages, T-cells and natural killer 
cells (reviewed in Jonasch and Haluska, Oncologist 6(1):34 (2001)). As part of the body's natural 

25 defense to viruses and tumors, INFs affect the function of the immune system and have direct action 
on pathogens and tumor cells. IFNs mediate these multiple effects by inducing the synthesis of 
cellular proteins, including the polypeptides of the present invention, aINFIP-1, aINFIP-1, and 
aINFIP-3. 

Antiviral activity of the aINFIP polypeptides are assayed according to conventional 
30 methods (Tovey et al, Proc. Soc. Exp. Biol, and Med., 1974 146: 809-81 5). Preferred polypeptides 
of SEQ ID NO:295, 296 and 539 and fragments thereof include those which possess antiviral 
function, where preferred antiviral activity is against herpes simplex virus and hepatitis virus C, 
alone or in combination with known antiviral treatments such as interferon alpha. 

The antitumor activity the alNFlP polypeptides of the invention can be demonstrated by 
35 similar methods using tumor cell lines rather than treatment of cells with virus as used to test 

antiviral activity. Tumor cell lines examples include MCF-7 (human breast cancer derived), NOS-1 
(human oral primary squamous cell carcinoma derived), and MedB-l (human primary mediastinal 
large B-cell lymphoma derived). 
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Further utility of the polypeptides of the present invention may be further confirmed by 
methods of interferon inducible proteins in the inhibition of viral functions such as cell penetration, 
uncoating, RNA and protein synthesis, assembly and release described in Hardman et al., 
Pharmacological Basis of Therapeutics, McGraw-Hill, New York NY pp 121 1-1214, 25 (1996), 
5 disclosure of which is hereby incorporated by reference in its entirety. 

Another embodiment of the present invention relates to the use of aINFlP polypeptides or 
fragments thereof to treat and/or prevent the ill-efTect of bacterial infection. In a preferred 
embodiment, the protein of the invention may be used to counteract the effects of the bacterial 
endotoxin lipopolysaccharide (LPS). The methods for using such compositions is described in 

10 Dziegieiewska and Andersen, Biol. Neonate, 74:372-5 (1998), the disclosure of which is 
incorporated herein by reference in its entirety. 

Furthermore, the aINFIP polypeptides or fragments thereof may be used to identify specific 
molecules with which it binds such as agonists, antagonists or inhibitors. Another embodiment of 
the present invention relates to methods of using the aINFlP polypeptides or fragments thereof to 

15 identify and/or quantify cytokines of the interferon family as well as other cytokines such as ILIO 
and tumor antigens, which may interact with the aINFlP polypeptides of the invention. 

The aINFlP polypeptides of the invention or fragments thereof are included in 
pharmaceutical preparations for treatment, prevention or alleviation of cancers. In another 
embodiment of the present invention, the aINFIP polypeptides of the invention or fragments thereof 

20 are used included in pharmaceutical preparations for treatment, prevention or alleviation of viral or 
bacterial infections. In another embodiment of the present invention, the aINFIP polypeptides of 
the invention or fragments thereof are used to inhibit and/or modulate the effect of cytokines and 
related molecule such as 11-2, TNF alpha, CTLA4, CD28, and others, by preventing the binding of 
the endogenous cytokine to their natural receptors, thereby blocking cell proliferation or inhibitory 

25 signals generated by the ligand-receptor binding event. 

In another embodiment of the present invention, the aINFIP polypeptides of the invention 
or fragments thereof are useful to correct defects in in vivo models of disease such as autoimmune, 
inflammation and tumor models, by injecting the protein either intra peritoneally intravenously, 
subcutaneously or directly in the diseased tissue. 

30 The polynucleotides of SEQ ID NO: 1 26, 1 27, and 425 or fragments thereof is useful in 

diagnostic assays for aINFIP-1, aINFIP-2, or aINFIP-3 gene expression in in vitro models or in 
conditions associated with expression of the aINFlP polypeptides of the invention. The diagnostic 
assay is useful to distinguish between absence, presence, and excess expression of the gene and to 
monitor regulation of levels of the gene of the invention during therapeutic intervention. The DNA 

35 may also be incorporated into effective eukaryotic expression vectors and directly targeted to a 
specific tissue, organ, or cell population for use in gene therapy to treat the above mentioned 
conditions, including tumors and/or to correct disease- or genetic-induced defects in any of the 
above mentioned proteins including the protein of the invention. 
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Protein of SEP ID NO: 170 (internal desienation Clone 1D:5Q2084) and related protein of SEP ID 



NP:456. 

The polynucleotides of SEQ ID NP:339 and polypeptides of SEQ ID NP:456 encode 
neutrophil stimulating protein 2, previously described in WP 9006321 (GENPEP accession 
5 AO 1 3 1 9) as a novel factor having neutrophil-stimulating activity. The polynucleotide of SEQ ID 
NO: 1 encodes a novel polypeptide variant, neutrophil stimulating protein 2v, comprising the amino 
acid sequence of SEQ ID NO: 1 70 in which an aspartate (D) residue is located at position +1 6 of 
SEQ ID NO: 1 70 rather than a glutamate (E). Preferred compositions of the invention include the 
polypeptides of SEQ ID NO: 1 70. Further preferred amino acids of SEQ ID NO: 1 70 comprise the 

10 sequence LAKGKDESLDS, QXKRNLAKGKDESLDSDLYAE, or 

SSTKGQXKRNLAKGKDESLDSDLYAELRCMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIA 
TLKDGRKICLDPDAPRIKKIVQKKLAGDESAD. It will be appreciated that all characteristics 
and uses of the polynucleotides of SEQ ID NO: I and SEQ ID NP:339 and polypeptides of SEQ ID 
NP: 1 70 and SEQ ID NP:456, described throughout the present application also pertain to the 

1 5 human cDNA of clone 502084, and the polypeptides encoded thereby. 

A preferred embodiment of the invention includes use of the novel neutrophil stimulating 
protein 2v of SEQ ID NP: 1 70 in a method to stimulate wound healing by contacting the wound area 
with effective amount of polypeptide of SEQ ID NP: 1 70 or further use as described in US Patent 
5,804,176, which disclosure is hereby incorporated by reference in its entirety. A further preferred 

20 includes use of neutrophil stimulating protein 2v in the enhancement of angiogenesis for 
revascularization after injury such following myocardial infarction, wherein site of injury is 
contacted with effective amount of polypeptide of SEQ ID NP: 1 70 or use as further described in 
US Patent 5,871,723, which disclosure is hereby incorporated by reference in its entirety. 
Antibodies against neutrophil stimulating protein 2v, by preventing or blocking the deposition of 

25 connective tissue matrix, are useful in the treatment of fibrotic disorders by contacting the 

polypeptides of SEQ ID NP: 1 70 with fibrotic tissue, such as in scleroderma, liver cirrhosis, and 
myelofibrosis. 

An embodiment of the invention includes fragments of SEQ ID NP:170 which comprise 
domains which impart function to this cytokine. Preferred fragments include the amino acid 

30 sequence comprising the IL8 domain, 

DSDLYAELRCMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLDPDAPRIKKI 
VQKKL. Further preferred amino acids include the small cytokines (intercrine/chemokine) C-x-C 
subfamily signature of the amino acid sequence comprising 
CMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRXICLD. 

35 Further preferred polypeptides include portions comprising sites of Protein Kinase C 

phosphorylation including amino acid residues 2 to 4, residues 1 3 to 15, residues 36 to 38 and 
residues 97 to 99 of SEQ ID NP: or amino acids sequence comprising SLR, SAR, STK, and TLK. 
Further preferred polypeptides include portions of the amino acid sequence comprising sites of 
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Casein kinase II phosphorylation including amino acid residues 97 to 100 or the amino acid 
sequence comprising TLKD. 

Further preferred polypeptides include portions of the amino acid sequence comprising sites 
of N-myristylation or the amino acid residues comprising GTHCNQ. 
5 Further preferred polypeptides include the small cytokines (intercrine/chemokine) C-x-C 

subfamily signature of the amino acid sequence comprising 
CMCIKTTSGIHPKNIQSLEVIGKGTHCNQVEVIATLKDGRKICLD. 

Proteins of SEP ID NO: 227 (internal designation Clone ID: 1 66601) and related protein of SEP ID 
10 NP: 502. 

Polynucleotides of SEQ ID NP:58 and SEQ ID NP:385 encode the polypeptides of SEQ 
ID NP:227 and SEQ ID NP;502, respectively, with amino acid sequence 

MAAAAVPSLLLSLPPHQGLTFSNKIQPFGAQGVLHPEPGLRDWLLPTCSRQLRVALPEKGS 

EGSLCQTQLPATPCFLPSNTVRT. It will be appreciated that all characteristics and uses of the 
1 5 polynucleotides of SEQ ID NGs:58 and 3 85 and polypeptides of SEQ ID NP:227 and 502, 

described throughout the present application also pertain to the human cDNA of clone 166601, and 

the polypeptides encoded thereby 

The polynucleotides of SEQ ID NP:58 and 385 and polypeptides of SEQ ID NO:227 and 

502 encode a transcriptional regulatory protein. Preferred polynucleotides of the invention include 
20 the nucleic acid sequences comprising Clone 1 6660 1 , the polynucleotides comprising SEQ ID 

NP:58 and the polynucleotides comprising SEQ ID NP:385. Preferred polypeptides of the 

invention include the amino acid sequences derived from the nucleic acid sequence comprising 

Clone 166601, the polypeptides comprising the amino acid sequences of SEQ ID NP:227 and the 

polypeptides comprising the amino acid sequences of SEQ ID NP:502. 
25 In an embodiment of the invention, preferred polypeptides include the portion comprising 

the site of protein kinase C phosphorylation or the amino acid sequences comprising SNK or TVR 

ofSEQIDNP:227and 502. 

In another embodiment, preferred polypeptides of the invention include the portion of the 

amino acid sequence comprising sites of myristylation or the amino acids comprising the sequence 
30 GLTFSN or GSEGSL of SEQ ID NO:227 or 502. 

Proteins of SEP ID NP:268 (internal designation Clone ID:21 1056) and related protein of SEP ID 
NP:530. 

The polynucleotides of SEQ ID NO:99 and SEQ ID NP:416 and polypeptides of SEQ ID 
35 NP:268 and SEQ ID NO:530, respectively, encode a novel human tryptophan hydroxylase, 
including the amino acid sequence hereafter referred to as nhTPH. Tryptophan is taken up by 
active transport into the neurons where it is hydroxylated to 5-hydroxytryptophan (5HTP). The 
latter is then decarboxylated to serotonin, a neurotransmitter involved in central nervous disorders, 
especially mood disorders, sleep disorders, and eating disorders. Activity of the polypeptide of the 
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invention increases production of serotonin levels and increase the metabolism of tryptophan. Thus 
polypeptides of the invention are usefiil in the in vitro production of the serotonin and metabolism 
of tryptophan. As example, an expression vector containing the polynucleotides of SEQ ID NO:99 
or SEQ ID NO:416 can be introduced into a cell line by methods known in the art such as by 
S calcium precipitation; tryptophan can be supplied in the media; and serotonin produced by the cells 
can be extracted by known methods. 

The invention further relates to a method of screening for test compounds that bind hnTOH 
comprising the steps of contacting a hnTOH polypeptide with said test compound and detecting or 
measuring whether said test compound binds said hnTOH polypeptide. The invention further 

1 0 relates to a method of screening for test compounds that activate hnTOH comprising the steps of 
contacting a hnTOH polypeptide with said test compound and detecting or measuring whether said 
test compound activates said hnTOH polypeptide, for example by measuring serotonin production 
or tryptophan depletion. 

Another embodiment includes physiologically acceptable compositions of test compounds 

IS found to increase serotonin production, referred to as activators, in a screen. Further embodiments 
include methods to use activators that have been identified in a screen or previously known in the 
art in the preparation of physiological acceptable formulations for use in in vivo. Further preferred 
are methods to use activators in a physiologically acceptable formulation in the treatment of CNS 
disorders in which tryptophan and serotonin levels are aberrant, particularly depression, anxiety 

20 disorder, bipolar disorder, and eating disorders. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:99 and SEQ ID NO:416 and polypeptides of SEQ ID NO:268 and SEQ ID NO:530, described 
throughout the present application also pertain to the human cDNA of clone 21 10S6, and the 
polypeptides encoded thereby. 

25 

Proteins of SEO ID NO: 190 (internal designation Clone 10:147648) and related protein of SEO ID 
NO:474. 

The polynucleotides of SEQ ID N0:21 and SEQ ID NO:357 and polypeptides of SEQ ID 
NO: 190 and SEQ ID NO:474 encode a novel DNA binding polypeptide containing a leucine zipper 

30 pattern multimerization domain, thereafter referred to as LZP, also known as bZIP transcription 
factor basic domain signature (Hai et al., Genes Dev. 3:2083(1989)). An embodiment of the present 
invention includes the polynucleotides, polypeptides and fragments thereof comprising the 
sequences of SEQ ID N0:2I, 357, 190, and 474 of the invention. Preferred polypeptides of the 
present invention are directed to the amino acid sequences which comprise the leucine zipper 

35 domain selected from the following amino acids of SEQ ID NO: 1 90 and 474 including 
LAAGAVTLGIGFFALASALWFL; PKGFFNYLTYFLAAGAVTLGIG; or 
FFALASALWFLICKRREIFQNS. It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID N0:21 and SEQ IDNO:357 and polypeptides of SEQ ID NO: 1 90 and 
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SEQ ID NO:474, described throughout the present application also pertain to the human cDNA of 
clone 147648, and the polypeptides encoded thereby, 

Leucine-zippers permit dimerization of various cytoplasmic hormone receptors and 
enzymes (Forman, et al., Mol Endocrinol y 3, 1610-1626 (1989)). Leucine zippers are also a 
5 common feature of transcription factors, where they permit homo- or heterodimerization resulting in 
tight binding to DNA strands (for reviews, see Abel, et al.. Nature 341, 24-25 (1989); Jones, et al., 
CelieU 9-1 1 (1990); Lamb, et al., Trends in Biochemical Sciences 16, 417-422 (1991)). Therefore, 
preferred polypeptides of the present invention are useful tools in several areas of biotechnology, 
especially in protein engineering, where their ability to mediate homo-dimerization or hetero- 

10 dimerization has found several applications, including but not limited to immunochemistry, 
antibody generation, preparation of soluble oligomeric proteins, complementation assasys. The 
utility of the present invention may be further confirmed by methods described, for example, by 
Bosslet et al (US patent 5,643,73 1) in which use of a pair of leucine zippers for in vitro diagnosis, in 
particular for the immunochemical detection and determination of an analyte in a biological liquid ; 

1 5 by Tso et al (US patent 5,932,448) in which use of leucine zippers for producing bispecific antibody 
heterodimers ; by Conrad et al (US patent 5,965,712) , Ciardelli et al (US patent 5,837,816) , and 
Spriggs et al (W094 10308) in which methods of preparing soluble oligomeric proteins using 
leucine zippers have been described; and by Pelletier et al (WO9834I20) in which methods to use 
leucine zipper forming sequences in protein fragment complementation assays to detect 

20 biomolecular interactions has been described, all examples which disclosures are hereby 
incorporated by reference in their entireties. 

The multimerization activity of the polypeptides of the present invention containing leucine 
zipper domains may be assayed using any of the assays known to those skilled in the art including 
circular dichroism spectrum and thermal melting analyses as described in US patent 5,942,433, 

25 which disclosures are hereby incorporated by reference in their entirety. Alternatively, the leucine 
zipper motif in LZP could be used by those skilled in art as a ^'bait protein" in a well established 
yeast double hybridization system to identify its interacting protein partners in vivo from cDNA 
library derived from different tissues or cell types of a given organism. Alternatively, LZP or part 
thereof could be used by those skilled in art in mammalian cell transfection experiments. When 

30 fused to a suitable peptide tag such as [Hisje tag in a protein expression vector and introduced into 
culture cells, this expressed fusion protein can be immunoprecipitated with its potential interacting 
proteins by using anti-tag peptide antibody. This method could be chosen either to identify the 
associated partner or to confirm the results obtained by other methods such as those just mentioned. 
In a preferred embodiment, the invention relates to compositions and methods of using the 

35 LZP polynuceotides and polypeptides of SEQ ID NO: and SEQ ID NO: or fragment thereof for 
preparing soluble multimeric proteins, which consist in multimers of fusion proteins containing a 
leucine zipper fused to a protein of interest, using any technique known to those skilled in the art 
including those described in international patent W094 10308, which disclosure is hereby 
incorporated by reference in its entirety. In another preferred embodiment, LZP or derivative 
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thereof is used to produce bispecific antibody heterodimers as described in US patent 5,932,448, 
which disclosure is hereby incorporated by reference in its entirety. Briefly, leucine zippers capable 
of forming heterodimers are respectively linked to epitope binding components with dififerent 
specificities. Bispecific antibodies are formed by pairwise association of the leucine zippers, 
S forming an heterodimer which links two distinct epitope binding components. In still another 
preferred embodiment, LZP or part thereof or derivative thereof is used for detection and 
determination of an analyte in a biological liquid as described in US patent 5,643,731, which 
disclosure is hereby incorporated by reference in its entirety. Briefly, a first leucine zipper is 
immobilized on a solid support and the second leucine zipper is coupled to a specific binding 

10 partner for an analyte in a biological fluid. The two peptides are then brought into contact thereby 
immobilizing the binding partner on the solid phase. The biological sample is then contacted with 
the immobilized binding partner and the amount of analyte in the sample bound to the binding 
partner determined. In still another preferred embodiment, the LZP or part thereof may be used to 
synthesize novel nucleic acid binding proteins which are able to multimerize with proteins of 

1 5 interest, for example to inhibit and/or control cellular growth using any genetic engineering 

technique known to those skilled in the art including the ones described in the US patent 5,942,433, 
which disclosure is hereby incorporated by reference in its entirety. 

In another embodiment, the invention relates to compositions and methods using the LZP or 
part thereof or derivative thereof in protein fragment complementation assays to detect 

20 biomolecular interactions in vivo and in vitro as described in international patent WO9834120, 
which disclosures is hereby incorporated by reference in its entirety. Such assays may be used to 
study the equilibrium and kinetic aspects of molecular interactions including protein-protein, 
protein-nucleic acid, protein-carbohydrate and protein-small molecule interactions, for screening 
cDNA libraries for binding to a target protein with unknown proteins or libraries of small organic 

25 molecules for biological activity. 

Still, another object of the present invention relates to the use of the LZP or part thereof for 
identifying new leucine zipper domains using any techniques for detecting protein-protein 
interaction known to those skilled in the art. Among the traditional methods which may be 
employed are co-immunoprecipitation, crosslinking and co-purification through gradients or 

30 chromatographic columns of cell lysates. Once isolated as a protein interacting with the LZP, such 
an intracellular protein can be identified (e.g. its amino acid sequence determined) and can, in tum, 
be used, in conjunction with standard techniques, to identify other proteins with which it interacts. 
The amino acid sequence thus obtained may be used as a guide for the generation of oligonucleotide 
mixtures that can be used to screen for gene sequences encoding such intracellular proteins. 

35 Screening may be accomplished, for example, by standard hybridization or PCR techniques. 

Techniques for the generation of oligonucleotide mixtures and the screening are well-known. (See, 
e.g., Ausubel et aA, cds,. Current Protocols in Molecular Biology, J.Wiley and Sons (New York, 
NY 1993) and PR Protocols: A Guide to Methods and Applications, 1990, Innis, M. et al., eds. 
Academic Press, Inc., New York ). 
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Alternatively, methods may be employed which result in the simultaneous identification of 
genes which encode the intracellular proteins that can dimerize with the LZP or part thereof using 
any technique known to those skilled in the art. These methods include, for example, probing cDNA 
expression libraries, in a manner similar to the well known technique of antibody probing of 
S lambda.gt] 1 libraries, using as a probe a labeled version of the LZP or part thereof, or fusion 
protein, e.g., the LZP or part thereof fused to a marker (e.g., an enzyme, fiuor, luminescent protein, 
or dye), or an Ig-Fc domain (for technical details on screening of cDNA expression libraries, see 
Ausubel etal, supra). Alternatively, another method for the detection of protein interaction in vivo, 
the two-hybrid system, may be used. 

10 

Proteins of SEP ID N0:3 1 8 (internal designation Clone ID: 124608) and related protein of SEP ID 
NP:SS6. 

The polynucleotides of SEQ ID NP:149 and SEQ ID NP:442 and polypeptides of SEQ ID 
NP:3 1 8 and SEQ ID NP:556 encode an RNA-binding protein, hgRBP, which functions in RNA 
1 5 processing and protein expression. The preferred composition of SEQ ID NP:3 1 8 and 556 include 
MERPDKAALNALQPPEFRNESSLASTLKTLLFFTALMITVPIGLYFTTKSYIFEGALGMSNR 
DSYFYAAIVAVVAVHVVLALFVYVAWNEGSRQWREGKQD. 

Further preferred polypeptides include those of SEQ ID NP:3 1 8 or SEQ ID ND:556 
comprising an N-myristoylation site or the amino acid sequence at positions 43-48 or comprising 
20 the amino acid sequence GLYFTT which targets the protein to the membrane of the endoplasmic 
reticulum for function of hgRBP in translation of cellular mRNA into protein. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NP: 149 and SEQ ID NO:442 and polypeptides of SEQ ID NP:3 1 8 and SEQ ID NP:556, described 
throughout the present application also pertain to the human cDNA 124608of clone , and the 
25 polypeptides encoded thereby. 

Protein of SEP ID NP:337 (internal designation Clone ID: 11 3448) 

The polynucleotides of SEQ ID NP: 1 68 and related SEQ ID NP:454 and polypeptides of 
SEQ ID NP:337 encode a novel human RNA-binding protein involved in RNA processing and 
30 protein expression which is related to Clone ID: 1 83902 and Clone ID:63S993. 

It will be appreciated that all characteristics and uses of the The polynucleotides of SEQ ID 
NP:168 and related SEQ ID NP:454 and polypeptides of SEQ ID NP:337, described throughout 
the present application also pertain to the human cDNA of clone 1 13448, and the polypeptides 
encoded thereby. 

35 

Protein of SEP ID NP:328 (internal designation Clone ID: 1 83902) 

Polynucleotides of SEQ ID NP: 159 and related SEQ IDNP:450 and polypeptides of SEQ 
ID NP:328 encode a novel human RNA-binding protein involved in RNA processing and protein 
expression which is related to Clone ID: 1 13448 and Clone ID:635993. 
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It will be appreciated that all characteristics and uses of the The polynucleotides of SEQ ID 
NO: 1 59 and related SEQ ID NO:450 and polypeptides of SEQ ID NO:328, described throughout 
the present application also pertain to the human cDNA of clone 183902, and the polypeptides 
encoded thereby. 

5 

Protein of SEP IDNO:329 (internal desipnation Clone ID:635993) 

Polynucleotides of SEQ ID NO: 160 and related SEQ ID N0:4S1 and polypeptides of SEQ 
ID NO:329 encode a novel human RNA-binding protein involved in RNA processing and protein 
expression which is related to Clone ID: 1 83902 and Clone ID: 1 13448. It will be appreciated that 

10 all characteristics and uses of the The polynucleotides of SEQ ID NO: 160 and related SEQ ID 
NO:45 1 and polypeptides of SEQ ID NO:329, described throughout the present application also 
pertain to the human cDNA of clone 635993, and the polypeptides encoded thereby. 

Many eukaryotic proteins that bind single-stranded RNA contain one or more copies of a 
putative RNA-binding domain of about 90 amino acids. This is known as the eukaryotic RNA- 

1 5 binding region, RNP-1 signature or RNA recognition motif (RRM) (Bandziulis et al. Genes Dev. 
3:43 1 (1989); Swanson et al. Trends Biochem. Sci. 13: 86-91 (1988)). RRMs are found in a 
variety of RNA binding proteins, including heterogeneous nuclear ribonucleoproteins (hnRNPs), 
proteins implicated in regulation of alternative splicing, and protein components of small nuclear 
ribonucleoproteins (snRNPs). The polypeptides of SEQ ID NO:337, 328, and 329 encode novel 

20 human RNA binding protein, hereafter referred to as ghRBP which contains one copy of an RRM. 
Further characteristic of a protein which binds to nucleic acids, ghRBP contains a zinc finger motif 
comprising the amino acid sequence. Preferred polynucleotides of the invention include 
polynucleotides comprising the nucleic acids of SEQ ID NO: 1 59, 1 60, 1 68, 450, 45 1 ,and 454. 
Preferred polypeptides of the invention are polypeptides comprising the amino acids of SEQ ID 

25 NO:337, 328 and 329. It will be appreciated that all characteristics and uses of the polynucleotides 
of SEQ ID NO: 1 59, 1 60. 1 68, 450, 45 1 ,and 454 described throughout the present application also 
pertain to the human cDNA of Clone ID: 1 83902, Clone ID:635993 and Clone ID:1 13448. 

Preferred amino acids of the invention are residues which comprise the RNA-binding 
domain or portion thereof. Preferred amino acid sequences are selected from the following set of 

30 sequences including 

AFVRRXPWTAASSQLKEHFAQFGHVRRCILPFDKETGFHRGLGWVQFSSEEGLRNALQQE 

NHIIDGVKVQV; 

SINQPVAFVRRXPWTAASSQLKEHFAQFGHVRRCILPFDKETGFHRGLGWVQFSSEEGLRN 
ALQQENHIID; 

35 PWTAASSQLKEHFAQFGHVRRCILPFDICETGFHRGLGWVQFSSEEGLRNALQQENHnDGV 
KVQVHTRRP. Further preferred are polypeptides of the invention include any fragment of SEQ 
ID NO: which binds to RNA. 

An embodiment of the invention relates to methods of using the polypeptides of the 
invention to bind to RNA molecules in vitro by techniques that are known in the art. Preferred use 
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of the polypeptides of the invention includes extraction of RNA from biological samples, chemical 
reagents, cell homogenates and tissue homogenates. Further utility of the polypeptides of the 
present invention or part thereof may be further confirmed by binding methods described in 
Trifillis, et aL, RNA 5(8): 1071-82 (1999) and U.S. Patent 6,107,029, which disclosures are hereby 
5 incorporated by reference in their entireties. 

Proteins of SEP ID NO:248 rintemal designation Clone ID:199782\ SEP ID NO:249 (internal 
designation Clone ID:821212\ SEP ID NP:250 (internal designation Clone 1D:202863^ and related 
protein of SEP ID NP:S 1 8. 

1 0 The polynucleotides of SEQ ID NPs:79, 80, 8 1 and 40 1 and polypeptides of SEQ ID 

NPs:248, 249, 250 and 518 encode human RNA-associated polypeptides which act as splicing 
factors. It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NOs:79, 80, 81 and 401 and polypeptides of SEQ ID NPs:248, 249, 250 and 518, described 
throughout the present application also pertain to the human cDNA of clones 199782, 821212, and 

1 5 202863, and the polypeptides encoded thereby. 

The translation of genetic information into protein depends on RNA and the first step in this 
process is the transcription of DNA into RNA while retaining all the genetic information encoded in 
DNA. The RNA transcript undergoes various processing steps which include splicing and 
polyadenylation. The mature RNA transcript is translated into protein by the ribosomal machinery. 

20 Nascent RNA transcripts are spliced in the nucleus by the spliceosomal complex which catalyzes 
the removal of introns and the rejoining of exons. At least 40 splicing factors have been identified 
and interaction of these factors are important in the conformational changes needed for the 
enzymatic removal of introns and religation of the exons. Both protein and RNA components are 
involved in the spliceosome assembly and the splicing reaction. There are 2 distinct catalytic steps 

25 involved in the RNA splicing reaction with distinct proteins and RNA species. Alternative splicing 
factors include developmentally regulated proteins that play key roles in developmental processes 
such as pattern formation and sex determination, respectively (Hodgkin, J. et al. (1994) 
Development 120:3681-3689). Alternate splicing is also involved in the tissue specific expression 
of isoforms of proteins, including structural proteins and enzymes. 

30 An embodiment of the present invention relates to compositions of the polynucleotides of 

SEQIDNP:79,80,81 and 401 and polypeptides of SEQ ID NP:248, 249, 250 and 518. Preferred 
amino acids of the invention comprise the zinc finger region or fragment thereof and are selected 
from the following sequences of amino acids from SEQ ID NP:248, 249, 250 and 518 including 
GACENCGAMTHKKKDCFE; NSIITKYRKGACENCGAM; or THKKKDCFERPRRVGAKF. 

35 The polypeptides of SEQ ID NG:248, 249, 250 and 5 1 8 are involved in the splicesome 

complex and have function in the processing of RNA processing. Alternatively, the polypeptides of 
the present invention are involved in RNA processing and thus involved in protein expression. A 
preferred embodiment of the invention relates to a method of using the polynucleotides of 
polynucleotides of SEQ ID NP:79, 80, 81 and 401 in vitro. A preferred method of use relates to 
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introduction of said polypeptides or fragments thereof into cells by techniques known in the art such 
as transfection or microinjection. Further preferred are methods to use the polynucleotides of the 
invention to alter protein expression in the given cell. Alternately, polypeptides of the present 
invention can be used in combination with reagents known in the art to alter protein expression in 
5 cell free expression systems, mammalian expression systems, insect expression systems, or bacterial 
expression systems. Furthermore, methods to use the polynucleotides or polypeptides the present 
invention to increase or decrease protein expression is preferred. The utility of the polypeptides of 
the invention or part thereof may be further confirmed using methods described in US Patent 
6020164 and Chua and Reed. Gene Devel 13:841-850 (1999), which disclosures are hereby 

10 incorporated by reference in their entireties. 

in another embodiment, methods to screen for inhibitors and activators of the polypeptides 
of the invention are preferred. In another embodiment, molecules or compounds which are 
identified in such a screen are further preferred. Further preferred are compounds which activate or 
inhibit activity of the polypeptides of the current invention. Activity of the the polypeptides of the 

IS invention is modified by phosphorylation at cAMP and cGMP dependent phosphorylation sites 
(including 3-6:SS-S8; 107-1 10) and casein kinase I! phosphorylation sites (including 33-36:58- 
61 : 126-129). Preferred activators include but are not limited to compounds which promote 
accumulation of intracellular cAMP and cGMP. Further preferred activators include those 
compounds which activate casein kinase 11. Preferred inhibitors are those compounds which inhibit 

20 intracellular cAMP and cGMP accumulation, or those compounds which promote cAMP and cGMP 
degradation. Further preferred inhibitors include compounds which promote the deactivation of 
casein kinase II. Furthermore, inhibitors and activators of the polypeptides of the present invention 
include compounds known in the art as well as compounds to be identified by the method of 
screening. Furthermore, compounds that inhibit or activate the activity of the polypeptides of the 

25 present invention by means other than phosphorylation or dephosphorylation are also preferred. 

In another embodiment, a method for the use of the polynucleotides or polypeptides of the 
present invention in the treatment, prevention, attenuation or diagnosis of disorders of RNA 
processing or protein processing are preferred. Such disorders are selected from a group which 
includes but is not limited to cancers such as adenocarcinoma, leukemia, sarcoma, teratocarcinoma, 

30 and any disorder associated with cell growth and differentiation, embryogenesis, and 

morphogenesis involving any tissue, organ, or system, e.g., the brain, adrenal gland, or reproductive 
system. 

Proteins of SEP ID NO:2S6 (internal designation Clone lD:822794y SEP ID NO:2S7 fintemal 
35 designation Clone 1D:337572) and related protein of SEP ID NP:S21 . 

The polynucleotides of SEQ ID NO:87, 88 and 407 and polypeptides of SEQ ID NP:256, 
257 and 521 encode human nuclear polypeptides which interact with transcription factors of the 
Signal Transducers and Activators of Transcription (STAT) family of proteins involved in the 
regulation of cell division. It will be appreciated that all characteristics and uses of the 
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polynucleotides of SEQ ID NO:87, 88 and 407 and polypeptides of SEQ ID NO:256, 257 and 521, 
described throughout the present application also pertain to the human cDNA of clones 822794 and 
337572, and the polypeptides encoded thereby. 

STATs are pleiotropic transcription factors which mediate cytokine-stimulated gene 
5 expression in multiple cell populations (Levy, Cytokine Growth Factor Rev., 8:81 (1997)). All 
STAT proteins contain a DNA binding domain, a Src homology 2 (SH2) domain, and a 
transactivation domain necessary for transcriptional activation of target gene expression. Janus 
kinases (JAK), including JAKl, JAK2, Tyk, and JAK3, are cytoplasmic protein tyrosine kinases 
(PTKs) which play pivotal roles in initiation of cytokine-triggered signaling events by activating the 

1 0 cytoplasmic latent forms of STAT proteins via tyrosine phosphorylation on a specific tyrosine 
residue near the SH2 domain (Ihle et a!., Trends Genet., 1 1 : 69 (1995); Darnell, Science 
277(5332): 1630 (1997); Johnston et al.. Nature, 370: 1513 (1994)). Tyrosine phosphorylated STAT 
proteins dimerize through specific reciprocal SH2-phosphotyrosine interactions and translocate 
from the cytoplasm to the nucleus where they stimulate the transcription of specific target genes by 

1 5 binding to response elements in their promoters (Leonard, Nature Medicine, 2: 968 (1996); Zhong 
et al., PNAS USA, 91:4806 (1994) Darnell, Science, 277:1630 (1997)). 

In an embodiment of the present invention, compositions of the polynucleotides and 
polypeptides or fragments thereof SEQ ID NO:87, 88 and 407 and SEQ ID NO:256, 257 and 521, 
respectively are included. Further preferred are polypeptides of the present invention which interact 

20 with activated STAT 3, but may also interact with STATl, STAT2 or other STAT homologues. 
Preferred polypeptides of the invention act to inhibit or decrease the activity of STATs. Further 
preferred amino acids of SEQ ID NO:256, 257 and 521 include the SAP domain 
VSSFRVSELQVLLGFAGRNKSGRKHDLLMRALHLL. Activation of cytokine receptors by 
their cognate ligands activate JAKs which in turn, activate STATS. Therefore cytokines and other 

25 hormones which signal through cytokine-like receptors may be modulated by polypeptides or 
polynucleotides of the present invention. Cytokines and other hormones which can thus be 
modulated by the present invention include but are not limited to interferons, interleukins, prolactin, 
and growth hormone. The utility of the polypeptides of the present invention or part thereof may be 
further confirmed using the methods described in WIPO Publication W09928465 which disclosure 

30 is hereby incorporated by reference in its entirety. 

The polypeptides of this invention can be used in a method of inhibiting the activity of 
STAT proteins in a cell in vitro, the method comprising introducing a nucleic acid into the cell, 
wherein the nucleic acid comprises a nucleotide sequence encoding the amino acid sequence of 
SEQ IDNO:256, 257 and 521 or the amino acid sequence of SEQ ID NO:256, 257 and 521 with 

35 one or more conservative amino acid alterations, and wherein the nucleic acid expresses the amino 
acid sequence in an amount and for a time sufficient for the amino acid sequence to specifically 
bind to STAT proteins and to decrease STAT activity, thereby decreasing STAT activity in the cell. 

Suitable compositions of polypeptides or polynucleotides of the present invention are useful 
as a method of treatment of pathologies such as diseases, syndromes, or other undesirable 
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conditions resulting from defects in cell cycle progression. Such cell cycle defects may result from 
defects in the regulation of activated STAT or an upstream factor such as activated JNKs or 
activated cytokine receptors. Alternatively, polypeptides or polynucleotides of the present 
invention may be used in a method of treating pathologies resulting from defects in cell cycle 
S progression due to defects in a step "downstream" of STAT regulation of cell cycle progression. In 
preferred embodiments, agonists of polypeptides or polynucleotides of the present invention are 
useful in the treatment of pathologies such as but not limited to hyperproliferative diseases such as 
cancer (e.g., leukemia, lymphoma, breast cancer, colon cancer, prostate cancer, Wilms' tumor), 
coronary artery disease, pulmonary vascular obstructive disease, either primary or as a feature of 

10 Eisenmenger's syndrome, and other disorders of abnormal cellular proliferation. Cells to be treated 
include but are not limited to hyperproliferative cells, cancer cells, vascular smooth muscle cells, 
endothelial cells, and gametes. 

In some embodiments of the invention, antagonists of the polypeptides or polynucleotides 
of the present invention are used to stimulate, promote, or facilitate progression through the cell 

1 S cycle, such as in the cellular regeneration of terminally differentiated cardiac myocytes or tissues, 
e.g., striated muscle myocytes. For example, this could allow restoration of damaged myocardium 
after cardiac injury, myocardial infarction, myocarditis, cardiomyopathy, trauma, as a consequence 
of cardiac surgery, etc., or repletion of striated muscle exhausted by muscular dystrophy. 

In further embodiments, expression of the polypeptides encoded by the nucleic acids is 

20 expected to prevent, ameliorate, or lessen the cell cycle defect of the host cell, or to restore normal 
cell cycle progression of the host cell. Whether provided via nucleic acid or polypeptides delivered 
directly to cells, the therapeutic formulations of the invention can also be used as adjuncts to other 
forms of therapy, including but not limited to chemotherapy, and radiation therapy. 

25 Protein of SEP ID NO:330 (internal designation Clone ID:398703) and related protein of SEP ID 

mm. 

The polynucleotides of SEQ ID NP: 161 and SEQ ID NP:452 and polypeptides of SEQ ID 
NP:330 encode a novel human deubiquitinating enzyme (GNPrAFOl 7306). Deubiquitinating 
en^mes serve a number of functions (Hochstrasser Cur Ppin Cell Biol 4:1024 (1992); Rose, In: 

30 Ubiquitin, Plenum Press, New York (1988)). First, ubiquitin must be cleaved from a set of 

biosyntheticprecursors, which occur either as a series of ubiquitin monomers in head-to-tail linkage 
or as fusions to certain ribosomal proteins (Finley & Chau, Annu Rev Cell Biol 7, 2S-69 (1991)). 
Secondly, ubiquitin must be recycled from intracellular conjugates, both to maintain adequate pools 
of free ubiquitin and, in principle at least, to reverse the modification of inappropriately targeted 

35 proteins. Finally, deubiquitinating reactions may be integral to the degradation of ubiquitinated 
proteins by the 26S proteasome, a complex ATP-dependent enzyme whose exact composition and 
range of activities remain poorly characterized (Hershko & Ciechanover, Annu Rev Biochem 61, 
761-807 (1992); Hadari et al., J Biol Chem 267, 719-727 (1992); Murakami et al., Nature 360, 597- 
9 (1992); Rechsteiner, J. Biol. Chem. 268. 6065-6068 (1993)). 
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An embodiment of the invention includes preferred polypeptides with ubiquitin-specific 
protease activity with a novel N-terminus of Clone ID:398703 comprising the amino acid sequence 
MCTTSLPCPIIMEPWGLATTKAAYVLFYQRRDDEFYKTPSLSSSGSSDGGTRPSSSQQGFGD 
DEACSMDTN encoded by 
5 ATGTGTACGACCTCATTGCCGTGTCCAATCATTATGGAGCCATGGGGGTTGGCCACTAC 
TAAAGCAGCTrATGTGCTATTTTACCAACGTCGAGATGATGAATTTTATAAGACACC^ 
CACTTAGCAGTTCTGGTTCCTCTGATGGAGGGACACGACCAAGCAGCTCTCAGCAGGG 
CTTTGGGGATGATGAGGCTTGCAGCATGGACACCAACTAA of SEQ ID NO: 1 61 . 

The preferred polypeptides of the invention are those which prevent or reverse 
10 ubiquitination of cellular proteins in vitro or in vivo. Further preferred are polypeptides of the 
invention which prevent or reverse ubiquitination of extracellular proteins in vitro or in vivo. 

The polynucleotides of SEQ ID NO: 1 6 1 and SEQ ID NO:452 encode polypeptides of SEQ 
ID NO:330 which contain protein domains or motifs including but not limited to a Protein kinase C 
phosphorylation site comprising the amino acid fragment SAR, and an N-myristylation site 
1 5 comprising amino acid fragment GLNMSE. Further preferred amino acids of SEQ ID NO : 330 
include the Ubiquitin carboxyl-terminal hydrolases family 2 signature or amino acid sequence 
YDLIAVSNHYGAMGVGHY. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO: 1 61 and SEQ ID NO:452 and polypeptides of SEQ ID NO:330, described throughout the 
20 present application also pertain to the human cDNA of clone 398703, and the polypeptides encoded 
thereby. 

The utility of the polynucleotides and polypeptides of the present invention or part thereof 
may be further confirmed using methods which assess activity or function of deubiquitinating 
enzymes described in United States Patents 5391490 and 5565352 which disclosures are hereby 
25 incorportated by reference in their entireties. 

Proteins of SEO ID NO:277 (internal designation Clone ID:653966) and related protein of SEP ID 
NO:53S. 

The polynucleotides of SEQ ID NO: 1 08 and SEQ ID NO:42 1 and polypeptides of SEQ ID 
30 NO:277 and SEQ ID NO:535 encode human liver fatty acid binding protein (L-FABP) comprising 
the amino acid sequence of SEQ ID NO:277 and 535. The amino acid sequence of SEQ ID NO:277 
and 535 are the same as human L-FABP (Genbank accession GNP:M10617; Lowe et al., JBC 
260:3413-17 (1985)) and homologous to human FABP (Genbank accession GNP:M10050). The 
polypeptides of the present invention belong to the FABP/P2/CRBP/CRABP family of transporters 
35 and functionally binds to free fatty acids and derivatives thereof. L-FABP is normally expressed in 
the cytoplasm of hepatocytes, but preferred embodiments include use of the polypeptides of the 
present invention as extracellular polypeptides. Further preferred embodiments include use of the 
polypeptides of the present invention as serum or plasma polypeptides. Further preferred 
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embodiments use polypeptides of the invention in vitro. Still further preferred embodiments 
include use of the polypeptides of the present invention in vivo. 

Preferred amino acids of the invention include the lipocalin domain, from 2 to 127 or 
polypeptides comprising the amino acid sequence 
5 SFSGKYQLQSQENFEAFMKAIGLPEELIQKGKDIKGVSEIVQNGKHFKFTITAGSKVIQNEFT 
VGEECELETMTGEKVKTVVQLEGDNKLVTTFKNIKSVTELNGDIITNTMTLGDIVFKRISKR 
I. The polypeptides of SEQ ID NO:277 and SEQ ID NO:535 contain a cytosolic fatty-acid binding 
. protein signature comprising the amino acid sequence GKYQLQSQENFEAFMKAI which 
functions in the polypeptides ability to bind small hydrophobic molecules, such as lipids, steroid 
10 hormones, and retinoids. Preferred amino acids of SEQ ID NO:277 and SEQ ID NO:535 include 
GKYQLQSQENFEAFMKAI, MSFSGKYQLQSQENFEAF, and LQSQENFEAFMKAIGLPE. 

Phosphorylation status modulates the activity of L-FABP. Preferred polypeptides of the 
invention include the amino acids sequence comprising the sites of cAMP- and cGMP-dependent 
protein kinase phosphorylation including residues of SEQ ID NO:277 and 535 comprising the 
1 5 sequence KRIS. 

Further preferred polypeptides of the invention include the amino acid sequence comprising 
the sites of Protein kinase C phosphorylation including residues at positions 4 to 6, 94 to 96, and 
124 to 126 of SEQ ID NO:277 and 535. Still further preferred polypeptides of SEQ ID NO:277 and 
535 include the amino acid sequence comprising SGK, TFK, and SKR. 

20 Further preferred are polypeptides of SEQ ID NO: 277 and 535 include the amino acid 

sequence comprising a Casein kinase H phosphorylation sites. Preferred amino acids of SEQ ID 
NO: 277 and 535 include positions 64 to 67, 100 to 103, and 1 14 to 1 17. Further preferred amino 
acids comprise the sequences TVGE, SVTE, and TLGD. 

A preferred polypeptide of SEQ ID NO: 277 and 535 is one in which the amino acid 

25 asparagine (Asn) is located at residue 105, further referred to as the N-isoform. Further preferred is 
the polypeptide of SEQ ID NO: 277 and 535 in which the amino acid aspartate (Asp) is located at 
residue 1 05 further referred to as the D-isoform. The rat homologue of the human D-isoform of the 
present invention was shown to have a greater affinity to lysophospholipids, prostaglandins, 
retinoids, bilirubin and bile salts compared to the rat homologue of the human N-isoform of the 

30 present invention by methods described by DiPietro and Santome, Biochim Biophys Acta 

1478 : 1 86-200 (2000) which disclosure is hereby incorporated by reference in its entirety. The rat 
homologues share only 82% identity with the of the human D- and N-isoforms, therefore it is not 
predictable to find that the human D-isoform has equal or greater affinity to lysophospholipids, 
prostaglandins, retinoids, bilirubin, bile salts and fatty acid compared to the human N-isoform. 

35 Further preferred polypeptides of the present invention include the D-isoform polypeptide 

and fragments thereof which have an equal or at least 10%, 20%, 30%, 40%, 50%, 60% or 75% 
greater affinity for fatty acids, and lipophilic compounds selected from a group including but not 
limited to lysophospholipids, prostaglandins, retinoids, bilirubin, bile salts, steroid hormones (such 
as testosterone and estradiol), and cholesterol compared to the N-isoform. 
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Another embodiment of the invention includes polynucleotides or polypeptides of the 
invention or fragments thereof which bind lipophilic compounds selected from a group including 
but not limited to free fatty acids, lysophospholipids, prostaglandins, retinoids, bilirubin, bile salts, 
steroid hormones (such as testosterone and estradiol), and cholesterol in serum or plasma. Further 
S preferred are polypeptides of the invention which bind lipophilic compounds in serum or plasma 
separated from whole blood in a process of purifying serum or plasma for use in vitro or in vivo. 
Further preferred are polypeptides of the invention which bind lipophilic compounds in serum or 
plasma in vivo. 

In a further embodiment, polynucleotides or polypeptides of the invention or fragments 
10 thereof, in physiological appropriate formulations, are useful in the prevention, treatment or 
attenuation of conditions in which lipophilic compounds are elevated in the serum of mammals, 
preferably humans. Such conditions are selected from a group which include but are not limited to 
obesity, hyperlipidemia, hypercholesterolemia, hypertriglyceridemia, diabetes type 1 (IDDM) 
diabetes type II (NIDDM), atherosclerosis, and hypertension. 
1 5 Mammary-derived growth inhibitor (MDGI) and heart-fatty acid binding protein (FABP), 

which belong to the FABP family, specifically inhibit growth of normal mouse mammary epithelial 
cells (MEC) and promote morphological differentiation, stimulates its own expression and promotes 
milk protein synthesis (US Patent 5977309, 24 March 1995). In further preferred embodiments, 
polypeptides of the invention include those which locally signal growth cessation and stimulate 
20 differentiation of the developing epithelium. Further preferred polypeptides of the invention 
suppress the mitogenic effects of EGF family members, and inhibit c-fos, c-myc and c-ras 
expression. 

In a further aspect of the present invention, there is provided a method for producing such 
polypeptide by recombinant techniques comprising culturing recombinant prokaryotic and/or 
25 eukaryotic host cells, containing a human fatty acid binding polypeptides or polynucleotides of the 
invention acid under conditions promoting expression of said protein and subsequent recovery of 
said protein. 

In a further embodiment of the present invention, there is provided a method for utilizing 
such polypeptides, or polynucleotides of the invention for therapeutic purposes, for example, as a 

30 cell growth inhibitor and as to cause differentiation stimulatory activity on various responsive types 
of tissues and cells in vitro. Further preferred are methods for use of polypeptides, or 
polynucleotides of the invention, in appropriate physiological form, for therapeutic purposes for to 
inhibit cell proliferation or to induce cell differentiation in mammals, preferably humans. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 

35 NO: 108 and SEQ ID NO:421 and polypeptides of SEQ ID NO:277 and SEQ ID NO:535, described 
throughout the present application also pertain to the human cDNA of clone 653966, and the 
polypeptides encoded thereby. 
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Proteins of SEP ID NQ:313 (internal designation Clone ID:633418), SEP ID NO:314 (internal 

designation Clone ID:422878) and related protein of SEC IDNO:5S2. 

The polynucleotides of SEQ IDNP:144, 145 and 438 and polypeptides of SEQ IDNP:313, 
3 14and 552 encode a cleavage stimulation factor important in mRNA processing and protein 
S expression. Protein kinase C phosphorylation increases activity of said polypeptides and preferred 
amino acids include SEK and SGR. Further, sites of tyrosine kinase phosphorylation increase 
activity of said polypeptides and preferred amino acids of SEQ ID N0:3 14, 552 include 
KKLEENPY. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
10 NO: 1 44, 1 45 and 438 and polypeptides of SEQ ID N0:3 13,31 4and 552, described throughout the 
present application also pertain to the human cDNA of clones 63341 8 and 422878, and the 
polypeptides encoded thereby. 

Proteins of SEP ID NP:219 (internal designation Clone ID:589848\ SEP IDNP:220 (internal 
15 designation Clone ID:21 1883V SEQ ID NO:221 (internal designation Clone ID:642603\ SEP ID 
NO:222 (internal designation Clone ID: 1933 16), and related protein of SEP ID NO:497. 

Polynucleotides of SEQ ID NP:50, 51, 52, 53, 380 and polypeptides of SEQ ID NP:219, 
220, 221 and 497 encode RNA associated proteins with a ribosomal L34 domain comprising the 
amino acid sequence NEYQPSNIKRKNKHGWVRRLXTPAGXXXILRRMLKGRKSLSH or 
20 NEYQPSNIKRKNKHGWVRRLXTPAGVQVILRRMLKGRKSLSH. It will be appreciated that 
all characteristics and uses of the polynucleotides of SEQ ID NGs:50, 51, 52, 53, 380 and 
polypeptides of SEQ ID NDs:219, 220, 221 and 497, described throughout the present application 
also pertain to the human cDNA of clones 589848, 21 1883, 642603 and 193316, and the 
polypeptides encoded thereby. 

25 

Proteins of SEP ID NP:302 (internal designation Clone ID:1Q0Q89I255) and related protein of 
SEPIPNP:543. 

Polynucleotides of SEQ ID NP: 1 33 and 429 and polypeptides of SEQ ID NP:302 and 543 
encode human ribosomal protein, hRlBPRT. An embodiment of the invention includes the 

30 compositions of the polypeptides of SEQ ID NG:302 and 543, comprising the amino acid sequence 
MVAAKKTKKSLESIKSRLQLVMKSGKYVLGYKQTLKMIRQGKAKLVILANNCPALRK^ 
EYYAMLAKTGVHHYSGNNIELGTACGKYYRVCTLAIIDPXDSXIIRSMPEQTGEK, and the 
polynucleotides of SEQ ID ND: 133 and 429, respectively, which encode human ribosomal protein, 
hRIBPRT. The polypeptides of the invention contain the ribosomal protein 

35 L30e/L7Ae/S12e/Gadd4 signature (Koonin EV, J Mol Med 75:236-238 (1997) and Nakanishi et al., 
Gene 35:289-96 (1985)). 

Preferred polypeptides of the invention include the amino acid sequence comprising 
KSLESIKSRLQLVMKSGKYVLGYKQTLKMIRQGKJ\KLVILANNCPALRKSEIEYYAMLAK 
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TGVHHYSGNNIELGTACGKYYRVCTLAIIDPXDSXIIR; 
KSLESIKSRLQLVMKSGKYVLGYKQTLKMIRQGKAKLVILANNCPALRK ; and 
SEIEYYAMLAKTGVHHYSGNNIELGTACGKYYRVCTLAIIDPXDSXIIR. Further preferred 
amino acids of the invention include sites of PKC phosphorylation, comprising the amino acid 
S sequences of SEQ lDNO:302and 543 including TKK (positions 7-13); SIK (positions 13-15); 
SGK (positions 24-26); and TLK (positions 34-36). Further preferred amino acids of the invention 
include sites of Casein Kinase II phosphorylation, comprising the amino acid sequences SEIE 
(positions 58-61) and SMPE (positions 107-1 10). 

In another embodiment, the proteins of SEQ ID NO:302 and 543 can be used to bind to 

10 nucleic acids, preferably RNA, alone or in combination with other substances. For example, the 
proteins of the invention or part thereof can be added to a sample containing RNAs in optimum 
conditions for binding, and allowed to bind to RNAs. In a preferred such embodiment, the proteins 
of the invention or part thereof may be used to purify mRNAs, for example to specifically isolate 
RNA, e.g. from a specific cell type or from cells grown under particular conditions. Such RNAs 

15 could then be reverse transcribed and cloned, could be analyzed for relative expression analyses, 
etc. In addition, such methods may be used to specifically remove RNA from a sample, for 
example during the purification of DNA. To carry out any of these methods, the proteins of the 
invention or part thereof may be bound to a chromatographic support, either alone or in 
combination with other RNA binding proteins, to form an affinity chromatography column. A 

20 sample containing a mixture of nucleic acids to purify is then run through the column. 

Immobilizing the proteins of the invention or part thereof on a support is particularly advantageous 
for embodiments in which the method is to be practiced on a commercial scale. This 
immobilization facilitates the removal of RNAs from the batch of resin-coupled protein after 
binding, and allows subsequent re-use of the protein. Immobilization of the proteins of the 

25 invention or part thereof can be accomplished, for example, by inserting any matrix binding domain 
in the protein according to methods known to those skilled in the art. The resulting fusion product 
including the proteins of the invention or part thereof is then covalently, or by any other means, 
bound to a protein, carbohydrate or matrix (such as gold, ""Sephadex" particles, polymeric surfaces). 
Another embodiment of the present invention relates to methods and compositions using 

30 the proteins of the invention, or part thereof, to associate specific mRNAs to the inner face of lipidic 
bilayers of liposomes in order to further introduce these mRNAs into the cytoplasm of eukaryotic 
cells. Preferably, specific mRNAs are first associated with the protein of the invention and the 
RNA/protein complex formed in that way is then mixed with liposomes according to methods 
known to those skilled in the art. These liposomes are added to an in vitro culture of eukaryotic 

35 cells. In vivo, such a method might treat and/or prevent disorders linked to dysregulation of gene 
transcription such as cancer and other disorders relating to abnormal cellular differentiation, 
proliferation, or degeneration. 

A decrease in ribosome function results in a significant inhibition of cell growth. 
Therefore, in another embodiment, the present proteins and nucleic acids can be used to modulate 
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the rate of cell growth in vitro or in vivo. Accordingly, compounds that inhibits the expression or 
function of the proteins of the invention can be used to inhibit the growth rate of cells, and can thus 
be used, e.g. in the treatment or prevention of diseases or conditions associated with excessive cell 
growth, such as cancer or inflammatory conditions. Such compounds include, but are not limited 
S to, antibodies, antisense molecules, dominant negative forms of the proteins, and any heterologous 
compounds that inhibit the expression or the activity of the proteins. 

It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NO:133 and 429 and polypeptides of SEQ ID NO:302 and 543, described throughout the present 
application also pertain to the human cDNA of clone 1000891 255, and the polypeptides encoded 
10 thereby. 

Proteins of SEP IDNO:271 (Internal designation Clone 10:493328). related clones 1S326L 
1 52042, 599054. and 650872 and related protein of SEP ID NO:533. 

15 The polynucleotides of SEQ ID Nos:102, 103, 104, 105, and 106 and polypeptides of SEQ 

ID Nos:271, 272, 273, 274, 275 and 533 encode for the HUMAN GENSET BINDING PRGTEIN 
or HGBP-1. Preferred polypeptides comprise the amino acid sequence 

MKVKIKCWNGVATWLWVANDENCGICRMAFNGCCPDCKVPGDDCPLVWGQCSHCFHM 
HCILKWLHAQQVQQHCPMCRQEWKFKE. It will be appreciated that all characteristics and 
20 uses of the polynucleotides of SEQ ID Nos: 1 02, 1 03, 1 04, 1 05, and 1 06 and polypeptides of SEQ 
ID Nos:271, 272, 273, 274, 275 and 533, described throughout the present application also pertain 
to the human cDNA of clones 493328, 153261, 152042, 599054, and 650872, and the polypeptides 
encoded thereby. 

The protein of SEQ ID NG: 271 encoded by the extended cDN A SEQ ID NG: 1 02 is the 
25 same as a hepatocellular carcinoma associated ring finger protein (EMBL AF247565) and Genset 
protein in WGOI00806 (Genpep accession AX061622) with homology to an anaphase-promoting 
complex (APC) subunit from Drosophila (Embl accession number AJ251510). In addition, HGBP- 
l exhibits the pfam PHD zinc fmger signature from positions 33 to 79. 

Zinc binding domains which contain a C3HC4 sequence motif are known as RING domains 
30 (Lovering, R. et al. (1993) Proc. Natl. Acad. Sci. USA 90:21 12-2116). Zinc fmger domains are 
found in numerous zinc binding proteins which are involved in protein-protein and protein-nucleic 
acid interactions. They are independently folded zinc-containing mini-domains which are used in a 
modular repeating fashion to achieve sequence-specific recognition of DNA (Klug 1993 Gene 135, 
83-92). Such zinc binding proteins are commonly involved in the regulation of gene expression, 
35 and usually serve as transcription factors, either by directly affecting transcription or recruiting co- 
activators or co-repressors (see US patents 5,866,325; 6,013,453 and 5,861 ,495). PHD fingers are 
C4HC3 zinc fingers spanning approximately 50-80 residues and distinct from RING fingers or LIM 
domains. They are thought to be mostly DNA or RNA binding domain but may also be involved in 
protein-protein interactions (for a review see Aasland et al, Trends Biochem Sci 20:56-59 (1995)). 
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HGBP-1 or part thereof is a zinc binding protein, which is able to bind nucleic acids, more 
preferably a transcription factor. Preferred polypeptides of the invention are polypeptides 
comprising the amino acids of SEQ ID NO: 27 1 from positions 33 to 79. Other preferred 
polypeptides of the invention are fragments of SEQ ID NO: 271 having any of the biological 
5 activity described herein. The nucleic acid binding activity of the protein of the invention or part 
thereof may be assayed using any of the assays known to those skilled in the art including those 
described in US patent 6,013,453. 

The invention relates to methods and compositions using the protein of the invention or part 
thereof to bind to nucleic acids, preferably DNA, alone or in combination with other substances. 
1 0 For example, the protein of the invention or part thereof is added to a sample containing nucleic 
acid in conditions allowing binding, and allowed to bind to nucleic acids. In a preferred 
embodiment, the protein of the invention or part thereof may be used to purify nucleic acids such as 
restriction fragments. 

In another preferred embodiment, HGBP polypeptides or parts thereof may be used to 

1 5 visualize nucleic acids when the polypeptide is linked to an appropriate fusion partner, or is 
detected by probing with an antibody. Thus, HGBP polypeptides can be used to diagnose... 

Alternatively, the protein of the invention or part thereof may be bound to a 
chromatographic support, either alone or in combination with other DNA binding proteins, using 
techniques well known in the art, to form an affinity chromatography column. A sample containing 

20 nucleic acids to purify is run through the column. Immobilizing the protein of the invention or part 
thereof on a support advantageous is particularly for those embodiments in which the method is to 
be practiced on a commercial scale. This immobilization facilitates the removal of the protein from 
the batch of product and subsequent reuse of the protein. Immobilization of the protein of the 
invention or part thereof can be accomplished, for example, by inserting a cellulose-binding domain 

25 in the protein. One of skill in the art will understand that other methods of immobilization could 
also be used and are described in the available literature. 

In another embodiment, the present invention relates to compositions and methods using the 
protein of the invention or part thereof, especially the zinc binding domain, to alter the expression 
of genes of interest in a target cells. Such genes of interest may be disease related genes, such as 

30 oncogenes or exogenous genes from pathogens, such as bacteria or viruses using any techniques 
known to those skilled in the art including those described in US patents 5,861,495; 5,866,325 and 
6.013,453. 

In a further embodiment, the protein of the invention or part thereof may be used to 
diagnose, treat and/or prevent disorders linked to dysregulation of gene transcription such as cancer 
35 and other disorders relating to abnormal cellular differentiation, proliferation, or degeneration, 
including hyperaldosteronism, hypocortisolism (Addison's disease), hyperthyroidism (Grave's 
disease), hypothyroidism, colorectal polyps, gastritis, gastric and duodenal ulcers, ulcerative colitis, 
and Crohn's disease. The invention relates to methods of diagnosing, treating and/or preventing 
disorders described herein, comprising delivering to a patient, or causing to be present therein, a 
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zinc finger polypeptide which inhibits the expression of a gene enabling the cells to divide. The 
target could be, for example an oncogene or a normal gene, which is overexpressed in the cancer 
cells. 



5 ISPG Iron-sulfur cluster protein (Clone ID: 1000872335) 

The polynucleotides of SEQ ID NOs:43 and 374 encodes the amino acids sequence of SEQ 
ID NOs:212 and 491 respectively, an iron-sulfur protein which mediates electron transfer in 
metabolic reactions, also referred to as ISPG. It will be appreciated that all characteristics and uses 
of the polynucleotides of SEQ ID NOs 43 and 374 and polypeptides of SEQ ID NOs:212 and 491, 

10 described throughout the present application also pertain to the human cDNA of clone 1000872335, 
and the polypeptides encoded thereby. 

ISPG is an iron-sulfur protein that belongs to the broad family of the 2Fe-2S-type 
ferredoxins. The 2Fe-2S-type ferredoxins are proteins or domains of around one hundred amino 
acid residues that bind a single 2Fe-2S iron-sulfur cluster and are found in plants, animals and 

15 bacteria. Iron-sulfur cluster proteins are well known classes of proteins and are recognized as ideal 
devices for accepting, donating, storing and shifting electrons. It will be appreciated by the skilled 
artisan that stuctural aspect of iron-sulfur proteins have been studied extensively (reviewed in 
Beinert et al, Nature 1 Aug 1997, 277:653-659), allowing modifications of the ISPG proteins to fine 
tune its properties for desired uses while retaining its biological function in mediating electron 

20 transfer and possible protein stabilization and iron or sulfide storage functions. The ISPG protein of 
SEQ ID NO 491 comprisese a glycosaminoglycan attachment site at amino acid positios 34 
(SGSG); protein kinase C phosphorylation sites at amino acid positions 14 (SAR), 44 (TTR), and 
86 (SGR); N-myristoylation sites at amino acid positions 1 1 (GGVSAR), 24 (GTXWNR), 31 
(GGTSGS), 39 (GVALGT), 106 (GACEAS), a cytochrome c family heme-binding site at amino 

25 acid positions 114-119 and an iron-sulfur binding region signature at amino acid positions 108-1 18. 
In view of its role in electron transfer reactions, ISPG is thought to be involved in a wide 
variety of metabolic reactions and disorders, and may be useful in the treatment of disorders of 
metabolism such as obesity, in the detection of toxic compounds, in prediction, diagnosis or 
treatment of conditions or traits related to drug metabolism or in treatments related to the synthesis 

30 of eg. steroid hormones. 

In a preferred example, overexpression or administration of the ISPG protein may be used 
as a therapeutic treatment for obesity by accelerating the metabolic rate of a subject in need of 
treatment. There is accumulating evidence to support the hypothesis that a low-energy-output 
phenotype is at high risk of weight gain and obesity, irrespective of whether this is owing to a low 

35 resting metabolic rate and/or physical inactivity. The low-energy-output phenotype is associated 
with impaired appetite control, which is improved if energy output is increased, serving as the 
background for pharmacologic stimulation of energy expenditure as a tool to improve the results of 
obesity management. The ISPG protein and agonists or stimulators thereof may serve as a means to 
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increase electron transfer and hence the metabolic rate of an individual in a similar goal as 
commonly cited targets such as leptin receptors, the sympathetic nervous system and its peripheral 
bela-adrenoceptors, selective thyroid hormone derivatives, and stimulation of the mitochondrial 
uncoupling proteins. 

S In addition, iron-sulfur proteins such as ISPG are generally recognized as being capable of 

several functions that are not of an oxidoreductive nature such as the binding and activation of 
substrates at the unique iron site (in the catalytic ilinction of aconitase and relates enzymes), and 
apparently stabilizing radicals in reactions occurring by a free-radical pathway. There is also 
evidence suggesting that such iron-sulfur clusters can function in coupling electron transfer to 

10 proton transport. By binding Cys ligands from different subunits, iron-sulfur clusters effect dimer 
formation, as in the Fe protein of nitrogenase. Further, by straddling protein structural elements, 
iron-sulfur clusters are able to stabilize structures that are required for specific functions (eg. 
endonuclease III of E. coli). Proteins of ISPG's class have also been shown to protect proteins from 
the attack of intracellular proteases. Finally, proteins of ISPG's class are thought to be capable of 

I S serving as storage devices for iron and possibly sulfide. 

Thus, in a few examples, ISPG may be used advantageously as an iron or metal biosensor, 
for the treatment and/or diagnosis of iron overload disorders, or in applications involving stabilizing 
target proteins such as for protein production or for mediating protein interactions. 

Additionally, structural aspects of ISPG suggest that it may be capable of mediating steroid 

20 hormone synthesis, either in human or animals, or in engineered cell culture systems for the large 
scale production of hormones. ISPG may be used to act as an adrenal ferredoxin (known as 
adrenodoxin (ADX)), a vertebrate mitochondrial protein which transfers electrons from 
adrenodoxin reductase to cytochrome P450scc, which is involved in cholesterol side chain cleavage. 
Its primary function as a soluble electron carrier between the NADPH-dependent adrenodoxin 

25 reductase and several cytochromes P450 makes it an irreplaceable component of the steroid 
hormones biosynthesis in the adrenal mitochondria of vertebrates. 

DRUG METABOLISM 

Previous studies have revealed that cytochrome P-4S0 iso^mes are responsible for drug 
metabolism, and oxidation by P-450 isozymes is a common aspect of the overall clearance of drugs. 

30 Further studies have revealed that genetic polymorphism of cytochrome P-450 isozymes underlies a 
wide spectrum of substrates specificity in drug oxidation. In certain cases, genetic mutation and/or 
deletion of one critical isozyme gene results in a significant alteration of a phenotype projected on 
substrate specificity. It has been reported that CYP2D6 oxidizes more than 30 drugs (for example, 
M, Eichelbaum et al., Pharmacol. Ther., Vol. 46, pp. 377-, 1990). Many anti-cancer drugs are 

35 known to be oxygenated by cytochrome P450 enzymes to yield metabolites that are cytotoxic or 
cytostatic toward tumor cells. These include several commonly used cancer chemotherapeutic 
drugs, such as cyclophosphamide (CPA), its isomer ifosfamide (IFA), dacarbazine, procarbazine, 
thio-TEPA. etoposide, 2-aminoanthracene, 4-ipomeanol, and tamoxifen (LeBlanc, G. A. and 
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Waxman, D. J., Drug Metab. Rev. 20:395-439 (1989); Ng, S. F. and Waxman D. J., Intl. J. 
Oncology 2:731-738 (1993); Goeptar, A. R., et al., Cancer Res. 54:241 1-2418 (1994); van Maanen, 
J. M., et al., Cancer Res, 47:4658-4662 (1987); Dehal, S. S., et al., Cancer Res. 57:3402-3406 
(1997); Rainov, N. G., et al., Human Gene Therapy 9:1261-1273 (1998)). Bioreductive metabolism 
5 that results in drug activation is also catalyzed by cytochrome P450 enzymes for a variety of anti- 
cancer drugs. Examples of such drugs include Adriamycin, mitomycin C, and 
tetramethylbenzoquinone (Goeptar, A. R., et al., Crit. Rev. Toxicol. 25:25-65 (1995); Goeptar, A. 
R., et al., Mol. Pharmacol. 44:1267-1277 (1993)). Those who have homozygous alteration in this 
recessive gene, are so-called "poor metabolizers (PMs)" and may suffer from severe side effects due 

10 to poor metabolism of drugs (for example, see M. Eichelbaum et a!., Pharmacol. Then, Vol. 46, pp. 
377-, 1990). Such genetic alterations occur at rates of from 1 to 30% in different ethnic populations 
(for example, L. M. Distlerath et al., J. Biol. Chem., Vol. 260, pp. 9057-, 1985). 

The ISPG protein of the invention is thought to be capable of functioning as a soluble 
electron carrier in the electron transport chain involving one or more of the several available 

1 5 cytochromes P450 enzymes. The ISPG protein may thus be useful in methods of killing neoplastic 
cells involving P450 (and ISPG) gene transfer and the use of bioreductive drugs that are activated 
. by cytochrome P450, and in methods for evaluating the susceptibility of a sample compound to 
metabolism with respect to a specific cytochrome P-450 isozyme system. 

Thus, in a first aspect, a drug activation/gene therapy strategy has been developed based on 

20 a cytochrome P450 gene ("CYP" or "P450") in combination with a cancer chemotherapeutic agent 
that is activated through a P450-catalyzed monoxygenase reaction (Chen, L. and Waxman, D. J., 
Cancer Research 55:581-589 (1995); Wei, M. X., et al.. Hum. Gene Ther. 5:969-978 (1994); U.S. 
Pat. No. 5,688,773, issued Nov. 18, 1997). Presently known drug-enzyme combinations can utilize 
established chemotherapeutic drugs widely used in cancer therapy. Such methods to obtain 

25 enhanced chemosensitivity have been demonstrated both in vitro and in studies using a 

subcutaneous rodent solid tumor model and human breast tumor grown in nude mice in vivo, and is 
strikingly effective in spite of the presence of a substantial liver-associated capacity for drug 
activation in these animals (Chen, L., et al., Cancer Res. 55:581-589 (1995); Chen, L., et al., Cancer 
Res. 56:1331-1340 (1996)). The P450-based approach also shows significant utility for gene 

30 therapy applications in the treatment of brain tumors (Wei, M. X., et al., Human Gene Ther. 5:969- 
978 (1994); Manome, Y., et al., Gene Therapy 3:513-520 (1996); Chase, M., et al.. Nature 
Biotechnol. 16:444-448(1998)). 

Although the P450/drug activation system has shown great promise against several tumor 
types, further enhancement of the activity of this system is needed to achieve clinically effective, 

35 durable responses in cancer patients. This requirement is necessitated by two characteristics that are 
inherent to the P450 enzyme system: (1) P450 enzymes metabolize drugs and other foreign 
chemicals, including cancer chemotherapeutic drugs, at low rates, with a typical P450 turnover 
number (moles of metabolite formed/mole P450 enzyme) of only 10-30 per minute; and (2) P450 
enzymes metabolize many chemotherapeutic drugs with high Km values, typically in the millimolar 
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range. This compares to plasma drug concentrations that are only in the micromolar range for many 
chemotherapeutic drugs, including drugs such as CPA and IFA. Thus, current approaches to P450 
gene therapy may result in intratumoral drug activation at a low absolute rate and under conditions 
that are not saturating with respect to drug substrate. Furthermore, since P450 is expressed at a very 
5 high level in liver tissue, only a very small fraction of the administered chemotherapeutic drug is 
metabolized via the tumor cell P4S0 gene product using the currently available methods for P4S0 
gene therapy (Chen, L and Waxman, D. J., Cancer Res. 55:581-589 (1995)). As described in U.S. 
Patent No. 6,207,648, one enhancement involves introducing a P450 reductase (RED) gene in 
combination with a cytochrome P450 gene (and thus a P450 gene product) into neoplastic cells, the 

10 enzymatic conversion of a P450-activated chemotherapeutic drug to its therapeutically active 
metabolites is greatly enhanced within the cellular and anatomic locale of the tumor, thereby 
increasing both the selectivity and efficiency with which neoplastic cells are killed. 

In a preferred embodiment, further enhancements to known prodrug-enzyme strategies may 
be achieved by introducing an ISPG gene into neoplastic cells, either alone or in combination with a 

1 5 P450 gene and/or a P450 reductase gene. Suitable vectors for the introduction and expression of 
said ISTG and P450 genes are known to one of skill in the art. The introduction of the ISPG gene 
and subsequent expression of the ISPG gene product may increase the enzymatic conversion of a 
P450-activated chemotherapeutic drug to its therapeutically active metabolites. Thus, the invention 
comprises a method for killing neoplastic cells comprising: (a) infecting the neoplastic cells with a 

20 vector for gene delivery, the vector comprising an ISPG gene capable of mediating enzymatic 
conversion of a chemotherapeutic agent by a P450 enzyme; (b) optionally infecting the neoplastic 
cells with a vector for gene delivery, the vector comprising a cytochrome P450 gene and/or a gene 
encoding RED; (b) treating the neoplastic cells with a chemotherapeutic agent that is activated by 
the product of the cytochrome P450 gene; and (c) killing the neoplastic cells. 

25 The present invention also provides a reagent composition for use in evaluating drug 

metabolism by a specific cytochrome P-450 isozyme, which comprises a liver microsome lacking 
said specific P-450, said specific P-450 isozyme and a carrier material. The liver microsome may be 
of human source lacking CYP2D6, CYP2C 19, or CYP2A6. The CYP2D6 isozyme, CYP2C 19 
isozyme, and CYP2A6 isozyme to be added may be a recombinant CYP2D6-expressing 

30 microsome, a recombinant CYP2C 1 9-expressing microsome, or a recombinant CYP2 A6- 
expressing microsome. The reagent composition may comprise more than one kind of PM 
microsomes. 

According to the present invention, there can be provided a reagent composition and a 
method for accurately quantitating the contribution of certain P-450 isozymes such as CYP2D6, 
35 CYP2C 19, and CYP2A6 in drug metabolism. The present invention provides a method for 
evaluating the susceptibility of a sample compound to metabolism with respect to a specific 
cytochrome P-450 isozyme, which comprises contacting the sample compound with a reagent 
composition prepared by adding said specific cytochrome P-450 isozyme and an ISPG protein to 
liver microsomes lacking said specific cytochrome P-450 isozyme in a carrier material. ISPG 
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would be useful in order to enhance efficiency of the P-450 isozyme in drug metabolism, thereby 
effectively amplifying the power of the assay to detect the contribution of a particular P-450 
enzyme. In another embodiment the contribution to drug metabolism of a particular P-4S0 system 
can be assessed by focussing on a particular iron-sulfur protein associated with said specific P-4S0 
S system. In this aspect, the method comprises contacting the sample compound with a reagent 
composition prepared by adding said specific ISPG protein to liver microsomes lacking said ISPG 
protein in a carrier material. The method may further comprise (a) incubating a mixture of the 
sample compound and the reagent composition; (b) extraction of the reaction mixture obtained in 
Step (a); and (c) analyzing the reaction products isolated in Step (b). For the purposes of 
10 quantitating the assay, a plurality of the reagent compositions having different amount of the 
specific P-450 isozyme or ISPG protein may be subjected to Step (a) to (c), respectively. For 
example, the specific P-450 isozyme to be used in the method may be selected from CYP2D6, 
CYP2C19,CYP2A6, CYPlAl andCYP2El. 

IRON BIOSENSOR 

1 5 Iron-sulfur clusters have been found to serve as sensors of iron, dioxygen, superoxide ion 

and possibly nitric oxide. Two main mechanisms of sensing have been described. In one example, 
the oxidation [Fe^S^]!"*" [Fe2s2]2+ by dioxygen provides the signal for activation of a defense 
mechanism against superoxide, as observed with the SoxR protein of E. coli, and thus may serve a 
cytoprotective function (eg. useful for treatment of ischemia, etc.). In an alternative mechanism, 

20 oxidative disassembly or reassembly of a cluster provides the controlling signal as in the FNR 
protein of E.coli. 

In a further detailed example, the ISPG protein of the invention maybe be used as an iron 
biosensor. An example of an iron biosensor is provided in U.S. Patent No. 5,5 16,697 (Kruzel et 
al.). In summary, ISPG can be immobilized in the vicinity of a device to measure the change in pH, 

25 causing a detectable variation of the potential upon the binding of iron. 

In the process of sequestering iron, the sensing element, the ISPG protein is expected to 
release a number of protons of hydrogen (H^) directly proportional to the atoms of iron bound. The 
release of protons during the binding of iron by ISPG becomes the operative feature which is 
measured by the biosensors. The release of protons causes a change in pH and is measured by an 

30 ion-selective field effect transistor or by pH sensitive paper. A sample containing iron is placed into 
a buffered solution, usually water. The sample may be diluted one or more times. In one 
embodiment of the sensors of the present invention, the release of protons is measured as the 
variation of the potential on the surface of an ion-selective field effect transistor (an ISFET) 
(Reviewed in: Biosensor Technology, edited by Buck et al. and published by Marcel Decker, Inc., 

35 1990, entitled "Solid State Potentiometric Sensors" by Jiri Janata, pp 17-34). In another 
embodiment of the present invention, the protons released upon binding of iron by ISPG are 
detected by the change in pH using pH sensitive paper. Preferably, the iron selective element 
(ISPG) is incorporated in close proximity or integrated with the signal transducer, to give a 
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reagentless sensing system for iron. Since the signal can be amplified, only small quantities of ISPG 
are needed for detection of iron. The ISFET is modified by immobilizing ISPG on the surface of the 
ISFET or by a disposable membrane with immobilized ISPG that is in close proximity to the ISFET 
by attaching the ISPG -modified membrane to the surface of the ISFET. A sample containing iron, 
5 for example a biological sample such as body fluid from a mammal, particularly a human, is then 
contacted with the ISPG -modified ISFET. 

In order to produce an ISPG-modified ISFET as an independent sensor, an existing system 
which uses an ISFET designed to measure pH can be modified. Systems which presently use an 
ISFET to measure pH are the Sentron 2001 pH system, manufactured by Integrated Sensor 

10 Technology, Federal Way, Wash.; the Coming 360i pH system, manufactured by Coming 
Incorporated, Coming N.Y.; or Orion 610 pH system, manufactured by Orion Analytical 
Technology, Inc., Boston, Mass. The modification required to measure the amount of iron in a 
sample is either to place an immobilized layer of ISPG on the ISFET of such a system or, 
alternatively, to provide a ISPG -modified membrane, i.e. a membrane coated with ISPG, which 

IS will be in close proximity to the existing ISFET so as to detect the release of protons when the ISPG 
binds iron in a sample and records the change in potential. 

Iron sensitive biosensors (as well as treatments for iron overload disorders) are extremely 
valuable It is estimated that 30,000,000 Americans suffer from different types of iron related 
disorders, including a substantial proportion with profound iron deficiency syndrome. Detection of 

20 bioaccessible iron is one of the most important measurements that doctors can use for early 

detection if iron deficiency, iron overload or other types of immunological disorders. To date iron 
is measured through a combination of blood tests that detect iron and iron binding capacity of 
transferrin, the protein that transports iron through the body. The current technology involves very 
sophisticated instmmentation which make this analysis prohibitively expensive and often requires 

25 qualified personnel to analyze the sample. Therefore, there is a need for the direct assay of iron that 
combines simplicity and economics. ISPG may therefore be advantageously used in development of 
a biosensor for detecting the amount of iron in a sample. 

STEROID BIOSYNTHESIS 

As noted above, ISPG may be used as an adrenal ferredoxin (known as adrenodoxin 
30 (ADX)), a vertebrate mitochondrial protein which transfers electrons from adrenodoxin reductase to 
cytochrome P450scc, which is involved in cholesterol side chain cleavage and is an irreplaceable 
component of the steroid hormones biosynthesis in the adrenal mitochondria. 

In therapeutic embodiments, ISPG may have particular importance in treatment of disorders 
where it is desired to increase the level of steroid hormone synthesis. As P4S0scc has a critical role 
35 in synthesis of the conversion of cholesterol into pregnenolone, ISPG may be used as a limiter or 
enhancer of steroid synthesis. In but one example, evidence has been shown that cytochrome 
P450scc activity in the human placenta is limited by the supply of electrons to the P450scc, 
Furthermore, Tuckey et al, Eur. J. Biochem. 1999 Jul;263(2): 319-325 have shown that p450scc 
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activity can be increased considerably by adding adrenodoxin reductase and adrenodoxin. Thus, 
ISPG may be useful in the treatment of reproductive disorders by augmenting the electron supply to 
P450scc, and thus increasing the level of progesterone synthesis. Accordingly, in another example, 
ISPG may be used to limit steroid synthesis, whether for therapeutic or for research uses. 
S ISPG may also be used in biological steroid synthesis processes for the production of 

steroid hormones. For example, Duport et al, Nat Biotechnol 1998 Feb; 16(2): 1 86-9 report a system 
for self-sufficient biosynthesis of pregnenolone and progesterone in engineered yeast wherein the 
first two steps of the steroidogenic pathway were reproduced in Saccharomyces cerevisiae. 
Engineering of sterol biosynthesis by disruption of the delta 22-desaturase gene and introduction of 
10 the Arabidopsis thaliana delta 7-reductase activity and coexpression of bovine side chain cleavage 
cytochrome P450, adrenodoxin, and adrenodoxin reductase, lead to pregnenolone biosynthesis from 
simple carbon source. As ISPG is thought to be capable of functioning as an adrenodoxin protein, 
ISPG may be used as a function substitute in the system of Duport et al for adrenodoxin. . 



15 MTG (METALLOTHIONEIN) (Clone 10:6546271 

SEQ IDNOS 96 and 413 and clone FLl l:654627_,182-5-3-0-F10.F encode the polypeptide 
of SEQ ID NOs:265 and 527 respectively, a metal loth ionein protein which binds heavy metal. Said 
polypeptide of the invention is also referred herein as MTG. It will be appreciated that all 
characteristics and uses of the polynucleotides of SEQ ID NOs:96 and 413 and polypeptides of SEQ 

20 ID NO:265 and 527, described throughout the present application also pertain to the human cDNA 
of clone 654627, and the polypeptides encoded thereby. 

Metallothioneins (MT) [1,2,3] are small proteins which bind heavy metals such as zinc, 
copper, cadmium, nickel, etc., through clusters of thiolate bonds. MPs occur throughout the 
animal kingdom and are also found in higher plants, fiingi and some prokaryotes and are thought to 

25 play a role in metal detoxification or in the metabolism and homeostasis of metals. On the basis of 
structural relationships MPs have been subdivided into three classes. Class I includes mammalian 
MPs as well as MPs from crustacean and molluscs, but with clearly related primary structure. 
Class II groups together MPs from various species such as sea urchins, fungi, insects and 
cyanobacteria which display none or only very distant correspondence to class I MPs. 

30 Class III MPs are atypical polypeptides containing gamma-glutamylcysteinyl units. 

Vertebrate class I MPs such as the MTG protein of the invention are proteins of typically 
60 to 68 amino acid residues, 20 of these residues are cysteines that bind to 7 bivalent metal ions. 
As a signature pattern we chose a region that spans 19 residues and which contains seven of the 
metal-binding cysteines, this region is located in the N-terminal section of class-I MPs. A 

35 consensus pattern for class I MPs is as follows: C-x-C-[GSTAP]-x(2)-C-x-C-x(2)-C-x-C-x(2)-C-x- 
K. 

The MTG protein of SEQ ID NO 527 has a metallothionein domain (Prosite ref PS00203) 
at amino acid positions 13-31; an N-glycosylation site at position 4 (NCSC), a protein kinase C 
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phosphorylation site at amino acid positions 18 (SCK), 28 (SCK) and 55 (SQR); a casein kinase II 
phosphorylation site at amino acid position 41 (TLVD); an N-myristoylation site at amino acid 
positions 10 (GVSCTC); and a prokaryotic membrane lipoprotein lipid attachment site at amino 
acid position 3 (PNCSCAAGVSC). 

5 THERAPEUTICS 

Discovery of new proteins related to metallothioneins, and the polynucleotides that encode 
them, satisfies a need in the art by providing new diagnostic or therapeutic compositions useful in 
diagnosing and treating heavy metal toxicity, cancer, inflammatory disease and immune disorders. 
Acute or chronic exposure to heavy metals such as lead, arsenic, mercury or cadmium leads 

10 to a variety of diseases and disorders involving neuromuscular, CNS, cardiovascular, and 

gastrointestinal effects. MTs may play a role in the prevention or alleviation of these conditions. In 
addition, MTs are transcriptionally regulated by glucocorticoids, which suggests that MTs have a 
direct role in the effects of glucocorticoids to treat inflammatory disease, immune disorders, and 
cancer. It is therefore thought that MTG may have important applications in the treatment of 

1 5 inflammatory disease^ immune disorders, and cancer as well as in cytoprotection in a variety of 
therapeutic applications. 

In one preferred example, the MTG nucleic acids and protein may be used for suppressing 
the production of sunburn cells which is applicable in various manners with minimal adverse side 
effects, a method of inducing metallothionein, a method of treating skin diseases and a method of 

20 screening ultraviolet rays, and further relates to cosmetic compositions and UV screening 
compositions. 

Conventionally, steroids and zinc oxide formulations have been topically used as medicines 
for treating skin diseases such as dermatitis, sunburn, neurodermatitis, eczema and anogenital 
pruritus. Steroids, however, have been difficult to administer in large quantities for a prolonged 

25 period due to their strong adverse side effects. Zinc oxide formulations, which have local astringent 
action, involve problems with respect to the manufacture of pharmaceuticals, since they are 
insoluble in water and are not usually administered internally. 

Zinc, one of the indispensable trace metals in the living body, is known to participate in the 
development of sexual organs, promotion of wound healing and is also known to be a component of 

30 a metalloenzyme, an accelerator for dehydrogenase, and to have various functions such as activating 
the immune system. Zinc is further known to be an inducing factor of metallothionein (MT), a 
metal-combining protein. It is reported that MT functions as a scavenger of free radicals which are 
generated at the onset of inflammations ["Dermatologica", Hanada, k., et al., 179 (suppl. 1) 143 
(1989)]. 

35 As proposed in U.S. Patent No. 5,582,817 (Otsu et al), MTG may be useful in treatment of 

dermatological inflammations caused by external irritative stimulants, such as sunburn or the like, 
where MTG could act to quench the free radicals released from leukocytes, especially granulocytes 
which gather at the inflamed region, and thereby exhibit an anti-oxidation action to diminish cell 
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damage, especially to normal lymphocytes, to activate the immune system and further to prevent the 
accelerated aging of the skin. Formation of sunburn cells (SBCs) could be suppressed by 
administering zinc for inducing MTG to be present, or to increase MTG in the epidermal keratinous 
layer. Anti-oxidation action of MTG can also be useful in the treatment of skin problems resulting 
S from radiation therapy by X rays, alpha rays, beta rays, gamma rays, neutron rays and accelerated 
electron rays. 

Various zinc compounds have been studied by Otsu et al (supra) with respect to their 
pharmacological activities, who reported that zinc salts or zinc complexes of a certain compound 
have an excellent action of inducing metallothionen (MT) and suppressing sunburn cell (SBC) 
10 production due to UV rays, and thereby useful as components of cosmetic compositions or 

medicines for purposes of ameliorating sunburn, preventing sunburn, ameliorating sufferings from 
skin diseases and ameliorating other radiation induced disorders, leading to completion of the 
invention. 

There are two different types of dermatological reactions caused by sunlight, one is an acute 
1 5 inflammatory change in the skin called sunburn, and the other is a subsequent melanin pigmentation 
called suntan. The light having a wave length in the range of 320 nm or less, called UVB, induces 
sunburn and is responsible for erythematous change. The erythemic reaction caused by U V rays, as 
opposed to a bum injury, does not occur immediately after the exposure to the sunlight, but rather 
occurs after a latent period of several hours. When sunburned skin is histopathologically examined, 
20 various degrees of inflammatory changes are recognized in the epidermis and dermis depending on 
the dose of radiation. Among such changes, a notable one is the generation of so-called sunburn 
cells (SBC) in the epidermis. A histologically stained tissue sample presents strongly and 
acidophilically stained cells which have pyknotic nuclei. This phenomenon indicates the necrosis of 
epidermal cells ("Fragrance Journal", 9, 1 5-20 (199 1 ). In order to prevent sunburn, para- 
25 aminobenzoic acid derivatives, cinnamic acid derivatives or the like U V absorbers mentioned above 
are used, but their UV absorbing effects are not necessarily satisfactory. What is more, they raise 
problems of cumbersome handling upon use, poor stability, low compatibility with other 
components of the composition, and also involve unsolved problems in water-resistance and oil- 
resistance. 

30 In the field of medicines for the treatment of skin diseases, development of medicines 

which have minimal adverse side effects, and which have novel functions obtainable by both 
external and internal administrations has been desired. Also, in the field of the therapy and 
prevention of radiation disorders, medicines which can suppress and cure the disorders caused by 
oxidative reactions have been desired. Lastly, in the field of the manufacture of cosmetics, 

35 cosmetics which overcome the above-mentioned problems such as handling upon use and stability 
of the composition have been desired. Accordingly, the present invention encompasses providing 
therapeutic agents for treating skin diseases having the above-mentioned characteristics, wherein 
said agents are capable of inducing MTG for suppressing the formation of sunburn cells, and for use 
in cosmetic compositions. Also encompassed are methods of screening for therapeutic agents for 
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treating skin diseases comprising bringing a test compound into contact with a cell, tissue or animal 
model of disease, and detecting induction of MTG expression or function. 

GENE EXPRESSION SYSTEMS 

The MTG nucleic acid and proteins of the invention may also be advantageously used in the 
5 production of recombinant proteins as biopharmaceutical products at commercial scale. 

Previously, genes have been extensively expressed in mammalian cell lines, particularly in 
mutant Chinese Hamster Ovary (CHO) cells deficient in the dihydrofolate reductase gene (dhfr) as 
devised by the method of Urlaub et al, PNAS U.S.A. 77, 4216-4220, 1980. A variety of expression 
systems have been used. Many vectors for the expression of genes in such cells are therefore 

1 0 available. Typically, the selection procedures used to isolate cells transformed with the expression 
vectors rely on using methotrexate to select for transformants in which both the dhfr and the target 
genes are coamplified. The dhfr gene, which enables cells to withstand methotrexate, is usually 
incorporated in the vector with the gene whose expression is desired. Selection of cells under 
increasing concentrations of methotrexate is then performed. This leads to amplification of the 

1 5 number of dhfr genes present in each cell of the population, as cells with higher copy numbers 
withstand greater concentrations of methotrexate. As the dhfr gene is amplified, the copy number of 
the gene of interest increases concomitantly with the copy number of the dhfr gene, so that 
increased expression of the gene of interest is achieved. Unfortunately, these amplified genes have 
been reported to be variably unstable in the absence of continued selection (Schimke, J. Biol. Chem. 

20 263, 5989-5992, 1988). This instability is inherent to the presently available expression systems of 
CHO dhfr.sup.- cells. For many years, several promoters have been used to drive the expression of 
the target genes such as the SV40 early promoter, the CMV early promoter and the SR.alpha. 
promoter. The CMV and SR.alpha. promoters are claimed to be the strongest (Wenger et al, Anal. 
Biochem. 221,416-418, 1994). 

25 In one report, the .beta.-interferon promoter has also been used to drive the expression of 

the .beta.-interferon gene in the mutant CHO dhfr.sup.- cells (U.S. Pat, No. 5,376,567). In this 
system, however, the selected CHO dhfr cells had to be superinduced by the method of Tan et al 
(Tan et al, PNAS U.S.A. 67, 464-471, 1970; Tan et al, U.S. Pat. No. 3,773.924) to eflFect a higher 
level of .beta.-interferon production. In this system a significant percentage of the superinduced 

30 .beta.-interferon produced by the CHO dhfr- cells was not glycosylated. The mouse metallothionein 
gene (mMTl) promoter has also been used for the expression of beta-interferon genes in CHO cells, 
BHK and LTK.sup.- mouse cells (Reiser et al 1987 Drug Res. 37, 4, 482-485). However, the 
expression of .beta.-interferon with this promoter was not as good as the SV40 early promoter in 
CHO cells. Further, .beta.-interferon expression from these cells mediated by the mMTl promoter 

35 was inducible by heavy metals. Heavy metals are however extremely toxic to the cells and this 
system was therefore abandoned. Instead, Reiser et al used the CHO dhfr- expression system in 
conjunction with the SV40 early promoter (Reiser et al. Drug Res. 37,4, 482-485 (1987) and EP-A- 
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0529300) to produce .beta.-interferon in CHO dhfr- cells as derived by the method of Urlaub et al 
(1980). 

As described in U.S. Patent No. 6,207,146 (Tan et al) beta.-interferon was expressed in 
wild-type CHO cells using a metallothionein based system. MTG may thus be used in similar 
5 applications so as to provide a system for expression of recombinant proteins. Tan et al 

demonstrates wild-type CHO cells transfected with a vector comprising a .beta.-interferon gene 
under the control of a mouse sarcoma viral enhancer and mouse metallothionein promoter (MSV- 
mMTl), a neo gene under the control of promoter capable of driving expression of the neo gene in 
both E. coll and mammalian cells and a human metallothionein gene having its own promoter. 

10 Transfected cells capable of expressing .beta.-interferon were selected by first exposing cells to 
geneticin (antiobiotic G4 1 8) and thus eliminating cells lacking the neo gene and then exposing the 
surviving cells to increasing concentrations of a heavy metal ion. 

The heavy metal ion enhanced the MSV-mMTl promoter for the .beta.-interferon gene, 
thus increasing .beta.-interferon expression. The heavy metal ion also induced the human 

IS metallothionein gene promoter, causing expression of human metallothionein. The human 

metallothionein protected the cells against the toxic effect of the heavy metal ion. The presence of 
the heavy metal ion ensured that there was continual selection of cells which had the transfecting 
vector, or at least the .beta.-interferon gene and the human metallothionein gene and their respective 
promoters, integrated into their genome. 

20 The selected cells that had been successfully transfected expressed .beta.-interferon. 

Expression was surprisingly improved when the cells were cultured in the presence of Zn^"^. The 
.beta.-interferon had improved properties, in particular a higher bioavailability, than prior .beta.- 
interferons. 

These findings have general applicability and suggest that the MTG gene of the present 
25 invention may be used accordingly in expression systems. Accordingly, the present invention 
provides a nucleic acid vector comprising: 

(i) a coding sequence which encodes a protein of interest and which is operably linked to a 
promoter capable of directing expression of the coding sequence in a mammalian cell in the 
presence of a heavy metal ion; (ii) a first selectable marker sequence which comprises an MTG 
30 gene of the invention and which is operably linked to a promoter capable of directing expression of 
the MTG gene in a mammalian cell in the presence of a heavy metal ion; and optionally (iii) a 
second selectable marker sequence which comprises a neo gene and which is operably linked to a 
promoter capable of directing expression of the neo gene in a mammalian cell; 

35 CDPG fGLYCOSYL PHQSPHATiDYLINOSlTOL-LlNKED GLYCQPROTEIM^ (Clone 
ID: 10009029 1 7) 

SEQ ID NOS 3 and 34 1 and clone FL 1 1 : 1 0009029 1 7_223-52-4-0-G3-F encode the 
polypeptide of SEQ ID NOS 172 and 458 respectively, a glycosyl phosphatidylinositol-linked 
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glycoprotein protein which is thought to be a signal transducing polypeptide expressed in lymphoid, 
myeloid, and erythroid cells. Said polypeptide comprises a CD24 signal transducing domain as 
well as a GPI-anchor of the invention is also referred herein as CDPG. CDPG is believed to be 
highly glycosylated, and it is expected that CDPG molecular weight will vary among cell types and 
5 cell developmental stage due to differences in glycosylation patterns, providing further specificity in 
its use as a therapeutic target. It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID NOs: 3 and 34 1 and polypeptides of SEQ ID NO: 1 72 and 458, 
described throughout the present application also pertain to the human cDNA of clone 1000902917, 
and the polypeptides encoded thereby. 

10 It is suggested that CDPG may have a specific role to play in early thymocyte development. 

The CDPG protein is thought to be extensively o-glycosylated may be capable of modulating b-cell 
activation responses. As a signaling transducer, the CDPG polypeptide's signal transducing 
function may in some embodiments be triggered by the binding of a lectin-like ligand to the CD24 
domain carbohydrates, and the release of second messengers allowing signaling. The CDPG 

I S polypeptide is thought to have important functions in regulating the differentiation and/or growth of 
lymphoid, myeloid, and erythroid cells, including specifically promoting antigen dependent 
proliferation of b-cells, and preventing their terminal differentiation into antibody-forming cells. 

Additionally, based on a growing body of evidence characterizing the CD24 domain 
function, it is proposed that CDPG may have a role as a potent stimulator of neurite outgrowth, and 

20 thus may be useful in the treatment of central nervous system disorders. 

Fragments of CDPG may also be useful, eg. GPI-anchor domain for example in the 
development of soluble T-cell receptors (U.S. Patent No. 6,080,840) or any suitable application 
where a temporally controlled solubilization of a protein of interest is desired. For example, a 
fragment comprising the CDPG GPI-anchor domain can be used in the production of soluble 

25 molecules by replacing the transmembrane domains of the cDNA of a protein of interest with a 
sequence comprising the CDPG glycosylphosphatidyl inositol (GPI) linkage. These chimeric 
cDNAs are then transferred into an expression vector containing a strong promoter and a mutant 
(e.g. DHFR) gene allowing high levels of transcriptional expression and amplification of the gene. 
These chimeric genes are then cotransfected into a selected cell type, preferably lacking the 

30 endogenous protein of interest, and transfectants are selected and can also be screened with 

antibodies for the protein of interest. These GPI linked proteins of interest can then be solubilized 
by cleavage with the enzyme phosphatidyl inositol specific phospholipase C (PI-PLC) and 
purified/concentrated from the supernatant (e.g. by passage over a protein of interest-reactive 
antibody affinity column). 

35 THERAPEUTICS 

CDPG or inhibitors of CDPG may be used in the treatment of any disorder where it is 
desired to regulate B-cell proliferation or differentiation. CDPG or inhibitors thereof may be useful 
in the treatment of B-cell neoplasms, a heterogeneous group of diseases characterized by different 
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maturation states of the B-cell, which are related to the aggressiveness of the disorder. Chronic 
lymphocytic leukemia (CLL) is characterized by proliferation and accumulation of B-lymphocytic 
leukemia (BLL) is characterized by proliferation and accumulation of B-lymphocytes that appear 
morphologically mature but are biologically immature. This disorder accounts for 30% of leukemias 
5 in Western countries. The disorder is characterized by proliferation of biologically immature 
lymphocytes, unable to produce immunoglobulins, which cause lymph node enlargement. As a 
regulator of B-cell proliferation and differentiation, CDPG and/or inhibitors of CDPG may be 
useful for inhibiting proliferation of leukemic B-cells in CLL patients. 

CDPG may also be useful in the modulation of cell growth in the CNS. CD24 is known to 
10 be highly expressed in neurons and has been demonstrated as capable of inhibiting neurite 

outgrowth of dorsal root ganglion neurons while promoting neurite outgrowth of cerebellar neurons 
via interaction with an LI protein. 

SELECTABLE CELL MARKERS 

In one aspect, the CDPG polypeptide may be used as a selectable cell marker and to a 

15 method of using the selectable marker to identify a cell. Viruses such as recombinant retroviruses 
have been used as a vehicle for gene transfer based on their potential for highly efficient infection 
and non-toxic integration of their genome into a wide range of cell types. The transfer of exogenous 
genes into mammalian cells may be used, for example in gene therapy to correct an inherited or 
acquired disorder through the synthesis of missing or defective gene products in vivo. The 

20 expression of exogenous genes in cells may be useful in somatic gene therapy, to correct hereditable 
disorders at the level of the gene. Hemopoietic stem cells are particularly suited to somatic gene 
therapy as regenerative bone marrow cells may be readily isolated, modified by gene transfer and 
transplanted into an immunocompromised host to reconstitute the hosfs hemopoietic system. 

Gene therapy involving hone marrow transplant with recombinant primary hemopoietic 

2S stem cells requires efficient gene transfer into the stem cells. As a very small number of primary 
stem cells can reconstitute the entire host hemopoietic system it is important that the transferred 
gene be efficiently expressed in the recombinant stem cells transferred. The transfer of foreign 
genes into a reconstituted host hemopoietic system has been limited by the availability of a 
selectable marker which permits the rapid and non-toxic selection of cells which are efficiently 

30 expressing the transferred gene. Currently available selection markers may not be suitable for 
primary hemopoietic stem cells since they may alter the proliferative ability or biological 
characteristics of the cells. The transfer of foreign genes into a reconstituted host hemopoietic 
system has also been limited by the availability of a viral vector capable of expression in 
hemopoietic stem cells, especially where more than one transcriptional unit is present in the vector 

35 (Botrell, D. R. L. et al., 1987, Mol. Biol. Med. 4:229). 

U.S. Patent No. 5,804,177 (Humphries et al) has demonstrated that the cell surface protein 
CD24 (also M1/69-J1 Id heat stable antigen) can be used as a dominant marker in a recombinant 
viral vector. A nucleotide sequence encoding the cell surface protein CD24 in a recombinant viral 
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vector was used to infect hematopoietic stem cells and cells infected with the recombinant viral 
vector were rapidly and non-toxically selected for in vitro using fluorescence activated cell sorting 
(FACS), demonstrating a good correlation between proviral copy number and expression of 
selectable marker. 

5 CD24 is a signal transducing molecule found on the surface of most human B cells that can 

modulate their responses to activation signals, and is structurally closely related to CDPG. The 
CD24 CDNA (approximately 300 bps) has been cloned (Kay, R. et al, 1 99 1 , J. Immunol. 1 47: 1 4 1 2) 
and encodes a mature peptide of only 31 to 35 amino acids that is extensively glycosylated and 
attached to the outer surface of the plasma membrane by a glycosyl phosphatidylinositol lipid 

10 anchor. M1/69-J1 Id heat stable antigen is a genetically similar homologous murine peptide widely 
expressed on a variety of hemopoietic cell types (Kay, R. et al., 1990, J. Immunol. 145:1952). 

It is thus proposed that a recombinant viral vector can be used to successfully transfer and 
express the CDPG gene in primitive hemopoietic stem cells such that they are able to repopulate 
lethally irradiated recipients. Preferably foreign CDPG antigen expression in repopulated animals 

1 5 persists post transplantation such that the biological function of the repopulated hemopoietic cells is 
not affected by the expression of the CDPG antigen. CDPG may subsequently be found to be 
expressed in any or all of hemopoietic lineages including granulocytes, macrophages, pro- 
erythrocytes, erythrocytes and T and B lymphocytes. Therefore, the cell surface protein CDPG may 
be particularly usefiil as a marker for hematopoietic stem cells capable of repopulation in vivo and 

20 as a selectable marker in gene therapy. The recombinant viral vectors also have the advantage that 
the nucleotide sequence encoding the marker is very small, leaving a large amount of space for the 
insertion of additional genes of interest such as those coding for exogenous genes. 

In a preferred embodiment of the invention a recombinant viral vector is used to introduce 
the nucleotide sequence into the cell. Preferably, the CDPG nucleotide sequence is operatively 

25 linked to one or more regulatory elements. The recombinant viral vector of the invention may be 
used as a marker for an exogenous gene to be expressed in a host cell. The invention further 
provides a method of identifying a cell and progeny thereof comprising: providing a cell; infecting 
the cell with a recombinant viral vector of the invention under suitable conditions to allow 
expression of the cell surface protein CDPG on the cell; and, identifying the cell and progeny 

30 thereof by detecting expression of the cell surface protein CDPG on the cell or progeny thereof 
Cells infected with a recombinant viral vector of the invention and expressing the cell surface 
protein may be transplanted into a host, and the cell and progeny thereof may be identified after 
transplantation by removing biological samples from the host, and assaying for cells expressing the 
cell surface protein. A recombinant viral vector of the invention may be directly introduced into a 

35 host. 

ENRICHING STEM CELL COMPOSITIONS 

As it is proposed that CDPG is involved in early thymocyte development and is found on 
the cell surface due to its GPI-anchor, CDPG nucleic acids and polypeptides of the present 
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invention may also be used to obtain novel antibody compositions useful for preparing cell 
preparations containing human hematopoietic cells. 

There is a continued interest in developing stem cell purification techniques. Pure 
populations of stem cells will facilitate studies of hematopoiesis. Transplantation of hematopoietic 
5 cells from peripheral blood and/or bone marrow is also increasingly used in combination with high- 
dose chemo- and/or radiotherapy for the treatment of a variety of disorders including malignant, 
nonmaltgnant and genetic disorders. Very few cells in such transplants are capable of long-term 
hematopoietic reconstitution, and thus there is a strong stimulus to develop techniques for 
purification of hematopoietic stem cells. Furthermore, serious complications and indeed the success 
10 of a transplant procedure is to a large degree dependent on the effectiveness of the procedures that 
are used for the removal of cells in the transplant that pose a risk to the transplant recipient. Such 
cells include T lymphocytes that are responsible for graft versus host disease (GVHD) in allogenic 
grafts, and tumour cells in autologous transplants that may cause recurrence of the malignant 
growth. 

IS Hematopoietic cells have been separated on the basis of physical characteristics such as 

density and on the basis of susceptibility to certain pharmacological agents which kill cycling cells. 
The advent of monoclonal antibodies against cell surface antigens has greatly expanded the 
potential to distinguish and separate distinct cell types. There are two basic approaches to separating 
cell populations from bone marrow and peripheral blood using monoclonal antibodies. They differ 

20 in whether it is the desired or undesired cells which are distinguished/labeled with the antibody(s). 
In positive selection techniques the desired cells are labeled with antibodies and removed from the 
remaining unlabeled/unwanted cells. In negative selection, the unwanted cells are labeled and 
removed. Antibody/complement treatment and the use of immunotoxins are negative selection 
techniques, but FACS sorting and most batch wise immunoadsorption techniques can be adapted to 

25 both positive and negative selection. In immunoadsorption techniques cells are selected with 
monoclonal antibodies and preferentially bound to a surface which can be removed from the 
remainder of the cells e,g. column of beads, flasks, magnetic particles. Immunoadsorption 
techniques have won favor clinically and in research because they maintain the high specificity of 
targeting cells with monoclonal antibodies, but unlike FACSorting, they can be scaled up to deal 

30 directly with the large numbers of cells in a clinical harvest and they avoid the dangers of using 
cytotoxic reagents such as immunotoxins, and complement. 

Current positive selection techniques for the purification of hematopoietic stem cells target 
and isolate cells which express CD34. However, positive selection procedures suflFer from many 
disadvantages including the presence of materials such as antibodies and/or magnetic beads on the 

35 CD34"^ cells, and damage to the cells resulting from the removal of these materials. 

Negative selection has been used to remove minor populations of cells from clinical grafts. 
These cells are either T-cells or tumour cells that pose a risk to the transplant recipient. The 
efficiency of these purges varies with the technique and depends on the type and number of 



141 



wo 02/083898 PCT/IBO 1/009 14 

antibodies used. Typically, the end product is very similar to the start suspension, missing only the 
tumor cells or T-cells. 

As described in U.S. Patent No. 5,877,299, Thomas et al developed a negative selection 
technique that uses an antibody composition containing antibodies specific for glycophorin A, CD3, 
5 CD24, CD16, CD14 and optionally CD45RA, CD36, CD2, CD19, CD56, CD66a, and CD66b, 
which reportedly gave a cell preparation highly enriched for human hematopoietic and progenitor 
cells. Maximum enrichment of early progenitor and stem cells (CD34"*', CD38" cells) was observed 
when anti-CD45R and anti-CD36 were included in the antibody composition. However, as CDPG 
is proposed as acting in early thymocyte development, CDPG may be used advantageuously to 
10 develop more effective antibody compositions for selecting hematopoietic stem cells. Accordingly, 
the invention encompasses antibodies specific for CDPG polypeptides of the invention and antibody 
compositions comprising, consisting of or consisting essentially of antibodies specific for CDPG, 
glycophorin A, CD3, CD24, CD16, CD14 and optionally CD45RA, CD36, CD2, CD19, CD56, 
CD66a, and CD66b. 

1 5 Use of the antibody composition comprising CDPG in a negative selection technique to 

prepare a cell preparation which is enriched for hematopoietic stem cells and progenitor cells may 
offer significant advantages over conventional techniques. The antibody composition is applied in 
one step to a sample of peripheral blood, bone marrow, cord blood or frozen bone marrow, 
preferably without additional enrichments steps which could result in loss of, or damage to, 

20 progenitor and stem cells. 

PRSG (PROLTNE-RICH CALCIUM-BINDING PROTEINS and HSTG (BASIC HISTIDINE RICH 
SALIVARY GLAND PEPTIDE) (Clone ID:3381 12 and 10008393 15 resoectiveM 

SEQ ID NOS 22 and 358 and clone FLl 1:10008393 1 5_220.26-l-0-F3.F encode the 
25 polypeptide of SEQ ID NOS 1 91 and 475 respectively, a basic histidine rich salivary gland peptide 
referred to herein as HSTG and expected to have potent antimicrobial properties. Preferably, the 
amino acid sequence of HSTG (SEQ ID NO 475) comprises a tyrosine at amino acid position 40. 
The HSTG protein also comprises a Pattem-DE: Protein kinase C phosphorylation site at amino 
acid position 5 1 (SSK). It will be appreciated that all characteristics and uses of the polynucleotides 
30 of SEQ ID NOs: 22 and 358 and polypeptides of SEQ ID NO: 191 and 475, described throughout 
the present application also pertain to the human cDNA of clone 10008393 15, and the polypeptides 
encoded thereby. 

SEQ ID NOS 8 and 345 and clone FLn:338l 12,174.M-0-Al 1-F encode the polypeptide 
of SEQ ID NOS 177 and 462 respectively, a proline-rich protein referred to herein as PRSG 
35 believed to be a component of saliva and a calcium binding protein also possessing potent 
antimicrobial properties. Preferably, the amino acid sequence of PRSG (SEQ ID NO 462) 
comprises a proline residue at amino acid position 96, an arginine residue at amino acid position 
1 00, a glutamine residue at amino acid positoin 1 02, and/or a glycine residue at amino acid position 
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103. The PRSG protein also comprises a casein kinase 11 phosphorylation site at amino acid 
positions 15 (SAQD), 24 (SQED), and 59 (SAGD) and an N-myristoylation site at amino acid 
position 52 (GGQQSQ). It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID N0s:8 and 345 and polypeptides of SEQ ID NO: 177 and 462, 
5 described throughout the present application also pertain to the human cDNA of clone 3381 12, and 
the polypeptides encoded thereby. 

A study of saliva and its tooth-protective components reveals at least four important 
i\jnctions of saliva: (1) buffering ability, (2) a cleansing effect, (3) antibacterial action, and (4) 
maintenance of a saliva supersaturated in calcium phosphate. Several salivary constituents serve one 

1 0 or more of these functions. Research has yielded important information about organic and inorganic 
secretory products. It is also clear that saliva as a unique biologic fluid has to be considered in its 
entirety to account ftilly for its effects on teeth. Saliva is greater then the sum of its parts. One 
reason for this is that salivary components display redundancy of function, each often having more 
than one function. This redundancy, however, does not imply that proteins that share functional 

1 S roles all contribute to the same degree. For instance, when comparing proteins that inhibit calcium 
phosphate precipitation, statherin and acidic proline-rich proteins are most potent, whereas histatins, 
cystatins, and mucins appear to play lesser roles. The complex interaction between proteins is 
another major factor contributing to saliva's function. In this regard, heterotypic complexes of 
various proteins have been shown to form on hydroxyapatite. Mucin binding to other salivary 

20 proteins, including proline-rich proteins, histatins, cystatins, and statherin, is well documented. The 
complexes, whether adsorbed to the tooth surface or in saliva, have important implications for 
bacterial clearance, selective bacterial aggregation on the tooth surface, and control of 
mineralization and demineralization. Finally, proteolytic activity of saliva generates numerous 
products whose biologic activities are often different from their parent compounds. The ability of 

25 saliva to deliver fluoride to the tooth surface constantly makes salivary fluoride an important player 
in caries protection largely by promoting remineralization and reducing demineralization. Saliva is 
well adapted to protection against dental caries. Saliva's buffering capability; the ability of the saliva 
to wash the tooth surface, to clear bacteria, and to control demineralization and mineralization; 
saliva's antibacterial activities; and perhaps other mechanisms all contribute to its essential role in 

30 the health of teeth. The fact that the protective function of saliva can be overwhelmed by bacterial 
action indicates the importance of prevention and therapy as in other infectious diseases. With 
knowledge of salivary components and their interactions, the use of modified oral molecules as 
therapeutic agents may become a important contributor to oral health. 

Proline-rich proteins are major components of parotid and submandibular saliva in humans 

35 as well as other animals. They can be divided into acidic, basic and glycosylated proteins. The 
proline-rich proteins are apparently synthesized the acinar cells of the salivary glands and their 
phenotypic expression is under complex genetic control. The acidic proline-rich proteins will bind 
calcium with a strength which indicates that they may be important in maintaining the concentration 
of ionic calcium in saliva. Moreover they can inhibit formation of hydroxyapatite, whereby growth 
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of hydroxyapatite crystals on the tooth surface in vivo may be avoided. Both of these activities as 
well as the binding site for hydroxyapatite are located in the N-terminal proline-poor part of the 
protein. 

Basic histidine rich salivary gland peptides such as the peptide of SEQ ID NO. 191 and 
S 475, also referred to as histatins are a group of electrophoretically distinct histidine-rich 
polypeptides with microbicidal activity found in human parotid and submandibular gland 
secretions. Histatins 1, 3» and S are homologous proteins that consist of 38, 32, and 24 amino acid 
residues, respectively, that have been shown to kill the pathogenic yeast, Candida albicans. More 
recently histatins 2, 4, 6, and 7-12 were isolated and characterized Troxler RF et al, J Dent Res 1990 

10 Jan;69(l):2-6. Histatin 2 was found to be identical to the carboxyl terminal 26 residues of histatin 1; 
histatin 4 was found to be identical to the carboxyl terminal 20 residues of histatin 3; and histatin 6 
was found to be identical to histatin 5, but contained an additional carboxyl terminal arginine 
residue. The amino acid sequences of histatins 7-12 formally corresponded to residues 12-24, 13- 
24, 12-25, 13-25, 5-1 1, and 5-12, respectively, of histatin 3, but could also arise proteolytically from 

1 5 histatin 5 or 6. Troxler et al provides further guidance on the structural elements and relationship of 
histatins to one another in the context of their genetic origin, biosynthesis and secretion into the oral 
cavity, and potential as reagents in anti-candidal studies. The HSTG polypeptide and fragments 
thereof are therefore expected to have valuable properties and uses in antimicrobial applications, 
particularly in antifungal applications. Supporting such uses is a considerable body of evidence, 

20 including MacKay BJ et al. Infect Immun. 1984 Jun;44(3):695-701, Growth-inhibitory and 

bactericidal effects of human parotid salivary histidine-rich polypeptides on Streptococcus mutants; 
MacKay BJ et al, Infect Immun. 1984 Jun;44(3):688-94, Isolation of milligram quantities of a group 
of histidine-rich polypeptides from human parotid saliva; Pollock JJ et al, Infect Immun. 1984 
Jun;44(3):702-7, Fungistatic and fungicidal activity of human parotid salivary histidine-rich 

25 polypeptides on Candida albicans; and Xu T et al, Infect Immun. 1991 Aug;59(8):2549-54, 

Anticandidal activity of major human salivary histatins. Furthermore, tissue distribution of RNAs 
for cystatins, histatins, statherin, and proline-rich salivary proteins in humans and macaques is 
further discussed in Sabatini et al, J Dent Res 1989 Jul;68(7):1 138-45. 

In further embodiments, the skilled artisan will appreciate that fragments and analogues of 

30 PRSG and HSTG may readily be generated and selected. Selection of preferred fragments and 
analogies may be carried out by assaying for a desired antimicrobial activity. For example, synthetic 
histatin analogues and methods for obtaining such analogies with broad-spectrum antimicrobial 
activity are described in Helmerhorst EJ et al, Biochem J 1997 Aug 1 5;326 ( Pt l):39-45, where 
histatin analogies inhibited the growth of the second most common yeast found in clinical isolates, 

35 Torulopsis glabrata, of oral- and non-oral pathogens such as Prevotella intermedia and 
Streptococcus mutants, and of a methicillin-resistant Staphylococcus aureus. 

Thus, in preferred embodiments, the PRSG and/or HSTG polypeptides or fragments thereof 
may be used in oral, injectable, topical or edible compositions for the treatment of infection PRSG 
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and/or HSTG polypeptides may also be used as antimicrobial/antifungal compositions for 
disinfection of surfaces (e.g. in industrial settings). 

In a preferred example further discussed below, the PRSG and/or HSTG polypeptides or 
fragments thereof are used in oral, topical (e.g. mouthwash) or edible compositions optionally 
S containing additional salivaiy proteins to provide an anticaries effect. While there is an interest in 
developing and marketing products which reduce caries without reliance on a high level of fluoride 
ions (such as in fluoridated water and fluoride toothpastes), there have not been many reports of 
such approaches meeting with success. While certain cysteine-rich proteins have been proposed 
useful in the treatment of dental caries (U.S. Patent No. 5,688,766, Revis et al), the present 

10 invention provides PRSG and HSTG polypeptides which may provide higher potency, efficacy and 
range of disinfection and protection. 

PRSG and HSTG polypeptide compositions can be administered in a formulation 
comprising a carrier. A preferred carrier composition for the active(s) of this invention are oral 
compositions. Such compositions include toothpastes, mouthrinses, liquid dentifrices, lozenges, 

I S chewing gums or other vehicle suitable for use in the oral cavity. Toothpastes and mouthrinses are 
the preferred systems. The abrasive polishing material contemplated for use in the toothpaste 
compositions of the present invention can be any material which does not excessively abrade dentin. 
These include, for example, silicas including gels and precipitates, calcium carbonate, dicalcium 
orthophosphate dihydrate, calcium pyrophosphate, tricalcium phosphate, calcium 

20 polymetaphosphate, insoluble sodium polymetaphosphate, hydrated alumina, and resinous abrasive 
materials such as particulate condensation products of urea and formaldehyde, and others such as 
disclosed by Cooley et al. in U.S. Pat. No. 3,070,51 0, Dec. 25, 1962, incorporated herein by 
reference. Mixtures of abrasives may also be used. Silica dental abrasives, of various types, can 
provide the unique benefits of exceptional dental cleaning and polishing performance without 

25 unduly abrading tooth enamel or dentin. For these reasons, they are preferred for use herein. 

Flavoring agents can also be added to toothpaste compositions. Suitable flavoring agents include oil 
of wintergreen, oil of peppermint, oil of spearmint, oil of sassafras, and oil of clove. Sweetening 
agents which can be used include aspartame, acesulfame, saccharin, dextrose, levulose and sodium 
cyclamate. Flavoring and sweetening agents are generally used in toothpastes at levels of from 

30 about 0.005% to about 2% by weight. Toothpaste compositions can also contain emulsifying agents. 
Suitable emulsifying agents are those which are reasonably stable and foam throughout a wide pH 
range, including non-soap anionic, nonionic, cationic, zwittefionic and amphoteric organic synthetic 
surfactants. Water is also present in the toothpastes of this invention. Water employed in the 
preparation of commercially suitable toothpastes should preferably be deionized and free of organic 

35 impurities. In preparing toothpastes, it is necessary to add some thickening material to provide a 
desirable consistency. Preferred thickening agents are carboxyvinyl polymers, carrageenan, 
hydroxyethyl cellulose and water soluble salts of cellulose ethers such as sodium carboxymethyl 
cellulose and sodium carboxymethyl hydroxyethyl cellulose. Natural gums such as gum karaya, 
xanthan gum, gum arabic, and gum tragacanth can also be used. Colloidal magnesium aluminum 
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silicate or finely divided silica can be used as part of the thickening agent to further improve 
texture. Thickening agents in ane amount from 0.2% to 5.0% by weight of the total composition can 
be used. It is also desirable to include some humectant material in a toothpaste to keep it from 
hardening. Suitable humectants include glycerin, sorbitol, and other edible polyhydric alcohols at a 
5 level of from about 1 5% to about 70%. 

Another preferred embodiment of the present invention is a mouthwash composition. 
Conventional mouthwash composition components can comprise the carrier for the agents of the 
present invention. Mouthwashes generally comprise from about 20:1 to about 2:1 of a water/ethyl 
alcohol solution or be alcohol free and preferably other ingredients such as flavor, sweeteners, 

10 humectants and sudsing agents such as those mentioned above for dentifrices. The humectants, such 
as glycerin and sorbitol give a moist feel to the mouth. Generally, on a weight basis the 
mouthwashes of the invention comprise 0% to 60% (preferably 5% to 20%) ethyl alcohol, 0% to 
20% (preferably 5% to 20%) of a humectant, 0% to 2% (preferably 0.01% to 1 .0%) emulsifying 
agents, 0% to 0.5% (preferably 0.005% to 0.06%) sweetening agent such as saccharin or natural 

1 5 sweeteners such as stevroside 0% to 0.3% (preferably 0.03% to 0.3%) flavoring agent, and the 
balance water. 

The pH of the present compositions and/or the pH in the mouth can be any pH which is safe 
for the mouth's hard and soft tissues. Such pH's are generally from about 3 to about 10, preferably 
from about 4 to about 8. Other acceptable oral carders include gums, lozenges, as well as other 

20 forms. Such suitable forms are disclosed in U.S. Pat. No. 4,083,955, Apr. 11, 1978 to Grabenstetter 
et al. incorporated herein in its entirety by reference. Edible compositions are also suitable for use 
as the carrier compositions herein. Edible compositions include many types of solid as well as 
liquid compositions. Such compositions include, for example, soft drinks, citrus drinks, cookies, 
cakes, breads among many others. Such compositions may contain sugar or another sweetener, 

25 water, flour, shortening, other fibers such as wheat, com, barley, rye, oats, psyllium and mixtures 
thereof. 

HSDG (HYDROXYSTEROID DEHYDROGENASES (Clone ID:495917) 

SEQ ID NOS 54 and 381 and clone FLl 1:49591 7_160-22-4-0-D8-F encode the polypeptide 
30 of SEQ ID NOS 223 and 498 respectively, a hydroxysteroid dehydrogenase referred to herein as 
HSDG. As the HSDG polypeptide is implicated in steroid hormone regulation, preferably 
glucocorticoid metabolism, HSDG may be useful in any applications where steroid hormones levels 
are to be increased or inhibited. HSDG may be useful in the treatment of disease treatable by steroid 
hormones. HSDG inhibitors may also be useful in systems for self-sufficient biosynthesis of steroid 
35 hormones such as glucocorticoids such as in engineered cells comprising elements of the synthesis 
pathway. HSDG inhibitors of endogenous HSDG activity may allow the recovery of higher 
amounts of glucocorticoids and/or other synthesized steroid hormones from these cell systems. 
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It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 
NOs: 54 and 381 and polypeptides of SEQ ID NO: 223 and 498, described throughout the present 
application also pertain to the human cDNA of clone 495917 , and the polypeptides encoded 
thereby. 

5 The HSDG polypeptides of the invention comprise leucine zipper pattern (Prosite ref. 

PS00029) at amino acid positions 58-79 as well as a N-myristoylation site at amino acid position 36 
(GANAGV) of SEQ ID NO 498. In one example, HSDG polypeptides may be used in the 
production or therapeutic modulation of glucocorticoids. The skilled artisan will recognize that any 
suitable HSDG polypeptides or variants or fragments thereof capable of metabolizing 

10 glucocorticoids can readily be used. 

In one embodiment HSDG activity can be determined by detecting levels of glucocorticoids 
or metabolites or glucocorticoids. Moreover, structural aspects of 1 Ibeta-hydroxysteroid 
dehydrogenase have been documented in the art and may serve as a guidelines for developing 
suitable HSDG variants and fragments. The HSDG polypeptides of the invention may allow 

1 S modulation of glucocorticoid activity or identification (e.g. drug screening) of compounds capable 
of specifically modulating glucocorticoid levels or activity. HSDG may be useful in allowing the 
modulation of steroid hormone synthesis, or glucocorticoid synthesis to be carried in a tissue 
specific manner, thereby offering improved methods for treating disease with decreased risk of side 
effects, 

20 Corticosteroids, also referred to as glucocorticoids are steroid hormones, the most common 

form of which is Cortisol. Modulation of glucocorticoid activity is important in regulating 
physiological processes in a wide range of tissues and organs. Glucocorticoids act within the gonads 
to directly suppress testosterone production (Monder, C et al, (1994) Steroids 59, 69-73). High 
levels of glucocorticoids may also result in excessive salt and water retention by the kidneys, 

25 producing high blood pressure. 

Glucocorticoid action is mediated via binding of the molecule to a receptor, defined 
hereinafter as either a mineralocorticoid receptor (MR) or a glucocorticoid receptor (GR). 
Krozowski, Z. S. et al, ((1983) Proc. Natl. Acad. Sci. USA 80, 6056-6060) and Beaumont, K. et al, 
((1983) Endocrinology 1 13, 2043-2049) showed that MR of adrenalectomised rats have an equal 

30 affinity for the mineralocorticoid aldosterone and glucocorticoids, for example corticosterone and 
Cortisol. Confirmatory evidence has been found for human MR (Arriza, J. L et al, (1988) Neuron I, 
887-900). In patients suffering from the congenital syndrome of Apparent Mineralocorticoid 
Excess (AME), Cortisol levels are reportedly elevated and bind to and activate MRs normally 
occupied by aldosterone, the steroid that regulates salt and water balance in the body. Salt and water 

35 are retained in AME patients causing severe hypertension. 

Like HSDG, the enzyme 1 1 P-hydroxysteroid dehydrogenase ( 1 1 pHSD), also discussed in 
U.S. Patent No. 5,965,372 (Funder et al) may be involved in converting glucocorticoids into 
metabolites that are unable to bind to MRs (Edwards et al, (1988) Lancet. 2: 986-9; Funder et al, 
(1988) Science 242, 583,585), present in mineralocorticoid target tissues, for example kidney. 
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pancreas, small intestine, colon, as well as the hippocampus, placenta and gonads. For example, in 
aldosterone target tissues 1 1 PHSD inactivates glucocorticoid molecules, allowing the much lower 
circulating levels of aldosterone to maintain renal homeostasis. When the 1 1 pHSD enzyme is 
inactivated, for example in AME patients or following administration of glycyrrhetinic acid, a 
5 component of licorice, severe hypertension results. Further, placental 1 1 PHSD activity may protect 
the foetus from high circulating levels of glucocorticoid which may predispose to hypertension in 
later life (Edwards et al., 1993). Biochemical characterisation of activity has indicated the presence 
of at least two 11 pHSD isoen^mes(lipHSDl and lipHSD2)with different cofactor requirements 
and substrate affinities. The 1 1 pHSDl enzyme is a low affinity enzyme that prefers NADP+ as a 

10 cofactor (Agarwal et al., 1989). The 1 1 PHSD2 enzyme is a high affinity enzyme (Km for 

glucocorticoid=l 0 nM), requiring NAD+, not NADP+ as the preferred cofactor, belonging to a class 
of glucocorticoid dehydrogenase enzymes hereinafter referred to as "NAD+ dependent 
glucocorticoid dehydrogenase" enzymes. 

Inverse correlation between llpHSD enzyme activity in human granulosa-lutein cells and 

1 5 the success of 1 VF has further been shown, suggesting that activity of this enzyme might be related 
to the success of embryo attachment and implantation following IVF. The measurement of ovarian 
1 1 PHSD enzyme activity as a prognostic indicator for the outcome of assisted conception in all 
species, is the subject of UK Patent Application No 930S984. However, the disclosure of Michael et 
al. ((1993) Lancet 342, 711-712), and corresponding UK Patent Application No 9305984 do not 

20 identify, or even suggest which isoenzyme in the ovary might be a predictive indicator of IVF 
embryo transfer, or a means of distinguishing isoenzymes of 1 1 pHSD in the prediction of IVF 
embryo transfer outcomes. In fact, the enzyme assay procedure might detect all isoenzymes of 
11 PHSD activity in the cell, some of which may be hitherto uncharacterised. 

Thus, the human HSDG hydroxysteroid dehydrogenase enzyme and the nucleic acids 

25 encoding it providing novel means for the development of gene therapies and identification of 
HSDG activators and inhibitors which alter the endogenous activity of this hydroxysteroid 
dehydrogenase enzyme in a cell. The present invention also permits the screening, through genetic 
or immunological means, levels of expression of genes encoding the NAD+ dependent 
glucocorticoid dehydrogenase enzyme in various tissue or organ types, including for example, skin, 

30 colon, kidney, placenta, and gonads, amongst others. 

SIOOG CALCIUM BINDING PROTEIN (Clone ID:200895) 

The polynucleotides of clone FLl 1 :200895_1 1 6-055-1 -0-Hl 1-F and SEQ ID NOS 132 and 
428 encode for the SI 00 calcium binding protein referred to herein as SIOOG. SEQ ID NOS 301 
35 and 542 provide the amino acid sequence corresponding to the nucleic acid sequences of SEQ ID 
NOS 132 and 428, respectively. It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID NOs: 1 32 and 428 and polypeptides of SEQ ID NO: 30 1 and 542, 
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described throughout the present application also pertain to the human cDNA of clone20089S, and 
the polypeptides encoded thereby. 



BACKGROUND 

In nearly all eukaryotic cells, calcium (Ca^"*") functions as an intracellular signaling 
5 molecule in diverse cellular processes including cell proliferation and differentiation, 

neurotransmitter secretion, glycogen metabolism, and skeletal muscle contraction. Within a resting 
cell, the concentration of Ca^"*" in the cytosol is extremely low, <10"7 M, However, when the cell is 
stimulated by an external signal, such as a neural impulse or a growth factor, the cytosolic 
concentration of Ca^"*" increases by about 50-fold. This influx of Ca^"*" is caused by the opening of 

10 plasma membrane Ca^"*" channels and the release of Ca^"^ from intracellular stores such as the 
endoplasmic reticulum. Ca^"'" directly activates regulatory enzymes, such as protein kinase C, 
which trigger signal transduction pathways. 

The protein of SEQ IDNOS 301 and 542 is a calcium binding SI 00 protein typically found 
in heart and muscle. SI 00 proteins are low-molecular weight calcium binding proteins that are 

IS believed to play an important role in various cellular processes such as cytoskeletal/membrane 
interactions, cell division and differentiation. The expression of SI 00 proteins has been evaluated 
in a variety of disorders. For example, SI 00 protein have been evaluated as markers of 
inflammatory disease, including ulcerative colitits, Crohn's disease, and as serum markers for 
subjects with infections diseases including AIDS and malaria and for subjects with hematological 

20 disease. Evidence has accumulated that indicates that SI 00 proteins can alter cellular invasion and 
metastatic spead of cancer. For example, SI 00 protein is expressed in dendritic*cells in human 
transitional cell carcinoma of the bladder and the invasive potential of these tumor has been found 
to correlate with the presence of S 1 00 protein expressing cells. 

THERAPEUTICS AND DIAGNOSTICS 

25 The SI OOG protein of SEQ ID NOS 301 and 542 disclosed herein provides new calcium 

binding SI 00 protein compositions useful in the diagnosis, prevention, and treatment of cancer, 
reproductive disorders, immune disorders, neuronal disorders, vesicle trafficking disorders and 
developmental disorders. 

CBPs are implicated in a variety of disorders and several CBPs have proven to be effective 

30 therapeutic targets for which small molecule inhibitors could be developed. However, while several 
CBPs are targets for widely-used therapeutic treatments, it would be advantageous to provide 
further CBPs allowing more selective therapeutic treatments for disease to be developed. In one 
example, calcineurin is found in the cells of all eukaryotes ranging from yeast to mammals. 
Calcineurin is a target for inhibition by the immunosuppressive agents cyclosporin A and FK506 

35 emphasizing its importance in immune disorders (Kissinger, C. R. et al. (1995) Nature 378:641- 
644). Calcineurin also plays a critical role in transcriptional regulation and growth control in T- 
lymphocytes (Wang, M. G. et al. (1996) Cytogenet. Cell Genet. 72:236-241). However, inhibition 
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of calcineurin phosphatase activity has been implicated both in the mechanism of 
immunosuppression and in the observed toxic side effects of FK506 in nonlymphoid cells, 
suggesting that identification of a new (FK binding proteins (FKBPs) that can mediate calcineurin 
inhibition and are restricted in its expression to T cells could provide new immunosuppressive drugs 
5 may be identified that, by virtue of their specific interaction with the FKBP, would be targeted in 
their site of action (Baughman G, et al Mol Cell Biol 1995 Aug;15(8):4395-402). In another CBP 
example, levels of CaM are increased several-fold in tumors and tumor-derived cell lines for 
various types of cancer (Rasmussen, C. D, and Means, A. R. (1989) Trends in Neuroscience 
12:433-438). Calcium binding SlOOp is another example of a CBP involved in a variety of 

10 disorders. Like the SIOOG protein of the invention, SlOOp contains an EF-hand motif. SlOOp is 
abundantly expressed in the nervous system. SIOOP levels are increased in the blood and 
cerebrospinal fluid of patients with neurological injury resulting from cerebral infarction, transient 
ischemic attacks, hemorrhagia, head trauma, and Down's syndrome. Furthermore, SlOOp and other 
neural-specific CBPs may also protect against neurodegenerative disorders, such as Alzheimer's, 

IS Parkinson's, and Huntington's diseases. SIOOP is produced and secreted by glial cells in the central 
and peripheral nervous systems (Allore, R. J. et al, (1990) J. Biol. Chem. 265:15537-15543). The 
accumulation of SlOOp in mature glial cells is associated with the microtubule network. SIOOP 
promotes neuronal differentiation and survival but may be detrimental to cells if overexpressed. The 
selective overproduction has been implicated in the progression of the neuropathological changes in 

20 Alzheimer's disease which may involve mitotic protein kinases (Marshak, D. R. and Pena, L. A. 
(1992) Prog. Clin. Biol. Res. 379:289-307). Adult T-cell leukaemia (ATL) is a mature T-cell 
malignancy which is caused by human T lymphotrophic virus type-1. Diminished surface 
expression of the T-cell receptor alpha beta (TCRaP+) complex is a specific feature of ATL cells. 
SlOOp is not detectable in CD4+, TCRap+ ATL cells, but is expressed in CD4-, CD8-, TCRaP+ 

25 leukaemic cells from four ATL patients. This suggested that increased levels of S 1 OOP may be 
associated with the diminished surface expression of the TCRaP+ complex in ATL (Suzushima, H. 
et al. (1994) Leuk. Lymphoma 13:257-262). Elevated serum levels of SlOOp are associated with 
disseminated malignant melanoma metastases, suggesting that serum SlOOp may be of value as a 
clinical marker for progression of metastatic melanoma (Henze, G. et al. (1997) Dermatology 

30 1 94:208-2 1 2). In yet another example, messenger RN A levels encoding human calgizzarin (an 
SlOO-like protein), as well as those encoding phospholipase A2, are elevated in colorectal cancers 

compared with those of normal colorectal mucosa (Tanaka, M. et al. (1995) Cancer Lett. 89:195- 
200). Finally, an intracellular SI 00 calcium-binding protein has been isolated from rat peritoneum. 
This protein, MRP 14, is one of two migration inhibitory factor-related proteins that are expressed in 
35 peritoneal macrophages in the arthritis-susceptible Lewis/N rat (Imamichi, T. et al. (1993) Biochem. 
Biophys. Res. Comm. 194:819-825). 

However, despite the many uses and therapeutics based on SI 00 proteins and other CBPs, it 
would be advantageous to selectively target CBPs which are involved in a disorder and which are 
found specifically in the targeted cells or tissues. Thus, in a first embodiment, the SIOOG protein of 
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the invention can be used for the development of selective inhibitors of catcium signaling, eg. 
preferably inhibitors of transcriptional regulation and cell growth control. 

In one embodiment, an antagonist of S lOOG may be administered to a subject to prevent or 
treat a neuronal disorder. Such disorders may include, but are not limited to, akathesia, Alzheimer's 
5 disease, amnesia, amyotrophic lateral sclerosis, bipolar disorder, catatonia, cerebral neoplasms, 
dementia, depression, Down's syndrome, tardive dyskinesia, dystonias, epilepsy, Huntington's 
disease, multiple sclerosis, neurofibromatosis, Parkinson's disease, paranoid psychoses, 
schizophrenia, and Tourette's disorder. In one aspect, an antibody which specifically binds S100G 
may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing 

10 a pharmaceutical agent to cells or tissue which express SIOOG. 

In one embodiment, an antagonist of S I OOG may be administered to a subject to prevent or 
treat a vesicle trafficking disorder. Such disorders may include, but are not limited to, cystic 
fibrosis, glucose-galactose malabsorption syndrome, hypercholesterolemia, diabetes mellitus, 
diabetes insipidus, hyper- and hypoglycemia. Grave's disease, goiter, Cushing's disease, and 

IS Addison's disease; gastrointestinal disorders including ulcerative colitis, gastric and duodenal 
ulcers; other conditions associated with abnormal vesicle trafficking including AIDS; allergies 
including hay fever, asthma, and urticaria (hives); autoimmune hemolytic anemia; proliferative 
glomerulonephritis; inflammatory bowel disease; multiple sclerosis; myasthenia gravis; rheumatoid 
and osteoarthritis; scleroderma; Chediak-Higashi and Sjogren's syndromes; systemic lupus 

20 erythematosus; toxic shock syndrome; traumatic tissue damage; and viral, bacterial, fungal, 

helminth, and protozoal infections. In one aspect, an antibody which specifically binds SIOOG may 
be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a 
pharmaceutical agent to cells or tissue which express S 1 OOG. 

In one embodiment, an antagonist of SIOOG may be administered to a subject to prevent or 

25 treat an immunological disorder. Such disorders may include, but are not limited to, AIDS, 

Addison's disease, adult respiratory distress syndrome, allergies, anemia, asthma, atherosclerosis, 
bronchitis, cholecystitis, Crohn's disease, ulcerative colitis, atopic dermatitis, dermatomyositis, 
diabetes mellitus, emphysema, erythema nodosum, atrophic gastritis, glomerulonephritis, gout, 
Graves* disease, hypereosinophilia, irritable bowel syndrome, lupus erythematosus, multiple 

30 sclerosis, myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, rheumatoid arthritis, scleroderma, Sjogren's syndrome, Werner 
syndrome, and autoimmune thyroiditis; complications of cancer, hemodialysis, and extracorporeal 
circulation; viral, bacterial, fungal, parasitic, protozoal, and helminthic infections; and trauma. In 
one aspect, an antibody which specifically binds SI OOP may be used directly as an antagonist or 

35 indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissue 
which express SIOOG. 

In one embodiment, an antagonist of SIOOG may be administered to a subject to prevent or 
treat a neoplastic disorder. Such disorders may include, but are not limited to, adenocarcinoma, 
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of 
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the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, 
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, 
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus. In one aspect, an antibody 
which specifically binds SIOOP may be used directly as an antagonist or indirectly as a targeting or 
5 delivery mechanism for bringing a pharmaceutical agent to cells or tissue which express S 1 OOG. 

In addition, the SI OOG nucleic acids and polypeptides of the present invention can be used 
to identify compounds for the treatment of a subject experiencing negative side effects from the 
administration of other pharmaceuticals, such as those drugs that disrupt the body's calcium 
homeostasis. Co-administration of said compounds would be useful to counter-effect iatrogenically 
. 10 caused dysfunction of calcium metabolism. 

An antagonist of SI OOG may be produced using methods which are generally known in the 
art. In particular, purified SI OOG may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind SI OOG. In one example, a positive 
screening for drugs that specifically inhibit the Ca2+-signaling activity was carried out on the basis 

1 S of the growth promoting effect on a yeast mutant with a peculiar phenotype (Shitamukai A et al, 
Biosci Biotechnol Biochem 2000 Sep;64(9): 1942-6). An inappropriate activation of a signaling 
pathway in yeast often has a deleterious physiological effect and causes various defects, including 
growth defects. In a certain genetic background (deltazdsl) of Saccharomyces cereivisiae, the cell- 
cycle progression in G2 is specifically blocked in the medium with CaCI2 by the hyperactivation of 

20 the Ca2+-signaling pathways. Shitamukai et al provide an example of a drug screening procedure 
designed to detect the active compounds that specifically attenuate the Ca2+-signaling activity on 
the basis of the ability to abrogate the growth defect of the cells suffering from the hyperactivated 
Ca2+ signal. Screening conditions were established for the drugs that suppress the Ca2+-induced 
growth inhibition using known calcineurin inhibitors as model compounds, and an indicator strain 

25 with an increased drug sensitivity was constructed with a syrl/erg3 null mutation. 

In another embodiment, a vector expressing the complement of the polynucleotide encoding 
SIOOP may be administered to a subject to treat or prevent a neuronal disorder, immunological 
disorder, neoplastic disorder or vesicle trafficking disorder including, but not limited to, those 
described above. In other embodiments, any of the proteins, antagonists, antibodies, agonists, 

30 complementary sequences or vectors of the invention may be administered in combination with 
other appropriate therapeutic agents. Selection of the appropriate agents for use in combination 
therapy may be made by one of ordinary skill in the art, according to conventional pharmaceutical 
principles. The combination of therapeutic agents may act synergistically to effect the treatment or 
prevention of the various disorders described above. Using this approach, one may be able to 

35 achieve therapeutic efficacy with lower dosages of each agent, thus reducing the potential for 
adverse side effects. 

Expression of SI 00 proteins have been evaluated as serum markers for melanoma (Henze et 
al, Dermatology 194:208-212; Bueretal, 1997 Brit. J. Cancer 75: 1373-1376; Sherbertet al. 1998; 
Anticancer Res. 18:241 5-2422) and more recently as serum markers for cancer in general 
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(particularly breast, colon and lung cancers) and that their detectability in serum can have 
prognostic and/or therapeutic significance in cancer. Furthermore auto-antibodies to SI 00 proteins 
were found in cancer patients. (International Patent Publication No. WO 00/26668). The SIOOG 
protein of the invention may thus be used in the diagnosis and prevention of cancer, for 
5 identification of subjects predisposed to cancer, for monitoring patients undergoing treatment for 
cancer based on the increased level of SIOOG protein(s) in biological fluid samples of subjects. 
Methods for diagnosis and prognosis of cancer in a subject may comprise 

a) detecting a SIOOG protein in a biological fluid sample obtained from a subject, and 

b) comparing the level of protein detected in the subject's sample to the level of protein 
10 detected in a control sample, 

wherein an increase in the level of SIOOG protein detected in the subject's sample as 
compared to control samples is an indicator of a subject with cancer of at increased risk for cancer. 

The invention also comprises methods for diagnosis and prognosis of a subject with cancer 
comprising: 

IS a) contacting a serum sample derived from a subject with a sample containing SIOOG 

protein antigens under conditions such that a specific antigen-antibody complex binding can occur; 
and 

b) detecting the presence of immunospecific binding of autoantibodies present in the 
suject's serum samples to the SIOOG protein; 
20 wherein the presence of immunospecific binding of autoantibodies indicates the presence of 

cancer. 

Assays for detection of S 1 OOG protein in a sample can be accomplished by any suitable 
method, including immunoassays where in SIOOG proteins are detected by their interaction with an 
SIOOG specific antibody. In addition, reagents other than antibodies such as for example 

25 polypeptides that specifically bind SIOOG may be used. 

In yet further embodiments, the SIOOG protein of the invention may be useful for the 
development of specific anti-SlOO antibodies which are specific for SI 00 proteins other than 
SIOOG. As several SlOO proteins have been implicated in disease, it may be advantageous to 
develop a panel of SlOO specific antibodies to characterize disease, eg. cancers. Thus, SIOOG 

30 proteins and antibodies thereto may be advantageous to distinguish cancer types. S 1 OOG proteins 
and antibodies may also be useful in the screening of SlOO specific antibodies by determining the 
selectivity of a given anti-SlOO antibody for its target, and eliminating antibodies which are cross- 
reactive with SIOOG proteins. 

STUCTURAL ASPECTS OF SlOO PROTEINS OF THE INVENTION 

35 The SlOO proteins are a group of low molecular mass (approximately 10-12 kDa) acidic 

Ca^*^ -binding proteins, so named after the solubility of the first isolated protein in 100% saturated 
ammonium sulfate. The most striking conserved feature of these proteins is the presence of an EF- 
hand. The SlOO proteins have two Ca^"^ -binding domains. One of these domains is a basic helix- 
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loop-helix domain, the other domain is an acidic helix-loop-helix EF-hand (Kligman, D. and Hilt, 
D. C. (1988) Trends Biochem. Sci. 13:437-442). The EF-hand domain also encompasses a part of a 
region within SI 00 proteins which specifically identifies members of the SI 00 family of proteins 
which have a low affinity for Ca2+ ions (SlOO/lCaBP; PROSITE PS00303, SWISSPROT, PFAM 
S PEG 1023). The EF-hand is characterized by a twelve amino acid residue-containing loop, flanked 
by two alpha-helices, orientated approximately 90 degrees with respect to one another. Aspartate 
(D) and glutamate (E) residues are usually found bordering the twelve amino acid loop. In addition, 
a conserved glycine residue in the central portion of the loop is found in most Ca^'*' -binding EF- 
hand domains. Oxygen ligands within this domain coordinate the Ca^"*" ion (Kretsinger, R. H. and 

10 Nockolds, C. E. (1973) J. Biol. Chem. 248:33 13-3326). It will also be appreciated by the skilled 
artisan that modifications of the S I OOG polypeptide may readily be made based on extensive 
knowledge of CBP stucture and Ca(2+) binding mechanisms, such as Sastry M et al, (Structure 
1998;6:223-231), describing the three-dimensional structure of Ca(2+)-bound calcyclin and its 
implications for Ca(2+)-signal transduction by SI 00 

IS Recently, a protein designated S100A13 has been discovered and characterized which 

shared significant primary structure similarity with the SI OOG polypeptide of the invention. (Wicki 
et al, (1996) BBRC 227:594-599). 

The binding to calcium induces a conformational change in the SI 00 proteins, and this may 
then affect the secondary effector proteins. This mode of protein-protein interaction and modulation 

20 of the activity of the secondary effector protein is similar to that seen with calmodulin, also 
containing the EF-hands. The SI OOG protein of the invention comprises an ICaBP type calcium 
binding domain domain at amino acid positions 9 to 52 and casein kinase II phosphorylation site 
patterns at amino acid position 7 (TELE); 34 (SVNE); and 55 (SLDE) of SEQ ID NO 542. 

The SI OOG polypeptides of the invention may thus also be used in any situation in vivo or 

25 in vitro where it is desired to modulate, preferably decrease the level of free calcium, or in 
applications involving sensing a change in calcium concentration. In one aspect, SI OOG nucleic 
acids and polypeptides may be used to develop a calcium biosensor. Calcium biosensors may have 
particular utility in detecting abnormalities in calcium transport that result in uncompensated influx 
into, or efflux from, the extracellular fluid, will result in hypercalcaemia or hypocalcaemia, 

30 respectively. Such abnormalities in serum calcium concentration may have profound effects on 
neurological, gastrointestinal, and renal function (Bushinsky DA et al. Lancet 1998 Jul 25;352 
(9I24):306-1 1). Calcium biosensors may be developed by using any suitable means known in the 
art to detect a conformational change induced by the binding of calcium to the EF-hand domains of 
the SI OOG polypeptides. Preferably, the conformational change is detected by detecting a change in 

35 the ability of the SI OOG protein to bind a selected secondary effector protein 

As the distribution of particular SIOO proteins is dependent on speciflc cell types, the SlOO 
proteins may be involved in transducing the signal of an increase in intracellular calcium in a cell 
type-specific fashion (Wu, T. et al. (1997) J. Biol. Chem. 272:17145-1 7153). 
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CaMLP (CALCIUM BINDING PROTEIN) (Clone ID:S00742698) 



PCT/IBOl/00914 



Calcium is one of the "second messengers" which relays chemical and electrical signals 
within a cell. This signal transduction and, hence the regulation of biological processes, involves 
interaction of calcium ion with high-affinity calcium-binding proteins (CBPs). Disclosed herein in 
5 SEQ ID NOS 184 and 469 is one such protein, encoded by the nucleic acid sequences of SEQ ID 
NOS 15 and 352, respectively, and the clone FLl l:500742698_204-61-4-0-B2-F, and fijrther 
referred to herein as CaMLP, which is thought to act as a Ca^*^ sensing and binding protein 
involved in diverse aspects of cell proliferation (such as for example of hepatocytes, melanoma 
cells, leukemic lymphocytes, and HUVEC (human umbilical vein endothelial cells)) and 

10 difTerentiation. It will be appreciated that all characteristics and uses of the polynucleotides of SEQ 
ID NOs: 1 5 and 352 and polypeptides of SEQ ID NO: 1 84 and 469, described throughout the present 
application also pertain to the human cDNA of clone 500742698, and the polypeptides encoded 
thereby. Notably, the CaMLP polypeptide contains EF hand calcium-binding domains (PROSITE 
PS00018)ataminoacid positions 81-93 and at position 129-141 of SEQ ID NO 469, CaMLP 

1 5 The cellular processes in which Ca^"^ functions as an intracellular signaling molecule are 

diverse, including cell proliferation and differentiation, neurotransmitter secretion, glycogen 
metabolism, and skeletal muscle contraction. Within a resting cell, the concentration of Ca^"*" in the 
cytosol is extremely low, <10.sup.-7 M. However, when the cell is stimulated by an external signal, 
such as a neural impulse or a growth factor, the cytosolic concentration of Ca^*^ increases by about 

20 50-fold. This influx of Ca2+ is caused by the opening of plasma membrane Ca^"*" channels and the 
release of Ca^"^ from intracellular stores such as the endoplasmic reticulum. Ca^*^ directly activates 
regulatory enzymes, such as protein kinase C, which trigger signal transduction pathways. Ca^"*" 
also binds to specific Ca^"*" -binding proteins (CBPs) such as calbindins, troponin C, calmodulin, 
and S-100 proteins which then activate multiple target proteins including enzymes, membrane 

25 transport pumps, and ion channels. Calmodulin (CaM) is the most widely distributed and the most 
common mediator of calcium effects and appears to be the primary sensor of Ca^"^ changes in 
eukaryotic cells. The binding of Ca^*^ to CaM induces marked conformational changes in the 
protein permitting interaction with, and regulation of over 100 different proteins. CBP interactions 
are involved in a multitude of cellular processes including, but not limited to, gene regulation, DNA 

30 synthesis, cell cycle progression, mitosis, cytokinesis, cytoskeletal organization, muscle contraction, 
signal transduction, ion homeostasis, exocytosis, and metabolic regulation (Celio, M. R. et al. 
(1996) Guidebook to Calcium-binding Proteins, Oxford University Press, Oxford, UK, pp. 15-20). 

THERAPEUTICS 

The CaMLP protein of SEQ ID NOS 184 and 469 disclosed herein provides new calcium 
35 binding protein compositions useful in the diagnosis, prevention, and treatment of cancer, 
reproductive disorders, immune disorders, neuronal disorders and developmental disorders. 

Calcium binding proteins (CBPs) are implicated in a variety of disorders and several CBPs 
have proven to be effective therapeutic targets for which small molecule inhibitors could be 
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developed. However, while several CBPs are targets for widely-used therapeutic treatments, it 
would be advantageous to provide further CBPs allowing more selective therapeutic treatments for 
disease to be developed. Evidence has accumulated for a large number of CBPs suggesting 
involvement in cell proliferative disorders. It is proposed that CaMLP may be useful as a tissue 
S specific calmodulin homologue allowing the development of specific inhibitors and activators 
having increased selectivity and safety (decreased side efiTect profile). To date, calmodulin 
antagonists are reportedly useful for the treatment of some malignant tumors, particularly those of 
the central nervous system, as well as lung tumors. The antitumor activity of calmodulin 
antagonists, as well as successful chemotherapy using the same, has been described, for example, in 

10 Sculler et al. Cancer Res., 50:1645-1649 (1990) and Hait et al. Cancer Res., 50:6636-6640 (1990). 
U.S. Pat. No. 5,340,565, additionally describes the use of calmodulin antagonists or inhibitors as 
agents which enhance the effectiveness of a chemotherapeutic agent or radiation treatment. 
Specifically, described therein is a method of inhibiting or killing a tumor or cancer cell in a human 
patient undergoing radiation therapy or chemotherapy, for example with such chemotherapeutic 

1 5 agents as cisplatin (Platinol®), by additionally administering a calmodulin binding agent which 
inhibits calmodulin activity. 

Calmodulin is also believed to play a pathogenic role in the tissue damage caused by bums 
and fi-ostbite (Beitner et al.. Gen. Pharmac. 20: 641-646, 1989), as well as in dermatitis and other 
conditions involving keratinocyte hyperproliferation. The methods of the present invention may be 

20 applied to the treatment of these and other conditions wherein antagonism of calmodulin activity is 
desirable. 

Thus, as discussed above, CaMLP protein of the invention shares structural similarity with 
the ubiquitous intracellular receptor protein calmodulin suggesting that CaMLP may be useful in the 
development of selective CaMLP inhibitors. The nucleic acids and polypeptides of the invention 

25 thus provide a novel therapeutic target in particular for cell proliferative disorders. In one aspect, 
said nucleic acids and protein may be used in drug screening processes to develop selective 
calmodulin and other calcium binding protein antagonists which do not inhibit the polypeptide of 
the invention. In another aspect, the nucleic acids and polypeptides of the invention may be used in 
drug screening processes to identify selective modulators of the CaMLP without inhibiting 

30 calmodulin, thereby identifying compounds less likely to cause unwanted side effects. In yet 
another aspect, the nucleic acids and polypeptides of the invention may be used in drug screening 
processes to identify selective modulators of both calmodulin and CaMLP, thereby identifying 
compounds having increased potency. 

Upon calcium binding, CaMLP may interact with a number of protein targets in a calcium 

35 dependent manner, thereby altering a number of complex biochemical pathways that can affect the 
overall behavior of cells. The calcium-calmodulin complex for example controls the biological 
activity of more than thirty different proteins including several enzymes, ion transporters, receptors, 
motor proteins, transcription factors, and cytoskeletal components in eukaryotic cells. 
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As described in U.S. Patent No. 5,840,697, Blondelle et al have peptide inhibitors of 
calmodulin. A number of other calmodulin targeted compounds are known and used for a variety of 
therapeutic applications. For instance, chlorpromazine (Thorazine.RTM.) and related phenothiazine 
derivatives, disclosed, for example, in U.S. Pat. No. 2,645,640, are calmodulin antagonists useful as 
5 tranquilizers and sedatives. Naphthalenen-sulfonamides, also calmodulin antagonists, are known to 
inhibit cell proliferation, as disclosed, for example, in Hidaka et al. ((1981), PNAS, 78:4354-4357) 
and are useful as antitumor agents. In addition, the cyclic peptide cyclosporin A (Sandimmune®), 
disclosed in U.S. Patent No. 4,1 17,1 18, is as an immunosuppressive agent which is thought to work 
by inhibiting calmodulin mediated responses in lymphoid cells. 

10 Many existing calmodulin inhibitors have undesirable biological effects when administered 

at concentrations sufficient to block calmodulin. These undesirable biological effects include non- 
specific binding to other proteins or receptors, as described, for example, in Polak et al, ((1991), J. 
Neurosci. 1 1 :534-542.) In addition, negative side effects such as toxicity can occur. A specific 
example is the toxic side effects from cyclosporin A. Therefore, a need exists for calmodulin 

1 5 targeted agents, and in particular calmodulin antagonists which inhibit calmodulin without having 
additional, undesirable biological or side effects. In particular there is a need for inhibitors which 
are specific to calmodulin and which do not have toxic side effects. 

In addition, the CaMLP nucleic acids and polypeptides of the present invention can be used 
to identify compounds for the treatment of a subject experiencing negative side effects from the 

20 administration of other pharmaceuticals, such as those drugs that disrupt the body's calcium 

homeostasis. Co-administration of said compounds would be useful to counter-effect iatrogenically 
caused dysfunction of calcium metabolism. Such disorders include, but are not limited to, organ 
damage, autoimmune disorders, psychotic disorders, tumors and drug induced dysfunction, such as 
negative side effects subsequent to administration of pharmaceuticals. For example, organ or tissue 

25 transplantation can result in autoimmune disorders, such as tissue graft (allograft) rejections. 

It is well known that calmodulin-targeted compounds which are antagonists can be used as 
immunosuppressive agents. In addition, also as described above, such compounds are widely used 
as sedative or anti-psychotic agents. Furthermore, there is evidence that calmodulin (and hence 
CaMLP) antagonists are useful for the treatment of some malignant tumors, particularly those of the 

30 central nervous system, as well as lung tumors. The antitumor activity of calmodulin antagonists, as 
well as successful chemotherapy using the same, has been described, for example, in Sculler et al. 
Cancer Res.. 50:1645-1649 (1990) and Hait et al. Cancer Res., 50:6636-6640 (1990), both of which 
are incorporated herein by reference. U.S. Pat. No. 5,340,565, which is incorporated herein by 
reference, additionally describes the use of calmodulin antagonists or inhibitors as agents which 

35 enhance the effectiveness of a chemotherapeutic agent or radiation treatment. Specifically, 
described therein is a method of inhibiting or killing a tumor or cancer cell in a human patient 
undergoing radiation therapy or chemotherapy, for example with such chemotherapeutic agents as 
cisplatin (Platinol), by additionally administering a calmodulin binding agent which inhibits 
calmodulin activity. 
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It has also been found that extracellular calmodulin inhibits TNF release and facilitates 
elastase release, providing further suggestion that CaMLP, CaMLP analogues and CaMLP receptor 
agonists are useful agents for regulating the inflammatory process. CaMLP antagonists, which 
include CaMLP receptor antagonists and CaMLP -binding molecules, may be used to block the 
5 interaction of CaMLP with a receptor, thus providing the opposite effect from CaMLP, its 
analogues and receptor agonists. CaMLP may serve as a potent modulator of self-directed 
inflammation by assisting in the recognition of self vs. non-self as prokaryotes (e.g., bacterial 
pathogens) do not contain CaMLP. In some situations such as in tumor necrosis, release of 
extracellular CaMLP may lead to an inappropriate host response and failure of the 

10 immune/inflammatory systems to eradicate tumor cells. Further, a diagnostic test has been 
developed which can discern patient variabilities in TNF inhibition by calmodulin and other 
substances. This test can be utilized in monitoring individual patients for determining effective 
therapies, and for predicting efficacy of therapy with extracellular CaMLP, CaMLP analogues or 
CaMLP receptor agonists on the one hand and CaMLP antagonists on the other. A diagnostic test 

1 5 for elastase has also been developed with similar utility. 

Lvsozvme C Protein of SEP ID NO: 196 f internal designation 482181) and related protein of SEP 
IDNP:479 

The polypeptides of SEQ ID NP : 196 and SEQ ID NP:479encoded by the cDNA of SEQ 

20 ID NP:27 and 362, respectively, belong to the widely conserved family of lysozyme C precursors 
(Prager and Jolles, Lysozymes: model enzymes in biochemistry and biology, ed. Jolles, 9-321 
(1996), Qasba and Kumar, Crit, Rev, Biochem, MoL Biol. 32:255-306 (1997)), which .disclosures 
are hereby incorporated by reference in their entireties. The protein of SEQ ID NOs: 196 and 479 or 
part thereof plays a role in glycoprotein and/or peptidoglycan metabolism, probably as a glycosyl 

25 hydrolase of family 22. Thus, the protein of the invention or part thereof is involved in immune and 
inflammatory responses and has antiviral, antibacterial, anti-inflammatory and/or anti-histaminic 
functions. Preferred polypeptides of the invention are polypeptides comprising the amino acids of 
SEQ ID NP:196 from positions 19 to 100, or from positions 1 to 100. Pther preferred polypeptides 
of the invention are fragments of SEQ ID NP: 196 having any of the biological activities described 

30 herein. The glycolytic activity of the protein of the invention or part thereof may be assayed using 
any of the assays known to those skilled in the art including those described in Gold and Schweiger, 
M. Methods in Enzymology, Vol. XX, Part C pp. 537-542, Ed. Moldave, Academic Press,New 
York and London, 1971 and in the US patent 4,255,517, which disclosures are hereby incorporated 
by reference in their entireties. 

35 Lysozymes, which are ubiquitous proteins found in most body secretions, are defined as 

1,4-beta-N-acetylmuramidase which cleave the glycoside bond between the C-1 of N-acetyl- 
muramic acid and the C-4 of N-acetylglucosamine in the peptidoglycan of bacteria. They have 
various therapeutic properties, such as antiviral, antibacterial, anti-inflammatory and antihistaminic 
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effects. The activity of lysozymes as an anti-bacterial agent appears to be based on both its direct 
bacteriolytic activity and also on stimulatory effects in connection with phagocytosis of 
polymorphonuclear leucocytes and macrophages (Biggar and Sturgess, J. M. Infect Immunol. 16: 
974-982 (1977); Thacore and Willet, Am. Rev. Resp. Dis. 93: 786-790 (1966); Klockars and 
S Roberts, P. Acta Haematol SS: 289-292 (1976)), which disclosures are hereby incorporated by 
reference in their entireties. Lysozymes have proven to be not only a selective factor but also an 
effective factor against microorganisms of the mouth (lacono et al, J. J. Infect. Immunol. 29: 623- 
632 (1980)), which disclosure is hereby incorporated by reference in its entirety. Lysozymes can 
also kill pathogens by acting synergistically with other proteins such as complement or antibody to 

10 lyse pathogenic cells. Lysozymes, also inhibit chemotaxis of polymorphonuclear leukocytes and 
limit the production of oxygen free radicals following an infection. This limits the degree of 
inflammation, while at the same time enhances phagocytosis by these cells. Other postulated 
functions of lysozymes include immune stimulation (Jolles, P. Biomedicine 25: 275-276 (1976) 
Ossermann, E. F. Adv. Pathobiol 4: 98-102 (1976)) and immunological and non-immunological 

15 monitoring of host membranes for any neoplastic transformation (Jolles, P. Biomedicine 25: 275- 
276 (1976); Ossermann, E. F. Adv. Pathobiol 4: 98-102 (1976)), which disclosures are hereby 
incorporated by reference in their entireties. Lysozymes may thus be used in a wide spectrum of 
applications (see US patent 5,618,712, which disclosure is hereby incorporated by reference in its 
entirety). Determination of the lyso^mes from serum and/or urine is used to diagnose various 

20 diseases or as an indicator for their development. In acute lymphoblastic leukaemia the lysozyme • 
serum level is significantly reduced, whereas in chronic myelotic leukaemia and in acute 
monoblastic and myelomonocytic leukaemia the lysozyme concentration in the serum is greatly 
increased. The therapeutically effective use of lysozyme is possible in the treatment of various 
bacterial and virus infections (Zona, Herpes zoster), in colitis, various types of pain, in allergies, 

25 inflammation and in pediatrics (the conversion of cows milk into a form suitable for infants by the 
addition of lysozyme). 

The invention relates to methods and compositions using the protein of the invention or part 
thereof to hydrolyze one or several substrates, alone or in combination with other substances, 
preferably antiviral, antifungal and/or antibacterial substances including but not limited to 

30 immunoglobulins, lactoferrin, betalysin, fibronectin, and complement components. Such substrates 
are glycosylated compounds, preferably containing beta-l-4-glycoside bonds, more preferably 
containing beta-M-glycoside bonds between n-acetylomuraminic acid and n-acetyloglucosamine. 
For example, the protein of the invention or part thereof is added to a sample containing the 
substrate(s) in conditions allowing hydrolysis, and allowed to catalyze the hydrolysis of the 

35 substrate(s). In a preferred embodiment, the hydrolysis is carried out using a standard assay such as 
those described by Gold and Schweiger, supra, and US patents 5,871,477 and 4,255,517, which 
disclosures are hereby incorporated by reference in their entireties. In a preferred embodiment, the 
protein of the invention or part thereof may be used to lyze recombinant bacteria in order to recover 
the recombinant DNA, the recombinant protein of interest, or both using, for example, any of the 
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assays described in Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second Edition, 
Cold Spring Harbor Laboratory Press (1989), which disclosure is hereby incorporated by reference 
in its entirety. 

In an embodiment, the protein of the invention or part thereof is used to hydrolyze 
S contaminating substrates, preferably exogenous substrates from bacterial, fungal or viral origins, in 
an aqueous sample or onto a material, preferably glassware and plasticware. In particular, the 
protein of the invention or part thereof may be used as a disinfectant in dental rinse, in protection of 
aqueous systems or in preparing material for medical applications using any of the methods and 
compositions described in US patents 5,069,71 7, 4,355,022 and 5,001,062, which disclosures are 

10 hereby incorporated by reference in their entireties. In a preferred embodiment, the protein of the 
invention is used as a host resistance factor in infants' formulas to convert cow's milk into a form 
more suitable for infants as described in US patent 6,020,015, which disclosure is hereby 
incorporated by reference in its entirety. In another preferred embodiment, the protein of the 
invention or part thereof may be used as a food preservative (see Hayashi et al., Agric. Biol. Chem. 

15 (European Edition of Japanese Journal of Agriculture, Biochemistry and Chemistry), Vol. 53, pp. 
3173-3177, 1989), which disclosure is hereby incorporated by reference in its entirety. In addition, 
the protein of the invention or part thereof may be used to clarify xanthan gum fermented broth for 
applications in food and in cosmetic industries using the method described in US patent 5,994,107, 
which disclosure is hereby incorporated by reference in its entirety. In another preferred 

20 embodiment, compositions comprising the protein of the present invention or part thereof are added 
to samples or materials as a "cocktail" with other antimicrobial substances, preferably antibiotics or 
hydrolytic enzymes such as those described in US patents 5,458,876 and 5,041,326, which 
disclosures are hereby incorporated by reference in their entireties, to decontaminate the samples. 
For example, the protein of the invention or part thereof may be used in place or in combination 

25 with antibiotics in cell cultures. The advantage of using a cocktail of hydrolytic enzymes is that one 
is able to hydrolyze a wide range of substrates without knowing the specificity of any of the 
enzymes. Using a cocktail of hydrolytic enzymes also protects a sample or material from a wide 
range of future unknown contaminants from a vast number of sources. For example, the protein of 
the invention or part thereof is added to samples where contaminating substrates, preferably 

30 exogenous substrates from bacterial, fungal or viral origins, is undesirable in an amount sufficient to 
promote hydrolysis of said substrates. Alternatively, the protein of the invention or part thereof 
may be bound to a chromatographic support, either alone or in combination with other hydrolytic 
enzymes, using techniques well known in the art, to form an affinity chromatography column. A 
sample containing the undesirable substrate is run through the column to remove the substrate. 

35 Immobilizing the protein of the invention or part thereof on a support advantageous is particularly 
for those embodiments in which the method is to be practiced on a commercial scale. This 
immobilization facilitates the removal of the enzyme from the batch of product and subsequent 
reuse of the enzyme. Immobilization of the protein of the invention or part thereof can be 
accomplished, for example, by inserting a cellulose-binding domain in the protein. One of skill in 
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the art will understand that other methods of immobilization could also be used and are described in 
the available literature. Alternatively, the same methods may be used to identify new substrates. 

In addition, the protein of the invention or part thereof may be useful to identify or quantify 
the amount of a given substrate in biological fluids, foods, water, air, solutions and the like. In a 
5 preferred embodiment, the protein of the invention or part thereof is used in assays and diagnostic 
kits for the identification and quantification of exogenous substrates in bodily fluids including 
blood, lymph, saliva or other tissue samples, in addition to bacterial, fiingal, plant, yeast, viral or 
mammalian cell cultures. In a preferred embodiment, the protein of the invention or part thereof is 
used to detect, identify, and or quantify eubacteria using reagents and assays described in US patent 

10 5,935,804, which disclosure is hereby incorporated by reference in its entirety. Briefly, the protein 
of the invention of part thereof is catalytically inactived , i.e. capable of binding but not cleaving a 
peptidoglycan comprising NAc-muramic acid in the eubacteria, using any of the methods known to 
those skilled in the art including those which produce a mutant enzyme, a recombinant-enzyme, or a 
chemically inactivated enzyme. The catalytically inactive protein of the invention is then incubated 

1 S with an aliquot of a biological sample under conditions suitable for binding of the inactive enzyme 
to the peptidoglycan substrate. Then, the bound enzyme is detected to assess the presence or 
amount of the eubacteria in the biological sample. 

In another embodiment, the nucleic acid of the invention or part thereof may be used to 
increase disease resistance of plants to bacterial, fungal and/or viral infections. A polynucleotide 

20 containing the nucleic acid of the invention or part thereof is introduced into the plant genome in 
conditions allowing correct expression of the transgenic protein using any methods known to those 
skilled in the art including those disclosed in US patents 5,349,122 and 5,850,025, which 
disclosures are hereby incorporated by reference in their entireties. 

In another preferred embodiment, the protein of the invention or part thereof may be useful 

25 to treat and/or prevent bacterial, fungal and viral infections in humans or in animals caused by 
various agents including but not limited to Streptococcus, Veillonella alcalescens, Actinomyces, 
Herpes simplex, Candida albicans. Micrococcus lysodeikticus and HIV by hydrolyzing the 
glycosylated compounds contained in such micro-organisms. In still a preferred embodiment, the 
protein of the invention or part thereof is used to prevent and/or treat bacterial, fungal and viral 

30 infections in immunocompromised individuals who lack fully functional immune systems, such as 
neonates or geriatric patients or HIV-infected individuals, or who suffer from a disease affecting the 
respiratory tract such as cystic fibrosis or the gastrointestinal tract such as ulcerative colitis or sprue. 

In still another embodiment, the protein of the invention or part thereof may be used as a 
growth factor for in vitro cell culture, preferably for T cells and T cell lines, using techniques and 

35 methods taught in US patent 5,468,635. which disclosure is hereby incorporated by reference in its 
entirety. 

In addition, the protein of the invention or part thereof may be used to identify inhibitors for 
mechanistic and clinical applications. Such inhibitors may then be used to identify or quantify the 
protein of the invention in a sample, and to diagnose, treat or prevent any of the disorders where the 
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protein's hydrolytic, immunostimulatory and/or Inflammatory activities is/are undesirable and/or 
deleterious including but not limited to amyloidosis, colitis, lysosomal diseases, inflammatory and 
immune disorders including allergies and leukaemia. The protein of the invention may also be used 
to monitor host cell membranes for neoplastic transformation. 
5 It will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID 

NOs:27 and 362 and polypeptides of SEQ ID NO: 196 and 479, described throughout the present 
application also pertain to the human cDNA of clone 4821 81, and the polypeptides encoded 
thereby. 

10 Angiogenin Protein of SEP ID NO: 176 fintemal designation 1 14180) and related protein of SEP 
IDNP:461 

The polypeptides of SEQ ID NO: 1 76 and SEQ ID NO:46l encoded by the extended cDNA 
SEQ ID N0:7 and SEQ ID NO:344, respectively, are ribonucleases that belongs to the pancreatic 
ribonuclease family (see reviews from Beintema, (1998) Cell Mol LifeScL S4;763-5; Beintema 

1 5 and Kleineidam, ( 1 998) CelL Mol. Life ScL 54:825-32, which disclosures are hereby incorporated 
by reference in their entireties). It will be appreciated that all characteristics and uses of the 
polynucleotides of SEQ ID NO:7 and SEQ ID NO:344 and polypeptides of SEQ ID NO: 1 76 and 
SEQ ID NO:461, described throughout the present application also pertain to the human cDNA of 
clone 1 14180, and the polypeptides encoded thereby. In addition, the protein of the invention plays 

20 a role in angiogenesis as an angiogenin variant protein (see review from Badet, (1999) Pathoi Biol 
74-345-51, which disclosure is hereby incorporated by reference in its entirety). Preferred 
polypeptides of the invention are polypeptides comprising the amino acids of SEQ ID NO: 1 76 
from positions 19 to 75, from positions 26 to 75, or from positions 63 to 69. Other preferred 
polypeptides of the invention are fragments of SEQ ID NO: 1 76 having any of the biological 

25 activity described herein. The ribonuclease activity of the protein of the invention or part thereof 
may be assayed using any of the assays known to those skilled in the art including those described 
in US patent 5,866,1 19, which disclosure is hereby incorporated by reference in its entirety. The 
angiogenic activity of the protein of the invention or part thereof may be assayed using any of the 
assays known to those skilled in the ait including those described by Fett etal (1985) Biochem. 24, 

30 5480-5486, which disclosure is hereby incorporated by reference in its entirety. 

Ribonucleases are proteins which catalyze the hydrolysis of phosphodiester bonds in RNA 
chains. Pancreatic ribonucleases are pyrimidic-specific ribonucleases present in high quantity in the 
pancreas of a number of mammalia taxa and of a few reptiles. In addition to their function in 
hydrolysis of RNA, ribonucleases have evolved to support a variety of other physiological 

35 activities. Such activities include anti-parasite, anti-bacterium, anti-virus, anti-neoplastic activities, 
neurotoxicity, and angiogenesis. For example, bovine seminal ribonuclease is anti-neoplastic 
(Laceetti, et al. (1992) Cancer Res. 52: 4582-4586, which disclosure is hereby incorporated by 
reference in its entirety). Some frog ribonucleases display both anti-viral and anti-neoplastic activity 
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(Youle, et al. (1994) Proc. Natl. Acad. Sci. USA 91: 6012-6016; Mikulski, et al. (1990) J. Natl. 
Cancer Inst. 82: 151-152; and Wu, et al. (1993) J, BioL Chem. 268: 10686-10693), which 
disclosures are hereby incorporated by reference in their entireties. Eosinophil-derived neurotoxin 
(EDN) and eosinophil cationic protein (ECP) are related ribonucleases which possess neurotoxicity 
5 (Beintema, et al. ( 1 988) Biochemistry 27: 4530-4538; Ackerman, ( 1 993) In Makino, and Fukuda, 
Eosinophils: Biological and Clinical Aspects. CRC Press, Boca Raton, Fla., pp 33-74), which 
disclosures are hereby incorporated by reference in their entireties. In addition, ECP exhibits 
cytotoxic, anti-parasitic, and anti-bacterial activities. A EDN-related ribonuclease, named RNase 
k6, is shown to express in normal human monocytes and neutrophils, suggesting a role for this 

10 ribonuclease in host defense (Rosenberg, and Dyer, (1996) Nuc. Acid. Res. 24: 3507-3513), which 
disclosure is hereby incorporated by reference in its entirety. 

Angiogenin is a tRNA-specific ribonuclease which binds protein partners on the surface of 
endothelial cells for endocytosis. Potential partners of angiogenin include heparin, plasminogen, 
elastase, angiostatin, actin, and a 170 kDa receptor on the surface of endothelial cells [Strydom, 

15 (1998) Cell. Mol. Life Sci, 54, 81 1-824, which disclosure is hereby incorporated by reference in its 
entirety ]. Endocytosed angiogenin is translocated to the nucleus where it promotes endothelial 
invasiveness required for blood vessel formation (Moroianu, and Riordan, (1994) Proc. Natl. Acad. 
Sci. USA 91: 1217-1221, which disclosure is hereby incorporated by reference in its entirety ). 

Although originally isolated from medium conditioned by human colon cancer cells (Fett et 

20 al. (1985), supra), and subsequently shown to be produced by several other histological types of 
human tumors [Rybak, et al. (1987) Biochem. Biophys. Res, Commun. 146, 1240-1248; Olson, et 
al., (1995) Proc. Natl. Acad. Sci. U.S.A. 92, 442-446, which disclosures are hereby incorporated by 
reference in their entireties], angiogenin also is a constituent of human plasma and normally 
circulates at a concentration of 250-360 ng/ml [Shimoyama, et al. (1996) Cancer Res. 56, 2703- 

25 2706; Blaser, et al. (1993) Eur. J. Clin. Chem. Clin. Biochem. 31, 513-516, which disclosures are 
hereby incorporated by reference in their entireties]. It has also been shown that recurrent gastric 
cancer patients had a much higher serum concentration of angiogenin than primary gastric cancer 
patients [Shimoyama, and Kaminishi, (2000) J. Cancer Res. Clin. Oncol. 126, 468-474, which 
disclosure is hereby incorporated by reference in its entirety]. 

30 Angiogenin is a potent inducer of angiogenesis [Fett, et al. supra]. Angiogenesis is a 

complex process of blood vessel formation comprising of several separate but interconnected steps 
at the cellular and biochemical level including: (i) activation of endothelial cells by the action of an 
angiogenic stimulus, (ii) adhesion and invasion of activated endothelial cells into the surrounding 
tissues and migration toward the source of the angiogenic stimulus, and (iii) proliferation and 

35 differentiation of endothelial cells to form a new microvasculature [Folkman, and Shing, (1992) J. 
Biol. Chem. 267, 10931-10934; Moscatelli, and Rifkin, (1988) Biochim. Biophys. Acta 948, 67-85, 
which disclosures are hereby incorporated by reference in their entireties]. While angiogenesis is a 
tightly-controlled process under usual physiological conditions, abnormal angiogenesis can have 
devastating consequences in pathological conditions such as arthritis, diabetic retinopathy and 
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tumor growth. It is now well-established that the growth of virtually all solid tumors is angiogenesis 
dependent [Folkman, (1989) J. Natl. Cancer Inst. 82, 4-6, which disclosure is hereby incorporated 
by reference in its entirety]. Angiogenesis is also a prerequisite for the development of metastasis, 
since it provides the means whereby tumor cells disseminate from the original primary tumor and 
5 establish at distant sites [Mahadevan, and Hart, (1990) Rev. Oncol. 3, 97-103; Blood, and Zetter 
(1990) Biochim. Biophys. Acta 1032, 89-1 1 8, which disclosures are hereby incorporated by 
reference in their entireties]. Therefore, interference with the process of tumor-induced 
angiogenesis can be an effective therapy for both primary and metastatic cancers. Indeed, several 
anti-angiogenic agents have been produced and are currently in the clinical trial stage. 

1 0 The invention relates to methods and compositions using the protein of the invention or part 

thereof to hydrolyze one or several substrates, preferably nucleic acids, more preferably RNA, alone 
or in combination with other substances. For example, the protein of the invention or part thereof is 
added to a sample containing the substrate(s) in conditions allowing hydrolysis, and allowed to 
catalyze the hydrolysis of the substrate(s). Hydrolysis conditions as described in the US patent 

1 5 5,866, 1 1 9 may be used, which disclosure is hereby incorporated by reference in its entirety. 

In a preferred embodiment, the protein of the invention or part thereof may be used to 
remove contaminating RNA in a biological sample, alone or in combination with other nucleases. 
In a more preferred embodiment, the protein of the invention or part thereof may be used to purify 
DNA preparations from contaminating RNA, to remove RNA templates prior to second strand 

20 synthesis and prior to analysis of in vitro translation products. Compositions comprising the protein 
of the present invention or part thereof are added to biological samples as a "cocktail" with other 
nucleases. The advantage of using a cocktail of hydrolytic enzymes is that one is able to hydrolyze 
a wide range of substrates without knowing the specificity of any of the enzymes. Such cocktails of 
nucleases are commonly used in molecular biology assays, for example to remove unbound RNA in 

25 RNAse protection assays. Using a cocktail of hydrolytic enzymes also protects a sample from a 
wide range of future unknown RNA contaminants from a vast number of sources. For example, the 
protein of the invention or part thereof is added to samples where contaminating substrates is 
undesirable. Alternatively, the protein of the invention or part thereof may be bound to a 
chromatographic support, either alone or in combination with other hydrolytic enzymes, using 

30 techniques well known in the art, to form an affinity chromatography column. A sample containing 
the undesirable substrate is run through the column to remove the substrate. Immobilizing the 
protein of the invention or part thereof on a support is particularly advantageous for those 
embodiments in which the method is to be practiced on a commercial scale. This immobilization 
facilitates the removal of the enzyme from the batch of product and subsequent reuse of the enzyme. 

35 Immobilization of the protein of the invention or part thereof can be accomplished, for example, by 
inserting a cellulose-binding domain in the protein. One of skill in the art will understand that other 
methods of immobilization could also be used and are described in the available literature. 
Alternatively, the same methods may be used to identify new substrates. 
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In another embodiment, the protein of the invention or part thereof may be used to 
decontaminate or disinfect samples infected by undesirable parasite, bacteria and/or viruses using 
any of the methods known to those skilled in the art including those described in Youle et al, 
(1994), supra; Mikulski et al (1990) supra, Wu et al (1993) supra. 
S In another embodiment, the present invention relates to compositions and methods using the 

protein of the invention or part thereof to selectively kill cells. The protein of the invention or part 
thereof is linked to a recognition moiety capable of binding to a chosen cell, such as lectins, 
receptors or antibodies thus generating cytotoxic reagents using methods and techniques described 
in US patent 5,955,073, which disclosure is hereby incorporated by reference in its entirety. 
10 In still another embodiment, the invention relates to compositions and methods using the 

protein of the invention or part thereof to stimulate cell proliferation both in vitro and in vivo, 
especially endothelial cell growth. For example, soluble forms of the protein of the invention or 
part thereof may be added to cell culture medium in an amount effective to stimulate cell 
proliferation. 

15 In still another embodiment, the protein of the invention or part thereof may be used in the 

diagnosis, prevention and/or treatment of disorders associated with excessive angiogenesis such as 
tumor growth, arthritis or diabetic retinopathy. 

In a preferred embodiment, the protein of the invention may be used as a diagnostic marker 
to evaluate the risk of a given individual to develop a tumor, to evaluate the risk of recurrence of 

20 tumors or to evaluate the degree of cancer aggressiveness based on the facts that the level of 

circulating angiogenin is lower in normal individuals than in patients bearing tumors, and is lower 
in patients with primary cancers compared to patients with reoccurent tumors, as stated above. 
Thus, quantitative immunoassays can be used for the detection of abnormal levels of either the 
protein of SEQ ID NO: 1 76 or the mRNA encoding such protein as the polynucleotide of SEQ ID 

25 No:7, thereby identifying those individuals at risk for the development of tumors or the recurrence 
of tumors. Detection of abnormal levels of the protein of the invention may be performed using any 
techniques known to those skilled in the art including those described elsewhere in the application. 
For example, antibodies binding specifically to the protein of the invention, or fragments thereof, 
may be used in routine immunoassays to screen for the presence or absence of the protein of the 

30 invention, or fragments thereof. Alternatively, the nucleic acids which encode the protein of the 
invention, or fragments thereof, may be used in hybridization assays to detect and/or quantity the 
expression of said pi-otein. 

Another aspect of the invention provides for molecules which inhibit, or reduce, the 
biological activity or expression of SEQ ID NO: 276. Such molecules may be be administered to 

35 patients to prevent vascularization, especially tumor vascularization, thereby limiting tumor growth. 
Such antagonists and/or inhibitors may be antibodies specific for the protein of the invention that 
can be used directly as an antagonist, or indirectly as a targeting or delivery mechanism for bringing 
a pharmaceutical agent to cells or tissue which express the protein of the invention. Neutralizing 
antibodies, (i.e., those which inhibit protein-protein interactions) are especially preferred for 
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therapeutic use. Alternatively, such molecules may be mutated forms of the protein of the invention 
or truncated forms v^hich will be able to bind to the partners of the protein of the invention and 
compete with if for partners but without eliciting any of its biological activities. Other methods to 
inhibit the expression of the protein of the invention include antisense and triple helix strategies as 
5 described herein. Other antagonists or inhibitors of the protein of the invention may be produced 
using methods which are generally known in the art, including the screening of libraries of 
pharmaceutical agents to identify those which specifically bind the protein of the invention. The 
protein of the invention, or part thereof, preferably its functional or immunogenic fragments, or 
oligopeptides related thereto, can be used for screening libraries of compounds in any of a variety of 
1 0 drug screening techniques including those described herein. 

Protease Inhibitor Protein of SEP ID NO: 181 (internal designation 1 00077 1934) and related 
protein of SEO ID NO:466 

The protein of SEQ ID NOs: 1 8 1 and 466 encoded by the extended cDNA SEQ ID NO: 1 2 

IS and 349, respectively, is a protease inhibitor belonging to the WAP-type disulfide core family. It 
will be appreciated that all characteristics and uses of the polynucleotides of SEQ ID NOs: 12 and 
349 and polypeptides of SEQ ID N0s:l81 and 466, described throughout the present application 
also pertain to the human cDNA of clone 1000771934, and the polypeptides encoded thereby. 
Preferred polypeptides of the invention are polypeptides comprising the amino acid fragments of 

20 SEQ ID NO: 181 from positions 32 to 73, 49 to 62, 76 to 122, 97 to 1 10 or any combination thereof 
Other preferred polypeptides of the invention are fragments of SEQ ID NO: 1 8 1 having any of the 
biological activity described herein. The protease inhibitor activity of the protein of the invention 
or part thereof may be assessed using any techniques known to those skilled in the art. Possible 
substrates for the protein of the invention include are not limited to trypsin, chymotrypsin, 

25 leukoproteinase, elastase, subtilisin, type IV collagenase and other serine proteases. 

Proteases, which cleave proteins, are largely used in industry including in food processing, 
brewing, and alcohol production. Proteases are important components of laundry detergents and 
other products. Within biological research, proteases are used in purification processes to degrade 
unwanted proteins. It is often desirable to employ proteases of low specificity or mixtures of more 

30 specific proteases to obtain the necessary degree of degradation. 

Proteases are also key components of a broad range of biological pathways, including blood 
coagulation and digestion. For example, the absence or insufficiency of a protease can result in a 
pathological condition that can be treated by replacement or augmentation therapy. Such therapies 
include the treatment of hemophilia with clotting factors VIII, IX, and Vila. In another application, 

35 the proteolytic enzyme tissue plasminogen activator (t-PA) is used to activate the body's clot lysing 
mechanism, thereby reducing morbitity resulting from myocardial infarction. The protease thrombin 
is used to initiate the clotting of fibrinogen*based tissue adhesives during surgery. Neutrophils 
produce several antibacterial serine proteases (Gabay, Ciba Found. Symp. 186:237-247, 1994; 
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Scocchi et al., Eur. J. Biochem. 209:589-595, 1992, which disclosures are hereby incorporated by 
reference in their entireties). Proteases also regulate cellular processes through receptor-mediated 
pathways by proteolytic activation of the cognate receptor (Vu et al.. Cell 64:1057-1068, 1991; 
Blackhart et al., J. Biol, Chem. 271:16466-16471, 1996, which disclosures are hereby Incorporated 
S by reference in their entireties). 

Overproduction or lack of regulation of proteases can also have pathological consequences. 
Elastase, released within the lung in response to the presence of foreign particles, can damage lung 
tissue if its activity is not tightly regulated. Emphysema in smokers is believed to arise from an 
imbalance between elastase and its inhibitor, alpha- 1 -antitrypsin. This balance may be restored by 
10 administration of exogenous alpha- 1 -antitrypsin. 

In addition, protease inhibitors have been shown to inhibit the growth of microorganisms 
including human pathogenic bacteria such as strains of group A streptococci, including antibiotic- 
resistant strains (Merigan, T. et al (1996) Ann Intern Med 124: 1039-1050; Stoka, V. (1995) FEBS. 
Lett 370:101-104; Vonderfecht, S. et al (1988) J Clin Invest 82:201 1-2016; Collins, A. et al (1991) 
1 5 Antimicrob Agents Chemother 35:2444-2446, which disclosures are hereby incorporated by 
reference in their entireties). 

In view of the growing use of proteases in industry, research, and medicine, there is an 
ongoing need in the art for new enzymes and new enzyme inhibitors. The present invention 
addresses these needs. 

20 The invention relates to compositions and methods using the protein of the invention or part 

thereof to inhibit proteases, both in vitro or in vivo. Since proteases play an important role in the 
regulation of many biological processes in virtually all living organisms as well as a major role in 
diseases, inhibitors of proteases are useful in a wide variety of applications. 

In one embodiment, the protein of the invention or part thereof may be useful to quantify 

25 the amount of a given protease in a biological sample, and thus used in assays and diagnostic kits 
for the quantification of proteases in bodily fluids or other tissue samples, in addition to bacterial, 
fungal, plant, yeast, viral or mammalian cell cultures. In a preferred embodiment, the sample is 
assayed using a standard protease substrate. A known concentration of protease inhibitor is added, 
and allowed to bind to a particular protease present. The protease assay is then rerun, and the loss 

30 of activity is correlated to the protease inhibitor activity using techniques well known to those 
skilled in the art. 

In addition, the protein of the invention or part thereof may be used to remove, identify or 
inhibit contaminating proteases in a sample. Compositions comprising the polypeptides of the 
present invention may be added to biological samples as a "cocktail" with other protease inhibitors 
35 to prevent degradation of protein samples. The advantage of using a cocktail of protease inhibitors 
is that one is able to inhibit a wide range of proteases without knowing the specificity of any of the 
proteases. Using a cocktail of protease inhibitors also protects a protein sample from a wide range 
of future unknown proteases which may contaminate a protein sample from a vast number of 
sources. For example, the protein of the invention or part thereof are added to samples where 
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proteolytic degradation by contaminating proteases is undesirable. Such protease inhibitor cocktails 
(see for example the ready to use cocktails sold by Sigma) are widely used in research laboratory 
assays to inhibit proteases susceptible of degrading a protein of interest for which the assay is to be 
performed. Alternatively, the protein of the invention or part thereof may be bound to a 
S chromatographic support, either alone or in combination with other protease inhibitor, using 

techniques well known in the art, to form an affinity chromatography column. A sample containing 
the undesirable protease is run through the column to remove the protease. Alternatively, the same 
methods may be used to identify new proteases. 

In a preferred embodiment, the protein of the invention or part thereof may be used to 

10 inhibit proteases implicated in a number of diseases where cellular proteolysis occur such as 
diseases characterized by tissue degradation including but not limited to arthritis, muscular 
dystrophy, inflammation, tumor invasion, glomerulonephritis, parasite-borne infections, 
Alzheimer's disease, periodontal disease, and cancer metastasis. 

In another preferred embodiment, the protein of the invention or part thereof may be useful 

IS to inhibit exogenous proteases, both in vivo and in vitro, implicated in a number of infectious 
diseases including but not limited to gingivitis, malaria, leishmaniasis, filariasis, osteoporosis and 
osteoarthritis, and other bacterial, and parasite^bome or viral infections. In particular, the protein of 
the invention or part thereof may offer applications in viral diseases where the proteolysis of 
primary polypeptide precursors is essential to the replication of the virus, as for HIV and HCV. 

20 In another embodiment, the protease inhibitors of the present invention may be used as 

antibacterial agents to retard or inhibit the growth of certain bacteria either in vitro or in vivo. 
Particularly, the polypeptides of the present invention may be used to inhibit the growth of group A 
streptococci on non-living matter such as surgical instruments, laboratory glassware and 
plasticware, and in culture of living plant, fungi, and animal cells. 

25 Furthermore, the protease inhibitors of the present invention find use in drug potentiation 

applications. For example, therapeutic agents such as antibiotics or antitumor drugs can be 
inactivated through proteolysis by endogenous proteases, thus rendering the administered drug less 
effective or inactive. Accordingly, the protease inhibitors of the invention may be administered to a 
patient in conjunction with a therapeutic agent in order to potentiate or increase the activity of the 

30 drug. This co-administration may be by simultaneous administration, such as a mixture of the 
protease inhibitor and the drug, or by separate simultaneous or sequential administration. 

Seroin Protein of SEP ID NO: 179 (internal designation 784093) and related protein of SEP ID 
NP:464 

35 The protein of SEQ ID NPs: 1 79-and 464 and encoded by the extended cDN A SEQ ID 

NPs: 1 0 and 347, respectively, is a serine protease inhibitor. It will be appreciated that all 
characteristics and uses of the polynucleotides of SEQ ID NPs: 10 and 347 and polypeptides of SEQ 
ID NPs: 1 79 and 464, described throughout the present application also pertain to the human cDNA 
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of clone 784093, and the polypeptides encoded thereby. Preferred polypeptides of the invention are 
polypeptides comprising the amino acid fragments of SEQ ID NO: 179 from positions 47 to 139. 
Other preferred polypeptides of the invention are fragments of SEQ ID NO: 179 having any of the 
biological activity described herein. The protease inhibitor activity of the protein of the invention 
S or part thereof may be assessed using any techniques known to those skilled in the art including 
those disclosed in the US patent 5,955,284. Possible substrates for the protein of the invention 
include are not limited to serine proteases such as elastase, trypsin, chymotrypsin, thrombin III, 
plasmin, heparin, complement II, plasminogen activator, protein C, interleukin-lB coverting 
enzyme, preferably trypsin, elastase, and chymotrypsin. 

10 Proteases are key components of a broad range of biological pathways, including blood 

coagulation and digestion. For example, the absence or insufficiency of a protease can result in a 
pathological condition that can be treated by replacement or augmentation therapy. Such therapies 
include the treatment of hemophilia with clotting factors Vlll, IX, and Vila. In another application, 
the proteolytic enzyme tissue plasminogen activator (t-PA) is used to activate the body's clot lysing 

1 5 mechanism, thereby reducing morbitity resulting from myocardial infarction. The protease thrombin 
is used to initiate the clotting of fibrinogen-based tissue adhesives during surgery. Neutrophils 
produce several antibacterial serine proteases (Gabay, Ciba Found. Symp. 1 86:237-247, 1994; 
Scocchi et al., Eur. J. Biochem. 209:589-595, 1992, which disclosures are hereby incorporated by 
reference in their entireties). Proteases also regulate cellular processes through receptor-mediated 

20 pathways by proteolytic activation of the cognate receptor (Vu et al.. Cell 64:1057-1068, 1991; 
Blackhart et al., J. Biol. Chem. 271:16466-16471, 1996, which disclosures are hereby incorporated 
by reference in their entireties). 

Overproduction or lack of regulation of proteases can also have pathological consequences. 
Elastase, released within the lung in response to the presence of foreign particles, can damage lung 

25 tissue if its activity is not tightly regulated. Emphysema in smokers is believed to arise from an 
imbalance between elastase and its inhibitor, alpha- 1 -antitrypsin. This balance may be restored by 
administration of exogenous alpha-l-antitiypsin. 

The serine proteases (SP) are a large family of proteolytic enzymes that include the 
digestive enzymes, trypsin and chymotrypsin, components of the complement cascade and of the 

30 blood-clotting cascade, and enjsymes that control the degradation and turnover of macromolecules 
of the extracellular matrix. SP are so named because of the presence of a serine residue in the active 
catalytic site for protein cleavage. They are characterized by a catalytic triad of serine, histidine, and 
aspartic acid residues. SP have a wide range of substrate specificities and can be subdivided into 
subfamilies on the basis of these specificities. The main sub-families are trypases (cleavage after 

35 arginine or lysine), aspases (cleavage after aspartate), chymases (cleavage after phenylalanine or 
leucine), metases (cleavage after methionine), and serases (cleavage after serine). 

Serine proteases are used for a variety of industrial purposes, For example, the serine 
protease subtilisin is used in laundry detergents to aid in the removal of proteinaceous stains (e.g., 
Crabb, ACS Symposium Series 460:82-94, 1991, which disclosure is hereby incorporated by 
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reference in its entirety). In the food processing industry, serine proteases are used to produce 
protein-rich concentrates from fish and livestock, and in the preparation of dairy products (Kida et 
at.. Journal of Fermentation and Bioengineering 80:478-484, 1995; Haard and Simpson, in Martin, 
A. M., ed., Fisheries Processing: Biotechnological Applications, Chapman and Hall, London, 1994, 
5 132-154; Bos et al., European Patent Office Publication 494 149 A1, which disclosures are hereby 
incorporated by reference in their entireties). 

Serpins are irreversible serine protease inhibitors which are principally located 
extracellularly. Proteins which have been assigned to the serpin family include the following: 
,alpha.-l protease inhibitor,.alpha.-l-antichymotrypsin, antithrombin III, .aIpha.-2-antiplasmin, 

10 heparin cofactor II, complement CI inhibitor, plasminogen activator inhibitors 1 and 2, glia derived 
nexin, protein C inhibitor, rat hepatocyte inhibitors, crmA (a viral serpin which inhibits interleukin 
l-.beta. cleavage enzyme), human squamous cell carcinoma antigen which may modulate the host 
immune response against tumor cells, human maspin which seems to function as a tumor 
suppressor, lepidopteran protease inhibitor, leukocyte elastase inhibitor (the only known 

15 intracellular serpin), and products from three orthopoxviruses (these products may be involved in 
the regulation of the blood clotting cascade and/or of the complement cascade in the mammalian 
host). 

In view of the growing use of proteases in industry, research, and medicine, there is an 
ongoing need in the art for new enzymes and new enzyme inhibitors. The present invention 

20 addresses these needs. 

In one embodiment, the protein of the invention or part thereof may be useful to quantify 
the amount of a given protease in a biological sample, and thus used in assays and diagnostic kits 
for the quantification of proteases in bodily fluids or other tissue samples, in addition to bacterial, 
fungal, plant, yeast, viral or mammalian cell cultures. In a preferred embodiment, the sample is 

25 assayed using a standard protease substrate. A known concentration of protease inhibitor is added, 
and allowed to bind to a particular protease present. The protease assay is then rerun, and the loss 
of activity is correlated to the protease inhibitor activity using techniques well known to those 
skilled in the art. Preferred proteases in this embodiment are serine protease, more preferably 
elastase, trypsin and chymotrypsin. 

30 In addition, the protein of the invention or part thereof may be used to remove^ identify or 

inhibit contaminating proteases in a sample. Compositions comprising the polypeptides of the 
present invention may be added to biological samples as a "cocktail" with other protease inhibitors 
to prevent degradation of protein samples. The advantage of using a cocktail of protease inhibitors 
is that one is able to inhibit a wide range of proteases without knowing the specificity of any of the 

35 proteases. Using a cocktail of protease inhibitors also protects a protein sample from a wide range 
of future unknown proteases which may contaminate a protein sample from a vast number of 
sources. For example, the protein of the invention or part thereof are added to samples where 
proteolytic degradation by contaminating proteases is undesirable. Such protease inhibitor cocktails 
(see for example the ready to use cocktails sold by Sigma) are widely used in research laboratory 
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assays to inhibit proteases susceptible of degrading a protein of interest for which the assay is to be 
performed. Alternatively, the protein of the invention or part thereof may be bound to a 
chromatographic support, either alone or in combination with other protease inhibitor, using 
techniques well known in the art, to form an affinity chromatography column. A sample containing 
S the undesirable protease is run through the column to remove the protease. Alternatively, the same 
methods may be used to identify new proteases. 

In a preferred embodiment, the protein of the invention or part thereof may be used to 
inhibit proteases implicated in a number of diseases where cellular proteolysis occur such as 
diseases characterized by tissue degradation including but not limited to arthritis, muscular 

10 dystrophy, inflammation, tumor invasion, glomerulonephritis, parasite-borne infections, 

Alzheimer's disease, periodontal disease, and cancer metastasis. In a more preferred embodiment, 
the invention relates to compositions and methods to use the protein of the invention or part thereof 
in diseases characterized by an abnormally elevated levels of trypsin, chymotrypsin or elastase, 
including but not limited to chronic emphysema of the lungs, cirrhosis, liver failure, cystic fibrosis, 

1 5 alphal -antitrypsin deficiency associated disorders such as aneurysm or toxic shock. For prevention 
and/or treatment purposes, the protein of the invention may be used using any of the gene therapy 
methods described herein or known to those skilled in the art. 

In another preferred embodiment, the protein of the invention or part thereof may be usefiil 
to inhibit exogenous proteases, both in vivo and in vitro, implicated in a number of infectious 

20 diseases including but not limited to gingivitis, malaria, leishmaniasis, filariasis, osteoporosis and 
osteoarthritis, and other bacterial, and parasite-borne or viral infections. In particular, the protein of 
the invention or part thereof may offer applications in viral diseases where the proteolysis of 
primary polypeptide precursors is essential to the replication of the virus, as for HIV and HCV. 
In another embodiment, the protease inhibitors of the present invention may be used as 

25 antibacterial agents to retard or inhibit the growth of certain bacteria either in vitro or in vivo. 

Particularly, an amount of the polypeptides of the present invention effective to inhibit proliferation 
may be used to inhibit the growth of group A streptococci on non-living matter such as surgical 
instruments, laboratory glassware and plasticware, and in culture of living plant, fungi, and animal 
cells. 

30 Furthermore, the protease inhibitors of the present invention find use in drug potentiation 

applications. For example, therapeutic agents such as antibiotics or antitumor drugs can be 
inactivated through proteolysis by endogenous proteases, thus rendering the administered drug less 
effective or inactive. Accordingly, the protease inhibitors of the invention may be administered to a 
patient in conjunction with a therapeutic agent in order to potentiate or increase the activity of the 

35 drug. This co-administration may be by simultaneous administration, such as a mixture of the 
protease inhibitor and the drug, or by separate simultaneous or sequential administration. 
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ATPKf Protein sequence of SEP ID No.253 (internal designation 1000867870) 



The protein of SEQ ID NO:253 encoded by the extended cDNA SEQ ID NO:84 and relate 
polynucleotides of SEQ IDNO:404 is a variant of the human mitochondrial ATP synthase f subunit 
or ATPK (E.C. 3.6. 1 .34) and, as such, plays a role in cellular respiration. Preferred polypeptides of 
5 the invention are are polypeptides comprising the amino acids of SEQ ID NO: 253 from positions 5 
to 88. Other preferred polypeptides of the invention are fragments of SEQ ID NO: 253 having any 
of the biological activity described herein. It will be appreciated that all characteristics and uses of 
the polynucleotides of SEQ ID NOs:84 and 404 and polypeptides of SEQ ID NO: 253 described 
throughout the present application also pertain to the human cDNA of clone 1000867870, and the 

1 0 polypeptides encoded thereby. 

The mitochondrial electron transport (or respiratory) chain is a series of enzyme complexes 
in the mitochondrial membrane that is responsible for the transport of electrons from NADH to 
oxygen and the coupling of this oxidation to the synthesis of ATP (oxidative phosphorylation). ATP 
then provides the primary source of energy for driving a cell's many energy-requiring reactions. 

1 5 ATP synthase (FO Fl ATPase) is the enzyme complex at the terminus of this chain and serves as a 
reversible coupling device that interconverts the energies of an electrochemical proton gradient 
across the mitochondrial membrane into either the synthesis or hydrolysis of ATP. This gradient is 
produced by other enzymes of the respiratory chain in the course of electron transport from NADH 
to oxygen. When the cell's energy demands are high, electron transport from NADH to oxygen 

20 generates an electrochemical gradient across the mitochondrial membrane. Proton translocation 
from the outer to the inner side of the membrane drives the synthesis of ATP. Under conditions of 
low energy requirements and when there is an excess of ATP present, this electrochemical gradient 
is reversed and ATP synthase hydrolyzes ATP. The energy of hydrolysis is used to pump protons 
out of the mitochondrial matrix. ATP synthase is, therefore, a dual complex, the FO portion of 

25 which is a transmembrane proton carrier or pump, and the Fl portion of which is catalytic and 
synthesizes or hydrolyzes ATP. Mammalian ATP synthase complex consists of sixteen different 
polypeptides (Walker, J. E. and Collinson, T. R. (1994) FEBS Lett.346: 39-43, which disclosure is 
hereby incorporated by reference in its entirety). Six of these polypeptides (subunits alpha, beta, 
gamma, delta, epsilon, and an ATPase inhibitor protein IF 1) comprise the globular catalytic F 1 

30 ATPase portion of the complex, which lies outside of the mitochondrial membrane. The remaining 
ten polypeptides (subunits a, b, c, d, e, f, g, F6, OSCP, and A6L) comprise the proton-translocating, 
membrane spanning FO portion of the complex. Like other members of the respiratory chain, all but 
two of the polypeptide subunits of ATP synthase are nuclear gene products that are imported into 
the mitochondria. Enzyme complexes similar to mammalian ATP synthase are found in all cell 

35 types and in chloroplast and bacterial membranes. This universality indicates the central importance 
of this enzyme to ATP metabolism. Transcriptional regulation of these nuclear encoded genes 
appears to be the predominant means for controlling the biogenesis of ATP synthase. Multiple 
mitochondrial pathologies exist because of the essential role of mitochondrial oxidative 
phosphorylation in cellular energy production, in the generation of reactive oxygen species and in 
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the initation of apoptosis (Wallace, Science, 283:1482-1488, 1999, which disclosure is hereby 
incorporated by reference in its entirety). It is now clear that mitochondrial diseases encompass an 
assemblage of clinical problems commonly involving tissues that have high energy requirements 
such as heart, muscle and the renal and endocrine systems. Over the past 1 1 years, a considerable 
S body of evidence has accumulated implicating defects in the mitochondrial energy-generating 
pathway, oxidative phosphorylation, in a wide variety of degenerative diseases including myopathy 
and cardiomyopathy. Most classes of pathogenic mitochondrial DNA mutations affect the heart, in 
association with a variety of other clinical manifestations that can include skeletal muscle, the 
central nervous system (including eye), the endocrine system, and the renal system. Nuclear 

10 mutations causing mitochondrial disorders have been described. They are often found in highly 
conserved subunits. Mitochondrial disorders with nuclear mutations include : myopathies (PEO, 
MNGIE, congenital muscular dystrophy, carnitine disorders), encephalopathies (Leigh, Infantile, 
Wilson's disease, Deafness-Dystonia syndrome), other systemic disorders and cardiomyopathies. 
The discovery of a new ATP synthase subunit, and polynucleotides encoding it satisfy a 

1 5 need in the art by providing new compositions which are useful for the diagnosis, prevention, and 
treatment of cancer, myopathies, immune disorders, and neurological disorders. 

An object of the present invention relates to compositions and methods of targeting 
heterologous compounds, either polypeptides or polynucleotides to mitochondria by recombinantly 
or chemically fusing a fragment of the protein of the invention to an heterologous polypeptide or 

20 polynucleotide. Preferred fragments are signal peptide, amphiphilic alpha helices and/or any other 
fragments of the protein of the invention, or part thereof, that may contain targeting signals for 
mitochondria including but not limited to matrix targeting signals as defined in Herrman and 
Neupert, Curr. Opinion Microbiol. 3:210-4 (2000); Bhagwat et al. J. Biol. Chem. 274:24014-22 
(1999), Murphy Trends Biotechnol. 15:326-30 (1997); Glaser et al. Plant Mol Biol 38:31 1-38 

25 (1998); Ciminale et al. Oncogene 18:4505-14 (1999), which disclosures are hereby incorporated by 
reference in their entireties. Such heterologous compounds may be used to modulate 
mitochondria's activities. For example, they may be used to Induce and/or prevent mitochondrial- 
induced apoptosis or necrosis. In addition, heterologous polynucleotides may be used for 
mitochondrial gene therapy to replace a defective mitochondrial gene and/or to inhibit the 

30 deleterious expression of a mitochondrial gene. 

The invention further relates to methods and compositions using the protein of the invention 
or part thereof to diagnose, prevent and/or treat several disorders in which mitochondrial respiratory 
electron transport chain is impaired, including but not limited to mitochondriocytopathies, necrosis, 
aging, myopathies, cancer and neurodegenerative diseases such as Alzheimer's disease, 

35 Huntington's disease, Parkinson's disease, epilepsy, Down's syndrome, dementia, multiple sclerosis, 
and amyotrophic lateral sclerosis. For diagnostic purposes, the expression of the protein of the 
invention could be investigated using any of the Northern blotting, RT-PCR or immunoblotting 
methods described herein and compared to the expression in control individuals. For prevention 
and/or treatment purposes, the protein of the invention may be used to enhance electron transport 
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and increase energy delivery using any of the gene therapy methods described herein or known to 
those skilled in the art. 

In another embodiment, the invention further relates to methods and compositions using the 
protein of the invention or part thereof to diagnose, prevent and/or treat several disorders in which 
5 mitochondria! respiratory electron transport chain needs to be impaired, including but not limited to 
Sjogren's syndrome, Addison's disease, bronchitis, dermatomyositis, polymyositis, 
glomerulonephritis, diabetes meltitus, emphysema, Graves' disease, atrophic gastritis, lupus 
erythematosus, myasthenia gravis, multiple sclerosis, autoimmune thyroiditis, ulcerative colitis, 
anemia, pancreatitis, scleroderma, rheumatoid and osteoarthritis, asthma, allergic rhinitis, atopic 
10 dermatitis, dermatomyositis, polymyositis, and gout, using any techniques known to those skilled in 
the art including the antisense or triple helices strategies described herein. 

Moreover, antibodies to the protein of the invention or part thereof may be used for 
detection of mitochondria organelles and/or mitochondrial membranes using any techniques known 
to those skilled in the art. 

15 

Olipomerization Protein sequence of SEP ID No. 310 (internal designation D1S0S68) 

The protein of SEQ ID NO : 3 10 encoded by the cDNA of SEQ ID NO: 141 and 435, is 
able to form homo-oligomers. Preferred polypeptides of the invention are polypeptides comprising 
the amino acids of SEQ ID NO:310 from positions 1 to 109. Other preferred polypeptides of the 

20 invention are fi*agments of SEQ ID NO: 310 having any of the biological activities described herein. 

Multivalency is a prerequisite for a variety of macromolecular interactions such as binding 
of antibodies or lectins to specific targets, ligand recognition, activation or inhibition of receptors 
and cell adhesion. Dimerization and oligomerization of proteins are thus general biological control 
mechanisms that contribute to the activation of cell membrane receptors, transcription factors, 

25 vesicle fusion proteins, and other classes of intra- and extracellular proteins. 

Multimerization domains have been shown to be useful tools in several areas of 
biotechnology, especially in protein engineering. For example, Tso et al have used leucine zippers 
for producing bispecific antibody heterodimers (US patent 5,932,448) / Methods of preparing 
soluble oligomeric proteins using leucine zippers have been described by Conrad et al (US patent 

30 5,965,712), Ciardelli et al (US patent 5,837,816), Spriggs et al (WO9410308) / Leucine zipper 
forming sequences have been used by Pelletier et al in protein fragment complementation assays to 
detect biomolecular interactions (WO9834I20), which disclosures are hereby incorporated by 
reference in their entireties. Because of their usefulness in biotechnology, it is thus highly 
interesting to isolate new multimerization domains. 

35 The multimerization activity of the protein of the invention or part thereof may be assayed 

using any of the assays known to those skilled in the art including circular dichroism spectrum, gel 
filtration chromatography and thermal melting analyses. 
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In one embodiment, the invention relates to compositions and methods of using the protein 
of the invention or part thereof for preparing soluble muitimeric proteins, which consist in 
multimers of fusion proteins containing a multimerization domain fused to a protein of interest, 
using any technique known to those skilled in the art including those described in international , 
5 patent WO9410308, which disclosure is hereby incorporated by reference in its entirety. 

In another embodiment, the protein of the invention or part thereof or derivative thereof is 
used for detection and determination of an analyte in a biological liquid using the teachings of US 
patent 5,643,73 1, which disclosure is hereby incorporated by reference in its entirety. Briefly, a 
first multimerization domain is immobilized on a solid support and the second multimerization 

10 domain is coupled to a specific binding partner for an analyte in a biological fluid. The two 

peptides are then brought into contact thereby immobilizing the binding partner on the solid phase. 
The biological sample is then contacted with the immobilized binding partner and the amount of 
analyte in the sample bound to the binding partner determined. 

In still another embodiment, the protein of the invention or part thereof may be used to 

1 S construct multimerization devices comprising hybrid molecules with a functional domain fused to a 
multimerization domain in order to yield muitimeric complexes with improved pharmacokinetic and 
pharmacological properties as described in WOO 102440, which disclosure is hereby incorporated 
by reference in its entirety. In a preferred embodiment, the protein of the invention or part thereof 
may be used to construct different fusion proteins with different functional domains such as enzyme 

20 moieties or cytotoxic moieties. Vectors encoding these different proteins may then be transfected in 
the same host cell in conditions allowing for multimerization, thus yielding muitimeric 
multifunctional complexes. 

Chaperone Protein of SEP ID NO: 303 (internal designation D637548) 

25 The protein of SEQ ID NO : 303 encoded by the cDNA of SEQ ID NO: 1 34 is a chaperonin. 

Accordingly, the protein of SEQ IDNO:303 plays a role in protein synthesis/folding, cellular 
trafficking , and the cellular stress response. In addition, the protein of SEQ ID No: 303 has 
immunosupressant and growth factor properties. It is able to depress delayed type hypersensitivity 
reactions. It is a product of primary and neoplastic cell proliferation and under these conditions acts 

30 as a growth factor. It is also a product of platelet activation and may play a part in wound healing 
and skin repair. Preferred polypeptides of the invention are polypeptides comprising the amino 
acids of SEQ ID NO:303 from positions 9 to 33, or from positions 7 to 1 0 1 . Other preferred 
polypeptides of the invention are fragments of SEQ I D NO: 303 having any of the biological 
activities described herein. The different activities of the protein of the invention or part thereof 

35 may be assayed using any of the assays described in US 6,1 1 7,42 1 or any of the assays referred into 
US, 61 17,421, which disclosures are hereby incorporated by reference in their entireties. 

Chaperonins belong to a wider class of molecular chaperones, molecules involved in post- 
translational folding, targeting and assembly of other proteins, but which do not themselves form 
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part of the final assembled structure as discussed by Ellis et al., 1991, Annu. Rev. Biochem. 60 321- 
347, which disclosures are hereby incorporated by reference in their entireties. Most molecular 
chaperones are "heat shock" or "stress" proteins (hsp); i.e. their production is induced or increased 
by a variety of cellular insults (such as metabolic disruption, oxygen radicals, inflammation, 
S infection and transformation), heat being only one of the better studies stresses as reviewed by 
Lindqulst et al., 1988, Annu. Rev. Genet. 22 631-677, which disclosure is hereby incorporated by 
reference in its entirety. As well as these quantitative changes in specific protein levels, stress can 
induce the movement of constitutively produced stress proteins to different cellular compartments 
as referred to in the Lindquist reference mentioned above. The heat shock response is one of the 

10 most highly conserved genetic system known and the various heat shock protein families are among 
the most evolutionarily stable proteins in existence. The major stress proteins accumulate to very 
high levels in stressed cells but occur at low to moderate levels in cells that have not been stressed. 
As well as enabling cells to cope under adverse conditions, members of these families perform 
essential functions in normal cells. 

1 S Chaperones are also involved in a number of disorders, especially autoimmune diseases 

such as type 1 diabetes, rheumatoid arthritis, systemic lupus erythematosus, Sjogren syndrome, and 
mixed connective tissue disease (Feige et al. EXS 1996; 77:3S9«73; Feili-Hariri et al. J Autoimmun 
2000; 14:133-42, which disclosures are hereby incorporated by reference in their entireties). 
Chaperones are also involved in various disorders including tuberculosis and leprosy (Zugel et al. 

20 Clin Microbiol Rev 1999; 12:19-39), neurogenerative disorders such as Alzheimer and Parkinson 
diseases (Yoo et al. J Neural Transm SuppI 1999; 57:3 15-22), and malignant disorders (Csermely et 
al. Pharmacol Ther 1998; 79:129-68), which disclosures are hereby incorporated by reference in 
their entireties. 

In one embodiment, the protein of the invention or part thereof may be used to detect a 
25 potential pregnancy, preferably within 6-24 hours of fertilization using the teaching of Morton et al., 
1976, Proc. R. Soc. B. 193 413-41 and US 6,1 17,421, which disclosures are hereby incorporated by 
reference in their entireties. Detection of the expression or activity of the protein of the invention 
may be performed using any techniques known to those skilled in the art including those described 
elsewhere in the application. 
30 In another embodiment, molecules able to block the expression or activity of the protein of 

the invention, such as antibodies, antisense or triple helix oligonucleotides, dominant negative 
forms of the protein, polypeptides or small molecule inhibitors of the expression or activity of the 
proteins, may be used to induce abortion as described in US patent 6,1 17,421. 

In still another embodiment, the protein of the invention or part thereof may be used to treat 
35 and/or prevent infertility and miscarriage using the simple administration of the protein of the 
invention or part thereof or using any of the gene therapy methods described elsewhere in the 
application and the teaching of the US patent 6,1 17,421. 

In another embodiment, the present invention provide methods of using the present proteins 
to identify specific cell types in vitro and in vivo. For example, as chaperone proteins are often 
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upregulated in response to cellular stress, the detection of cells expressing elevated levels of the 
proteins provides a tool for detecting cells under stress. As cellular stress has been implicated in a 
number of disorders, such as cardiovascular disorders, neurodegenerative disorders, and cancer, the 
ability to detect such stress thus provides a diagnostic or screening tool for such conditions. 
S In addition, the present polypeptides and polynucleotides can be used to develop diagnostic 

and screening assays for diseases characterized by an abnormal level or activity of the protein of 
SEQ ID NO: 303 such as malignant disorders of various types, and autoimmune diseases including 
type 1 diabetes, rheumatoid arthritis, systemic lupus erythematosus, Sjogren syndrome. Graves 
disease, multiple sclerosis, and mixed connective tissue disease. Such assays can be performed 

10 using any biological sample, such as serum or plasma. 

In another embodiment, various disorders can be treated, attenuated and/or prevented by a 
protein of SEQ ID NO: 303, or part thereof, or any other compound that can affect the level or 
activity of the proteins such as nucleic acids, antibodies, or chemical substances. In a preferred 
embodiment, proteins or other compounds directed to the proteins of the invention can be used to 

1 S treat or prevent disorders in which the activity or level of the protein of SEQ ID NO: 303 is 
unbalanced. Such diseases include, but are not limited to, infectious diseases, neurogenerative 
disorders as Alzheimer and Parkinson diseases, schizophrenia, alopecia, aging, atherosclerosis, 
malignant disorders of various types, and autoimmune diseases including type 1 diabetes, 
rheumatoid arthritis, systemic lupus erythematosus, Sjogren syndrome, mixed connective tissue 

20 disease, malignant disorders, autoimmune and any other neurodegenerative disorder. In another 
embodiment, the proteins of SEQ ID NO: 303 or part thereof can be used as vaccines for various 
disorders including, but not limited, to cancer (Wang et al. Immunol Invest 2000;29:131-7), 
tuberculosis (Silva et al. Microbes Infect 1999;1:429-35), diabetes (Int Immunol 1999;n:957-66), 
and atherosclerosis (Xu et al. Arterioscler Thromb 1992;12:789-99), which disclosures are hereby 

25 incorporated by reference in their entireties. 

One embodiment of the present invention relates to methods and compositions using the 
protein of SEQ ID NO:303 or fragments thereof as a stabilizing adjuvant to slow down protein 
degradation, boost the yields of recombinant proteins, prevent the aggregation of proteins or 
regenerate denatured proteins. In a preferred embodiment, the protein of SEQ ID NO:303 of 

30 fragment thereof is mixed with a composition comprising the protein for which it is desired to slow 
down degradation, boost yield, or regenerate denatured proteins under conditions which facilitate 
the desired result. For example, numerous commercial assay kits commonly used by those skilled 
in the arts of molecular biology and biochemistry depend on the biological properties of proteins 
(mostly enzymes) which can be very short-lived in vitro due to the low stability of those proteins. 

35 An example is described in Eur. Patent DE4 124286, the disclosure of which is incorporated herein 
by reference in its entirety, wherein the low intrinsic stability of test solutions used in optical tests is 
increased by addition of chaperone proteins, thus making the test more sensitive. Another example 
is given in US patent 6,013,488, which disclosure is hereby incorporated by reference in its entirety, 
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wherein a heat-labile reverse transcriptase is able to perform cDNA synthesis at high temperature 
levels in the presence of a chaperone. 

The protein of SEQ ID NO:303 may also be used to increase the yield or activity of 
recombinant proteins, preferably secreted proteins. In recombinant DNA technology, a major 
5 unsolved problem is the solubility and biological activity of the recombinantly overexpressed 
protein in a host, especially a bacterial or yeast host. Many eukaryotic proteins, especially the 
secreted ones, require for correct folding a specific cellular machinery w^hich is lacking in bacterial 
hosts such as E, coli or becomes insufficient in mammalian/yeast cells due to high expression of the 
protein. The ability of the protein of SEQ ID NO:303 or fragments thereof to ensure proper folding 

10 of recombinant proteins may be utilized as follows. The protein of SEQ ID NO:303, or fragment 
thereof, may be coexpressed with the recombinant protein in bacterial or eukaryotic hosts to cause 
the hosts to express the heterologous proteins or polypeptides in a form having increased solubility 
and/or biological activity. For example, the protein of SEQ ID NO:303 or fragments thereof may 
be used in the methods described in US patent 5,773,245, the disclosure of which is incorporated 

1 5 herein by reference in its entirety. Therefore the invention relates to a method for the correct 

folding, deaggregation or prevention of aggregation of a monomeric protein in vivo comprising: (a) 
constructing a host cell transformed with (i) a first DNA encoding a polypeptide having the amino 
acid sequence of a bioactive protein or a precursor thereof, wherein said polypeptide or precursor 
can aggregate within the cell to result in a multimeric, non-bioactive protein or precursor thereof 

20 and (ii) a second DNA which enable the cell to co-express the protein of the invention or part 
thereof with the said polypeptide or precursor, (b) growing said host cell for sufficient time under 
conditions wherein said first DNA and said second DNA express said bioactive protein and said 
protein of the invention, respectively; and (c) obtaining monomeric protein that is a bioactive 
protein. Alternatively the protein of SEQ ID NO:303 or fragments thereof may be exogeneously 

25 added to the cell cultures as described in PCT application WO 00/08 1 35, the disclosure of which is 
incorporated herein by reference in its entirety. 

The protein of SEQ ID NO:303 or fragments thereof, may further be used to regenerate 
denatured proteins. Recombinantly expressed proteins with poor biologival activity are routinely 
denatured with a potent denaturing agent, such as guanidine hydrochloride, followed by refolding 

30 by dilution with a large amount of a diluent to reduce the concentration of the denaturing agent. 
However, this method often results in a poor refolding rate which may be significantly increased by 
addition of a cocktail of chaperone proteins in a fashion similar to that described in Eur. Patent 
EP0650975, the disclosure of which is incorporated herein by reference in its entirety. The 
advantage of using a cocktail of chaperone proteins is to accommodate differences in binding 

35 specificity of the Hsp different families and the different members within each family. 

In another embodiment of the present invention, the protein of SEQ ID NO:303 may be 
used to promote tissue repair and/or increase cell survival in stress conditions such as hypoxy, 
oxidative stress, genotoxic agents and more generally harmful conditions leading to programmed 
cell death. Those conditions include but are not limited to infarction, heart surgery, stroke, 
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neurodegenerative diseases, epilepsy, trauma, atherosclerosis, restenosis after angioplasty, and 
nerve damage. 

In addition, the invention relates to compositions and methods for promoting cell growth 
both in vitro and in vivo using any of the techniques known to those skilled in the art including 
5 those described in the US patent 6,1 17,421 . For example, soluble forms of the protein of the 
invention or part thereof may be added to cell culture medium in an amount effective to stimulate 
cell proliferation. Alternatively, any of the gene therapy methods described herein may be used to 
overexpress the protein of the invention or part thereof in vivo. Alternatively, the protein of the 
invention or part thereof may be directly administered in an amount effective to promoting cell 

10 growth in said subject. These applications are particularly important in individuals suffering from 
wounds or tissue damage to enhance tissue repair, in individuals to which organ or skin grafts have 
been applied, in individuals suffering from an inflammatory condition or an allergic disease. 

In addition, the invention relates to compositions and methods for promoting 
immunosupression in a subject using any of the techniques known to those skilled in the art 

1 5 including those described in the US patent 6, 1 1 7,42 1 . For example, the protein of the invention or 
part thereof may be directly administered in an amount effective to achieve immunosupression in 
said subject. Alternatively, any of the gene therapy methods described herein may be used to 
overexpress the protein of the invention or part thereof in vivo. These applications are particularly 
important in cases in which immunosupression is desired such as in individuals suffering from 

20 autoimmune disease including any of the diseases cited above and in individuals that have received 
an heterologous graft they could reject. 

Ion transport Protein of SEP ID NO: 276 (internal designation DS38694) 

The protein of SEQ ID NO : 276 encoded by the cDNA of SEQ ID NO: 107 belongs to the 
25 FXDY family of small ion transport regulators or channels (Sweadner and Rael (2000) Genomics 
68:41-56, which disclosure is hereby incorporated by reference in its entirety). The protein of SEQ 
ID NO: 276 or part thereof plays a role in the control of ion transport. Preferred polypeptides of the 
invention are polypeptides comprising the amino acids of SEQ ID NO:276 from positions 9 to 63, 
or from positions 16 to 29. Other preferred polypeptides of the invention are fragments of SEQ ID 
30 NO: 276 having any of the biological activities described herein. The activity of the protein of the 
invention or part thereof may be assayed using any of the assays known to those skilled in the art 
including those described in . 

Osmoregulation occurs in all organisms, though the mechanisms differ according to the 
organism's environment. Fresh water inhabitants need to retain salts, whereas ocean inhabitants 
35 need to retain water. Terrestrial inhabitants need to conserve both water and salts. Organisms must 
balance these needs with a requirement to eliminate metabolic waste, such as nitrogenous waste, 
and generate secreted body fluids, such as saliva for digestion and sweat for thermoregulation. 
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In mammals, sweat glands, salivary glands, and the kidney all produce a primary secretion 
that is essentially isosmotic with blood and extracellular fluids. Modification of this primary 
secretion then occurs as much of the sodium chloride and water are reabsorbed as they pass through 
the excretory ducts of the glands and kidney, whereas potassium and bicarbonate ions are secreted. 
S This modification of the primary secretion is important in the sweat glands to conserve sodium 
chloride in hot environments, and in the salivary glands to conserve sodium chloride when 
excessive quantities of saliva are lost. This modification is critical in the kidney to maintain proper 
sodium and water balance in the extracellular fluids, a balance which also regulates arterial 
pressure. Loss of this modification activity by the duct cells causes a large loss of sodium and water, 

10 resulting in severe dehydration and low blood volume, and ultimately to circulatory collapse. 

Sodium absorption by the intestines, especially in the colon, is necessary to prevent loss of 
sodium in the stools. The loss of sodium absorption produces a failure to absorb anions and water as 
well. The unabsorbed sodium chloride and water then lead to diarrhea, with further loss of sodium 
chloride from the body. Other body fluids may be under regulation similar to that seen in the 

IS systems described above. For example, cerebrospinal fluid is produced by active sodium ion 
transport from the capillaries across the epithelium of the choroid plexus, which in turn attracts 
chloride ions and water. A counter flow of potassium and bicarbonate ions move out of the 
cerebrospinal fluid into the capillaries. A dysfunction in osmoregulation is associated with several 
disease states, including hyponatremia, renal failure, and hypematremia. (Strange, K. (1992) J Am. 

20 Soc. Nephrol. 3:12-27, which disclosure is hereby incorporated by reference in its entirety). 

In one embodiment, the protein of the invention may be useful in the diagnosis, prevention 
and/or treatment of osmoregulatory disorders including but not limited to diabetes insipidus, 
diarrhea, peritonitis, chronic renal failure, Addison's disease, syndrome of inappropriate antidiuretic 
hormone (SIADH), hypoaldosteronism, hyponatremia, adrenal insufficiency, hypothyroidism, 

25 hypematremia, hypokalemia, Barter's syndrome, Cushing's syndrome, metabolic acidosis, metabolic 
alkalosis, encephalopathy, edema, hypotension, and hypertension. For example, any of the gene 
therapy methods described herein may be used to overexpress the protein of the invention or part 
thereof in vivo. Alternatively, the protein of the invention or part thereof may be directly 
administered in an amount effective to promoting cell growth in said subject. For diagnostic 

30 purposes, the expression of the protein of the invention could be investigated using any of the 
Northern blotting, RT-PCR or immunoblotting methods described herein and compared to the 
expression in control individuals. For prevention and/or treatment purposes, the protein of the 
invention may be used to enhance ion transport and prevent or treat osmoregulatory disorders using 
any of the gene therapy methods described herein. 

35 
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Antibodies of the present invention have uses that include, but are not limited to, methods 
known in the art to purify, detect, and target the polypeptides of the present invention including 
both in vitro and in vivo diagnostic and therapeutic methods. An example of such use using 
5 immunoaffinity chromatography is given below. The antibodies of the present invention may be 
used either alone or in combination with other compositions. For example, the antibodies have use 
in immunoassays for qualitatively and quantitatively measuring levels of antigen-bearing substances, 
including the polypeptides of the present invention, in biological samples (See, e.g., Harlow et aL, 
1988). (Incorporated by reference in the entirety). The antibodies may also be used in therapeutic 

1 0 compositions for killing cells expressing the protein or reducing the levels of the protein in the body. 
The invention further relates to antibodies that act as agonists or antagonists of the 
polypeptides of the present invention. For example, the present invention includes antibodies that 
disrupt the receptor/1 igand interactions with the polypeptides of the invention either partially or 
fully. Included are both receptor-specific antibodies and ligand-specific antibodies. Included are 

15 receptor-specific antibodies, which do not prevent ligand binding but prevent receptor activation. 
Receptor activation (i.e., signaling) may be determined by techniques described herein or otherwise 
known in the art. Also include are receptor-specific antibodies which both prevent ligand binding 
and receptor activation. Likewise, included are neutralizing antibodies that bind the ligand and 
prevent binding of the ligand to the receptor, as well as antibodies that bind the ligand, thereby 

20 preventing receptor activation, but do not prevent the ligand from binding the receptor. Further 
included are antibodies that activate the receptor. These antibodies may act as agonists for either all 
or less than all of the biological activities affected by ligand-mediated receptor activation. The 
antibodies may be specified as agonists or antagonists for biological activities comprising specific 
activities disclosed herein. The above antibody agonists can be made using methods known in the 

25 art. See e.g., WO 96/40281; US Patent 5,81 1,097; Deng et al. (1998); Chen etaL (1998); Harrop et 
al ( 1 998); Zhu et al (1998); Yoon et al ( 1 998); Prat et al (1998); Pitard et al ( 1 997); Liautard et 
al (1997); Carlson et al (1997); Taryman et al (1995); Muller et al (1998); Bartunek et al (1996) 
(said references incorporated by reference in their entireties). 

As discussed above, antibodies of the polypeptides of the invention can, in turn, be utilized 

30 to generate anti-idiotypic antibodies that "mimic" polypeptides of the invention using techniques 
well known to those skilled in the art (See, e.g. Greenspan and Bona (1989) and Nissinoff (1991), 
which disclosures are hereby incorporated by reference in their entireties). For example, antibodies 
which bind to and competitively inhibit polypeptide multimerization or binding of a polypeptide of 
the invention to ligand can be used to generate anti-idiotypes that "mimic" the polypeptide 

35 multimerization or binding domain and, as a consequence, bind to and neutralize polypeptide or its 
ligand. Such neutralization anti-idiotypic antibodies can be used to bind a polypeptide of the 
invention or to bind its ligands/receptors, and thereby block its biological activity. 
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Antibodies prepared as described herein are coupled to a support. Preferably, the antibodies 
are monoclonal antibodies, but polyclonal antibodies may also be used. The support may be any of 
those typically employed in immunoaffmity chromatography, including Sepharose CL-4B 
5 (Pharmacia, Piscataway, NJ), Sepharose CL-2B (Pharmacia, Piscataway, NJ), Affi-gel 1 0 (Biorad, 
Richmond, CA), or glass beads. 

The antibodies may be coupled to the support using any of the coupling reagents typically 
used in immunoaffmity chromatography, including cyanogen bromide. After coupling the antibody 
to the support, the support is contacted with a sample which contains a target polypeptide whose 
10 isolation, purification or enrichment is desired. The target polypeptide may be a polypeptide 
selected from the group consisting of sequences of SEQ ID NOs: 170-338, 4S6-S60, 785-918 and 
polypeptides encoded by the clone inserts of the deposited clone pool, variants and fragments 
thereof, or a fusion protein comprising said selected polypeptide or a fragment thereof 

Preferably, the sample is placed in contact with the support for a sufficient amount of time 
1 5 and under appropriate conditions to allow at least 50% of the target polypeptide to specifically bind 
to the antibody coupled to the support. 

Thereafter, the support is washed with an appropriate wash solution to remove polypeptides 
which have non-specifically adhered to the support. The wash solution may be any of those 
typically employed in immunoaffinity chromatography, including PBS, Tris-lithium chloride buffer 
20 (O.IM lysine base and 0.5M lithium chloride, pH 8.0), Tris-hydrochloride buffer (0.05M Tris- 
hydrochloride, pH 8.0), or Tris/Triton/NaCI buflFer(50mM Tris.cl, pH 8.0 or 9.0, 0,1% Triton X- 
100, and O.SMNaCI). 

After washing, the specifically bound target polypeptide is eluted from the support using 
the high pH or low pH elution solutions typically employed in immunoaffinity chromatography. In 
25 particular, the elution solutions may contain an eluant such as triethanolamine, diethylamine, 
calcium chloride, sodium thiocyanate, potasssium bromide, acetic acid, or glycine. In some 
embodiments, the elution solution may also contain a detergent such as Triton X-100 or octyl-beta- 
D-glucoside. 

Expression of Genset gene products 

30 Spatial expression of the GENSET genes of the invention 

Tissue expression of the cDNAs of the present invention was examined. Tables III and IV 
lists the number of hits for the cDNAs in Genset's libraries of tissues and cell types as well as in 
public databases. The tissues and cell types examined for polynucleotide expression were, for 
Table 111: Brain; Fetal brain; Fetal kidney; Fetal liver; Pituitary gland; Liver;- Placenta; Prostate; 
35 Salivary gland; Stomach/Intestine; and Testis. For each cDNA referred to by its corresponding 
sequence identification number from the priority application (see Table I for corresponding SEQ ID 
NO in present application), the number of proprietary 5'ESTs (i.e. cDNA fragments) expressed in a 
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particular tissue referred to by its name is indicated in parentheses (second column). In addition, the 
bias in the spatial distribution of the polynucleotide sequences of the present invention was 
examined by comparing the relative proportions of the biological polynucleotides of a given tissue 
using the following statistical analysis. The under- or over-representation of a polynucleotide of a 
S given cluster in a given tissue was performed using the normal approximation of the binomial 
distribution. When the observed proportion of a polynucleotide of a given tissue in a given 
consensus had less than 1% chance to occur randomly according to the chi2 test, the frequency bias 
was reported as "preferred"". The results are given in Table V as follows. For each polynucleotide 
showing a bias in tissue distribution as referred to by its sequence identification number in the first 
10 column, the list of tissues where the polynucleotides are under-represented is given in the second 
column entitled "low expression" and the list of tissues where the polynucleotides are over- 
represented is given in the third column entitled "high expression". 

Evaluation of Expression Levels and Patterns of GENSET polypeptide-encoding mRNAs 

The spatial and temporal expression patterns of GENSET polypeptide-encoding mRNAs, as 

1 S well as their expression levels, may also be further determined as follows. 

Expression levels and patterns of GENSET polypeptide-encoding mRNAs may be analyzed 
by solution hybridization with long probes as described in International Patent Application No. WO 
97/05277, the entire contents of which are hereby incorporated by reference. Briefly, a GENSET 
polynucleotide, or fragment thereof, corresponding to the gene encoding the mRNA to be 

20 characterized is inserted at a cloning site immediately downstream of a bacteriophage (T3, T7 or 
SP6) RNA polymerase promoter to produce antisense KNA. Preferably, the GENSET 
polynucleotide is at least a 100 nucleotides in length. The plasmid is linearized and transcribed in 
the presence of ribonucleotides comprising modified ribonucleotides (i.e. biotin-UTP and DIG- 
UTP). An excess of this doubly labeled RNA is hybridized in solution with mRNA isolated from 

25 cells or tissues of interest. The hybridizations are performed under standard stringent conditions 
(40-50X for 16 hours in an 80% formamide, 0.4 M NaCI buffer, pH 7-8). The unhybridized probe 
is removed by digestion with ribonucleases specific for single-stranded RNA (i.e. RNases CL3, Tl, 
Phy M, U2 or A). The presence of the biotin-UTP modification enables capture of the hybrid on a 
microti tration plate coated with streptavidin. The presence of the DIG modification enables the 

30 hybrid to be detected and quantified by ELISA using an anti-DIG antibody coupled to alkaline 
phosphatase. 

The GENSET polypeptide-encoding cDNAs, or fragments thereof, may also be tagged with 
nucleotide sequences for the serial analysis of gene expression (SAGE) as disclosed in UK Patent 
Application No. 2 305 241 A, the entire contents of which are incorporated by reference. In this 
35 method, cDNAs are prepared from a cell, tissue, organism or other source of nucleic acid for which 
it is desired to determine gene expression patterns. The resulting cDNAs are separated into two 
pools. The cDNAs in each pool are cleaved with a first restriction endonuclease, called an 
'^anchoring enzyme," having a recognition site which is likely to be present at least once in most 



183 



wo 02/083898 PCT/IBO 1/009 14 

cDNAs. The fragments which contain the 5' or 3' most region of the cleaved cDNA are isolated by 
binding to a capture medium such as streptavidin coated beads. A first oligonucleotide linker 
having a first sequence for hybridization of an amplification primer and an internal restriction site 
for a "tagging endonuclease" is ligated to the digested cDNAs in the first pool. Digestion with the 
S second endonuclease produces short "tag'* fragments from the cDNAs. A second oligonucleotide 
having a second sequence for hybridization of an amplification primer and an internal restriction 
site is ligated to the digested cDNAs in the second pool. The cDNA fragments in the second pool 
are also digested with the "tagging endonuclease" to generate short "tag" fragments derived from 
the cDN As in the second pool. The "tags" resulting from digestion of the first and second pools 

1 0 with the anchoring enzyme and the tagging endonuclease are ligated to one another to produce 
"ditags." In some embodiments, the ditags are concatamerized to produce ligation products 
containing from 2 to 200 ditags. The tag sequences are then determined and compared to the 
sequences of the GENSET polypeptide-encoding cDNAs to determine which genes are expressed in 
the cell, tissue, organism, or other source of nucleic acids from which the tags were derived. In this 

1 5 way, the expression pattern of a GENSET polypeptide-encoding gene in the cell, tissue, organism, 
or other source of nucleic acids is obtained. 

Quantitative analysis of GENSET gene expression may also be performed using arrays. 
For example, quantitative analysis of gene expression may be performed with GENSET 
polynucleotides, or fragments thereof in a complementary DNA microarray as described by Schena 

20 et al. (1995 and 1996) which disclosures are hereby incorporated by reference in their entireties. 
GENSET polypeptide-encoding cDNAs or fragments thereof are amplified by PGR and arrayed 
from 96-well microtiter plates onto silylated microscope slides using high-speed robotics. Printed 
arrays are incubated in a humid chamber to allow rehydration of the array elements and rinsed, once 
in 0.2% SDS for I min, twice in water for 1 min and once for 5 min in sodium borohydride solution. 

25 The arrays are submerged in water for 2 min at 95°C, transferred into 0.2% SDS for 1 min, rinsed 
twice with water, air dried and stored in the dark at 2S^C. Cell or tissue mRNA is isolated or 
commercially obtained and probes are prepared by a single round of reverse transcription. Probes 
are hybridized to 1 cm^ microarrays under a 14 x 14 mm glass coverslip for 6-12 hours at 60^C. 
Arrays are washed for 5 min at 25*'C in low stringency wash buffer (IX SSC/0.2% SDS), then for 

30 10 min at room temperature in high stringency wash buffer (O.IX SSC/0.2% SDS). Arrays are 
scanned in O.IX SSC using a fluorescence laser scanning device fitted with a custom filter set. 
Accurate differential expression measurements are obtained by taking the average of the ratios of 
two independent hybridizations. 

Quantitative analysis of the expression of genes may also be performed with GENSET 

35 polypeptide-encoding cDNAs or fragments thereof in complementary DNA arrays as described by 
Pietu etal (1996), which disclosure is hereby incorporated by reference in its entirety. The 
GENSET polynucleotides of the invention or fragments thereof are PCR amplified and spotted on 
membranes. Then, mRNAs originating from various tissues or cells are labeled with radioactive 
nucleotides. After hybridization and washing in controlled conditions, the hybridized mRNAs are 
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detected by phospho-imaging or autoradiography. Duplicate experiments are performed and a 
quantitative analysis of differentially expressed mRNAs is then performed. 

Alternatively, expression analysis of GENSET genes can be done through high density 
nucleotide arrays as described by Lockhart et al ( 1 996) and Sosnowski et al ( 1 997), which 
S disclosures are hereby incorporated by reference in their entireties. Oligonucleotides of IS-SO 
nucleotides corresponding to sequences of a GENSET polynucleotide or fragments thereof are 
synthesized directly on the chip (Lockhart et al, supra) or synthesized and then addressed to the 
chip (Sosnowski et aL, supra). Preferably, the oligonucleotides are about 20 nucleotides in length. 
cDNA probes labeled with an appropriate compound, such as biotin, digoxigenin or fluorescent dye, 

10 are synthesized from the appropriate mRNA population and then randomly fragmented to an 
average size of 50 to 100 nucleotides. The said probes are then hybridized to the chip. After 
washing as described in Lockhart et al., (supra) and application of different electric fields 
(Sosnowsky et al, supra), the dyes or labeling compounds are detected and quantified. Duplicate 
hybridizations are performed. Comparative analysis of the intensity of the signal originating from 

1 5 cDNA probes on the same target oligonucleotide in different cDNA samples indicates a differential 
expression of the GENSET polypeptide-encoding mRNA. 

Uses of GENSET gene expression data 

Once the expression levels and patterns of a GENSET polypeptide-encoding mRNA has 
been determined using any technique known to those skilled in the art, in particular those described 
20 in the section entitled "Evaluation of Expression Levels and Patterns of GENSET polypeptide- 
encoding mRNAs", or using the instant disclosure, these information may be used to design 
GENSET gene specific markers for detection, identification, screening and diagnosis purposes as 
well as to design DNA constructs with an expression pattern similar to a GENSET gene expression 
pattern. 

25 Detection of GENSET polvpeptide expression and/or biological activity 

The invention further relates to methods of detection of GENSET polypeptide expression 
and/or biological activity in a biological sample using the polynucleotide and polypeptide sequences 
described herein. Such method scan be used, for example, as a screen for normal or abnormal 
GENSET polypeptide expression and/or biological activity and, thus, can be used diagnostically. 
30 The biological sample for use in the methods of the present invention includes a suitable sample 
from, for example, a mammal, particularly a human. For example, the sample can be issued from 
tissues or cell lines having the same origin as tissues or cell lines in which the polypeptide is known 
to be expressed, e.g. using data from Tables III, IV, or V. 

Detection of GENSET polypeptides 

35 The invention further relates to methods of detection of GENSET polypeptide or encoding 

polynucleotides in a sample using the sequences described herein and any techniques known to 
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those skilled in the art. For example, a labeled polynucleotide probe having all or a functional 
portion of the nucleotide sequence of a GENSET polypeptide-encoding polynucleotide can be used 
in a method to detect a GENSET polypeptide-encoding polynucleotide in a sample. In one 
embodiment, the sample is treated to render the polynucleotides in the sample available for 
S hybridization to a polynucleotide probe, which can be DNA or RNA. The resulting treated sample 
is combined with a labeled polynucleotide probe having all or a portion of the nucleotide sequence 
of the GENSET polypeptide-encoding cDNA or genomic sequence, under conditions appropriate 
for hybridization of complementary sequences to occur. Detection of hybridization of 
polynucleotides from the sample with the labeled nucleic probe indicates the presence of GENSET 

10 polypeptide-encoding polynucleotides in a sample. The presence of GENSET polypeptide- 
encoding mRNA is indicative of GENSET polypeptide-encoding gene expression. 

Consequently, the invention comprises methods for detecting the presence of a 
polynucleotide comprising a nucleotide sequence selected from a group consisting of the sequences 
of SEQ ID N0s;l-1 69, 339-455, 561-784, the sequences of clone inserts of the deposited clone 

1 5 pool, sequences fully complementary thereto, fragments and variants thereof in a sample. In a first 
embodiment, said method comprises the following steps of: 

a) bringing into contact said sample and a nucleic acid probe or a plurality of nucleic acid 
probes which hybridize to said selected nucleotide sequence; and 

b) detecting the hybrid complex formed between said probe or said plurality of probes and 
20 said polynucleotide. 

In a preferred embodiment of the above detection method, said nucleic acid probe or said 
plurality of nucleic acid probes is labeled with a detectable molecule. In another preferred 
embodiment of the above detection method, said nucleic acid probe or said plurality of nucleic acid 
probes has been immobilized on a substrate. In still another preferred embodiment, said nucleic 
25 acid probe or said plurality of nucleic acid probes has a sequence comprised in a sequence 
complementary to said selected sequence. 

In a second embodiment, said method comprises the steps of: 

a) contacting said sample with amplification reaction reagents comprising a pair of 
amplification primers located on either side of the region of said nucleotide sequence to be 

30 amplified; 

b) performing an amplification reaction to synthesize amplification products containing said 
region of said selected nucleotide sequence; and 

c) detecting said amplification products. 

In a preferred embodiment of the above detection method, when the polynucleotide to be 
35 amplified is a RNA molecule, preliminary reverse transcription and synthesis of a second cDNA 
strand are necessary to provide a DNA template to be amplified. In another preferred embodiment 
of the above detection method, the amplification product is detected by hybridization with a labeled 
probe having a sequence which is complementary to the amplified region, in still another preferred 
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embodiment, at least one of said amplification primer has a sequence comprised in said selected 
sequence or in the sequence complementary to said selected sequence. 

Alternatively, a method of detecting GENSET polypeptide expression in a test sample can 
be accomplished using any product which binds to a GENSET olypeptide of the present invention 
S or a portion of a GENSET polypeptide. Such products may be antibodies, binding fi-agments of 
antibodies, polypeptides able to bind specifically to GENSET polypeptides or fragments thereof, 
including GENSET polypeptide agonists and antagonists. Detection of specific binding to the 
antibody indicates the presence of a GENSET polypeptide in the sample (e.g., ELISA). 

Consequently, the invention is also directed to a method for detecting specifically the 
10 presence of a GENSET polypeptide according to the invention in a biological sample, said method 
comprising the steps of: 

a) bringing into contact said biological sample with a product able to bind to a polypeptide 
of the invention or fragments thereof; 

b) allowing said product to bind to said polypeptide to form a complex; and 
1 5 b) detecting said complex. 

In a preferred embodiment of the above detection method, the product is an antibody. In a 
more preferred embodiment, said antibody is labeled with a detectable molecule. In another more 
preferred embodiment of the above detection method, said antibody has been immobilized on a 
substrate. 

20 In addition, the invention also relates to methods of determining whether a GENSET gene 

product (e.g. a polynucleotide or polypeptide) is present or absent in a biological sample, said 
methods comprising the steps of: 

a) obtaining said biological sample from a human or non-human animal, preferably a 
mammal; 

25 b) contacting said biological sample with a product able to bind to a GENSET polypeptide 

or encoding polynucleotide of the invention; and 

c) determining the presence or absence of said GENSET polypeptide-encoding gene 
product in said biological sample. 

The present invention also relates to kits that can be used in the detection of GENSET 
30 polypeptide-encoding gene expression products. The kit can comprise a compound that specifically 
binds a GENSET polypeptide (e.g. binding proteins, antibodies or binding fragments thereof (e.g. 
F(ab')2 fragments) or a GENSET polypeptide-encoding mRNA (e.g. a complementary probe or 
primer), for example, disposed within a container means. The kit can further comprise ancillary 
reagents, including buffers and the like. 

3 5 Detection of GENSET polypeptide biological activity 

The invention further includes methods of detecting specifically a GENSET polypeptide 
biological activity, and to identify compounds capable of modulating the activity of a GENSET 
polypeptide. Assessing the GENSET polypeptide biological activity may be performed by the 
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detection of a change in any cellular property associated with the GENSET polypeptide, using a 
variety of techniques, including those described herein. To identify modulators of the polypeptides, 
a control is preferably used. For example, a control sample includes all of the same reagents but 
lacks the compound or agent being assessed; it is treated in the same manner as the test sample. A 
S number of potentially assayable biological activities for many of the herein-described proteins are 
described supra, under the heading, "Uses of polypeptides of the invention." 

The present invention also relates to kits that can be used in the detection of GENSET 
polypeptide biological activity. The kit can comprise, e.g. substrates for GENSET polypeptides, 
GENSET-binding compounds, antibodies to GENSET polypeptides, etc., for example, disposed 
10 within a container means. The kit can further comprise ancillary reagents, including buffers and the 
like. 

Identification of a specific context of GENSET polvpeptide-encoding gene expression 

When the expression pattern of a GENSET polypeptide-encoding mRNA shows that a 
GENSET polypeptide-encoding gene is specifically expressed in a given context, probes and 

1 5 primers specific for this gene as well as antibodies binding to the GENSET polypeptide-encoding 
polynucleotide may then be used as markers for the specific context. Examples of specific contexts 
are: specific expression in a given tissue/cell or tissue/cell type (see, e.g., Tables III-V) , expression 
at a given stage of development of a process such as embryo development or disease development, 
or specific expression in a given organelle. Such primers, probes, and antibodies are useful 

20 commercially to identify tissues/cells/organelles of unknown origin, for example, forensic samples, 
differentiated tumor tissue that has metastasized to foreign bodily sites, or to differentiate different 
tissue types in a tissue cross-section using any technique known to those skilled in the art including 
in situ PGR or immunochemistry for example. 

For example, the cDNAs and proteins of the sequence listing and fragments thereof, may be 

2S used to distinguish human tissues/cells from non-human tissues/cells and to distinguish between 
human tissues/cells/organelles that do and do not express the polynucleotides comprising the 
cDNAs. By knowing the expression pattern of a given GENSET polypeptide, either through 
routine experimentation or by using the instant disclosure, the polynucleotides and polypeptides of 
the present invention may be used in methods of determining the identity of an unknown tissue/cell 

30 sample/organelle. As part of determining the identity of an unknown tissue/cell sample/organelle, 
the polynucleotides and polypeptides of the present invention may be used to determine what the 
unknown tissue/cell sample is and what the unknown sample is not. For example, if a cDNA is 
expressed in a particular tissue/cell type/organelle, and the unknown tissue/cell sample/organelle 
does not express the cDNA, it may be inferred that the unknown tissue/cells are either not human or 

35 not the same human tissue/cell type/organelle as that which expresses the cDNA. These methods of 
determining tissue/cell/organelle identity are based on methods which detect the presence or 
absence of the mRNA (or corresponding cONA) in a tissue/cell sample using methods well know in 
the art (e.g., hybridization, PGR based methods, immunoassays, immunochemistry, ELISA). 
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Examples of such techniques are described in more detail below. Therefore, the invention 
encompasses uses of the polynucleotides and polypeptides of the invention as tissue markers. In a 
preferred embodiment, polynucleotides preferentially expressed in given tissues as indicated in 
Tables III-V and polypeptides encoded by such polynucleotides are used for this purpose. The 
5 invention also encompasses uses of polypeptides of the invention as organelle markers. 

Consequently, the present invention encompasses methods of identification of a tissue/cell 
type/subcellular compartment, wherein said method includes the steps of: 

a) contacting a biological sample which identity is to be assayed with a product able to 
bind a GENSET gene product; and 
1 0 b) determining whether a GENSET gene product is expressed in said biological 

sample. 

Products that are able to bind specifically to a GENSET gene product, namely a GENSET 
polypeptide or a GENSET polypeptide-encoding mRNA, include GENSET polypeptide binding 
proteins, antibodies or binding fragments thereof (e.g. F(ab')2 fragments), as well as GENSET 
1 S polynucleotide complementary probes and primers. 

Step b) may be performed using any detection method known to those skilled in the art 
including those disclosed herein, especially in the section entitled "Detection of GENSET 
polypeptide expression and/or biological activity". 

Identification of Tissue Types or Cell Species by Means of Labeled Tissue Specific Antibodies 

20 Identification of specific tissues is accomplished by the visualization of tissue specific 

antigens by means of antibody preparations which are conjugated, directly (e.g., green fiuorescent 
protein) or indirectly to a detectable marker. Selected labeled antibody species bind to their specific 
antigen binding partner in tissue sections, cell suspensions, or in extracts of soluble proteins from a 
tissue sample to provide a pattern for qualitative or semi-qualitative interpretation. 

25 Antisera for these procedures must have a potency exceeding that of the native preparation, 

and for that reason, antibodies are concentrated to a mg/ml level by isolation of the gamma globulin 
fraction, for example, by ion*exchange chromatography or by ammonium sulfate fractionation. 
Also, to provide the most specific antisera, unwanted antibodies, for example to common proteins, 
must be removed from the gamma globulin fraction, for example by means of insoluble 

30 immunoabsorbents, before the antibodies are labeled with the marker. Either monoclonal or 
heterologous antisera is suitable for either procedure. 

A. Immunohistochemical Techniques 

Purified, high-titer antibodies, prepared as described above, are conjugated to a detectable 
marker, as described, for example, by Fudenberg, (1980) or Rose et al^ (1980), which disclosures 
35 are hereby incorporated by reference in their entireties. 

A fluorescent marker, either fluorescein or rhodamine, is preferred, but antibodies can also 
be labeled with an en^me that supports a color producing reaction with a substrate, such as 
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horseradish peroxidase. Markers can be added to tissue-bound antibody in a second step, as 
described below. Alternatively, the specific anti-tissue antibodies can be labeled with ferritin or 
other electron dense particles, and localization of the ferritin coupled antigen-antibody complexes 
achieved by means of an electron microscope. In yet another approach, the antibodies are 
S radiolabeled, with, for example '^'l, and detected by overlaying the antibody treated preparation 
with photographic emulsion. Preparations to carry out the procedures can comprise monoclonal or 
polyclonal antibodies to a single protein or peptide identified as specific to a tissue type, for 
example, brain tissue, or antibody preparations to several antigenically distinct tissue specific 
antigens can be used in panels, independently or in mixtures, as required. Tissue sections and cell 

10 suspensions are prepared for immunohistochemical examination according to common histological 
techniques. Multiple cryostat sections (about 4 urn, unfixed) of the unknown tissue and known 
control, are mounted and each slide covered with different dilutions of the antibody preparation. 
Sections of known and unknown tissues should also be treated with preparations to provide a 
positive control, a negative control, for example, pre-immune sera, and a control for non-specific 

1 S staining, for example, buffer. Treated sections are incubated in a humid chamber for 30 min at 
room temperature, rinsed, then washed in buffer for 30-45 min. Excess fluid is blotted away, and 
the marker developed. If the tissue specific antibody was not labeled in the first incubation, it can 
be labeled at this time in a second antibody-antibody reaction, for example, by adding fluorescein- 
or enzyme-conjugated antibody against the immunoglobulin class of the antiserum-producing 

20 species, for example, fluorescein labeled antibody to mouse IgG. Such labeled sera are 

commercially available. The antigen found in the tissues by the above procedure can be quantified 
by measuring the intensity of color or fluorescence on the tissue section, and calibrating that signal 
using appropriate standards. 

B. Identification of Tissue Specific Soluble Proteins 

25 The visualization of tissue specific proteins and identification of unknown tissues from that 

procedure is carried out using the labeled antibody reagents and detection strategy as described for 
immunohistochemistry; however the sample is prepared according to an electrophoretic technique 
to distribute the proteins extracted from the tissue in an orderly array on the basis of molecular 
weight for detection. A tissue sample is homogenized using a Virtis apparatus; cell suspensions are 

30 disrupted by Dounce homogenization or osmotic lysis, using detergents in either case as required to 
disrupt cell membranes, as is the practice in the art. Insoluble cell components such as nuclei, 
microsomes, and membrane fragments are removed by ultracentrifugation, and the soluble protein- 
containing fraction concentrated if necessary and reserved for analysis. A sample of the soluble 
protein solution is resolved into individual protein species by conventional SDS polyacrylamide 

35 electrophoresis as described, for example, by Davis et al^ Section 19-2 (1986), using a range of 
amounts of polyacrylamide in a set of gels to resolve the entire molecular weight range of proteins 
to be detected in the sample. A size marker is run in parallel for purposes of estimating molecular 
weights of the constituent proteins. Sample size for analysis is a convenient volume of from 5 toSS 
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ul, and containing from about 1 to 100 ug protein. An aliquot of each of the resolved proteins is 
transferred by blotting to a nitrocellulose filter paper, a process that maintains the pattern of 
resolution. Multiple copies are prepared. The procedure, known as Western Blot Analysis, is well 
described in Davis et al^ (1986) Section 19-3. One set of nitrocellulose blots is stained with 
5 Coomassie Blue dye to visualize the entire set of proteins for comparison with the antibody bound 
proteins. The remaining nitrocellulose filters are then incubated with a solution of one or more 
specific antisera to tissue specific proteins prepared as described herein. In this procedure, as in 
procedure A above, appropriate positive and negative sample and reagent controls are run. 

In either procedure A or B, a detectable label can be attached to the primary tissue antigen- 
ic primary antibody complex according to various strategies and permutations thereof. In a 

straightforward approach, the primary specific antibody can be labeled; alternatively, the unlabeled 
complex can be bound by a labeled secondary anti-IgG antibody. In other approaches, either the 
primary or secondary antibody is conjugated to a biotin molecule, which can, in a subsequent step, 
bind an avidin conjugated marker. According to yet another strategy, enzyme labeled or radioactive 
1 5 protein A, which has the property of binding to any IgG, is bound in a final step to either the 

primary or secondary antibody. The visualization of tissue specific antigen binding at levels above 
those seen in control tissues to one or more tissue specific antibodies, prepared from the gene 
sequences identified from cDNA sequences, can identify tissues of unknowrn origin, for example, 
forensic samples, or differentiated tumor tissue that has metastasized to foreign bodily sites. 

20 Screening and diagnosis of abnormal GENSET polvpeptide expression and/or biological activity 

Moreover, antibodies and/or primers specific for GENSET polypeptide expression may also 
be used to identify abnormal GENSET polypeptide expression and/or biological activity, and 
subsequently to screen and/or diagnose disorders associated with abnormal GENSET polypeptide 
expression. For example, a particular disease may resuh from lack of expression, over expression, 

25 or under expression of a GENSET polypeptide-encoding mRNA. By comparing mRNA expression 
patterns and quantities in samples taken from healthy individuals with those from individuals 
suffering from a particular disorder, genes responsible for this disorder may be identified. Primers, 
probes and antibodies specific for this GENSET polypeptide may then be used to elaborate kits of 
screening and diagnosis for a disorder in which the gene of interest is specifically expressed or in 

30 which its expression is specifically dysregulated, i.e. underexpressed or overexpressed. 

Screening for specific disorders 

The present invention also relates to methods and uses of GENSET polypeptides for 
identifying individuals having elevated or reduced levels of GENSET polypeptides, which 
individuals are likely to benefit from therapies to suppress or enhance GENSET polypeptide- 
35 encoding gene expression, respectively. One example of such methods and uses comprises the 
steps of: 

a) obtaining from a mammal a biological sample; 
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b) detecting the presence in said sample of a GENSET polypeptide-encoding gene product 
(mRNA or protein); 

c) comparing the amount of said GENSET polypeptide-encoding gene product present in 
said sample with that of a control sample; and 

5 d) determing whether said human or non-human mammal has a reduced or elevated level of 

GENSET gene expression compared to the control sample. 

A biological sample from a subject affected by, or at risk of developing, any disease or 
condition associated with a GENSET polypeptide can be screened for the presence of increased or 
decreased levels of GENSET gene product, relative to a normal population (standard or control), 

10 with an increased or decreased level of the GENSET polypeptide relative to the normal population 
being indicative of predisposition to or a present indication of the disease or condition, or any 
sympton associated with the disease or condition. Such individuals would be candidates for 
therapies, e.g., treatment with pharmaceutical compositions comprising the GENSET polypeptide, a 
polynucleotide encoding the GENSET polypeptide, or any other compound that affects the 

1 5 expression or activity of the GENSET polypeptide. Generally, the identification of elevated levels 
of the GENSET polypeptide in a patient would be indicative of an individual that would benefit 
from treatment with agents that suppress GENSET polypeptide expression or activity, and the 
identification of low levels of the GENSET polypeptide in a patient would be indicative of an 
individual that would benefit from agents that induce GENSET expression or activity. 

20 Biological samples suitable for use in this method include any biological fluids, including, 

but not limited to, blood, saliva, milk, and urine. Tissue samples (e.g. biopsies) can also be used in 
the method of the invention, including samples derived from any tissue associated with GENSET 
gene expression (see, e.g. Tables Ill-V). Cell cultures or cell extracts derived, for example, from 
tissue biopsies can also be used. The detection step of the present method can be performed using 

25 standard protocols for protein/mRNA detection. Examples of suitable protocols include Northern 
blot analysis, immunoassays (e.g. RIA, Western blots, immunohistochemical analyses), and PGR. 

Thus, the present invention further relates to methods and uses of GENSET polypeptides 
for identifying individuals or non-human animals at increased risk for developing, or present state 
of having, certain diseases/disorders associated with abnormal GENSET polypeptide expression or 

30 biological activity. One example of such methods comprises the steps of: 

a) obtaining from a human or non-human mammal a biological sample; 

b) detecting the presence in said sample of a GENSET gene product (mRNA or protein); 

c) comparing the amount of said GENSET gene product present in said sample with that of 
a control sample; and 

35 d) determing whether said human or non-human mammal is at increased risk for 

developing, or present state of having, a diseases or disorder. 

In preferred embodiments, the biological sample is taken from animals presenting any 
symptom associated with any disease or condition associated with a GENSET gene product. In 
accordance with this method, the presence in the sample of altered (e.g. increased or decreased) 
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levels of the GENSET product indicates that the subject is predisposed to the disease or condition. 
Biological samples suitable for use in this method include biological fluids including, but not 
limited to, blood, saliva, milk, and urine. Tissue samples (e.g. biopsies) can also be used in the 
method of the invention, including samples derived fh)m any of the tissues listed in Tables Ill-V. 
5 Cell cultures or cell extracts derived, for example, from tissue biopsies can also be used. 

The diagnostic methodologies described herein are applicable to both humans and 
non-human mammals. 

Detection of GENSET gene mutations 

The invention also encompasses methods and uses of GENSET polynucleotides to detect 

10 mutations in GENSET polynucleotides of the invention. Such methods may advantageously be 
used to detect mutations occurring in GENSET genes and preferably in their regulatory regions. 
When the mutation was proven to be associated with a disease, the detection of such mutations may 
be used for screening and diagnosis purposes. 

In one embodiment of the oligonucleotide arrays of the invention, an oligonucleotide probe 

1 5 matrix may advantageously be used to detect mutations occurring in GENSET genes and preferably 
in their regulatory regions. For this particular purpose, probes are specifically designed to have a 
nucleotide sequence allowing their hybridization to the genes that carry known mutations (either by 
deletion, insertion or substitution of one or several nucleotides). By known mutations, it is meant, 
mutations on the GENSET genes that have been identified according, for example to the technique 

20 used by Huang et a/.(1996) or Samson et al. {1996% which disclosures are hereby incorporated by 
reference in their entireties. 

Another technique that is used to detect mutations in GENSET genes is the use of a high- 
density DNA array. Each oligonucleotide probe constituting a unit element of the high density 
. DNA array is designed to match a specific subsequence of a GENSET genomic DNA or cDNA. 

2S Thus, an array consisting of oligonucleotides complementary to subsequences of the target gene 
sequence is used to determine the identity of the target sequence with the wild gene sequence, 
measure its amount, and detect differences between the target sequence and the reference wild gene 
sequence of the GENSET gene. In one such design, termed 4L tiled array, is implemented a set of 
four probes (A, C, G, T), preferably 1 5-nucleotide oligomers. In each set of four probes, the perfect 

30 complement will hybridize more strongly than mismatched probes. Consequently, a nucleic acid 
target of length L is scanned for mutations with a tiled array containing 4L probes, the whole probe 
set containing all the possible mutations in the known wild reference sequence. The hybridization 
signals of the 1 5-mer probe set tiled array are perturbed by a single base change in the target 
sequence. As a consequence, there is a characteristic loss of signal or a "footprint" for the probes 

35 flanking a mutation position. This technique was described by Chee et aL in 1996, which disclosure 
is hereby incorporated by reference in its entirety. 
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Construction of DNA constructs with a GENSET gene expression pattern 



In addition, characterization of the spatial and temporal expression patterns and expression 
levels of GENSET polypeptide-encoding mRNAs is also usefiil for constructing expression vectors 
capable of producing a desired level of gene product in a desired spatial or temporal manner, as 
5 discussed below. 

DNA Constructs That Direct Temporal And Spatial GENSET Gene Expression In Recombinant Cell 
Hosts And In Transgenic Animals. 

In order to study the physiological and phenotypic consequences of a lack of synthesis of a 
GENSET polypeptide, both at the cellular level and at the multi cellular organism level, the 

1 0 invention also encompasses DNA constructs and recombinant vectors enabling a conditional 

expression of a specific allele of a GENSET polypeptide-encoding genomic sequence or cDNA and 
also of a copy of this genomic sequence or cDNA harboring substitutions, deletions, or additions of 
one or more bases as regards to a nucleotide sequence selected from the group consisting of 
sequences of SEQ ID NOs: 1-169, 339-455, 561-784 and sequences of clone inserts of the deposited 

1 5 clone pool, or a fragment thereof, these base substitutions, deletions or additions being located 
either in an exon, an intron or a regulatory sequence, but preferably in the S'-reguIatory sequence or 
in an exon of the GENSET polypeptide-encoding genomic sequence or within the GENSET 
polypeptide-encoding cDNA. 

A first preferred DNA construct is based on the tetracycline resistance operon tet from £. 

20 coli transposon Tn 1 0 for controlling the GENSET gene expression, such as described by Gossen et 
al. (1992, 1995) and Furth et al. (1994), which disclosures are hereby incorporated by reference in 
their entireties. Such a DNA construct contains seven tet operator sequences from TnlO (tetop) that 
are fused to either a minimal promoter or a 5'-regulatory sequence of the GENSET gene, said 
minimal promoter or said GENSET polynucleotide regulatory sequence being operably linked to a 

25 polynucleotide of interest that codes either for a setise or an antisense oligonucleotide or for a 
polypeptide, including a GENSET polypeptide, or a peptide fragment thereof. This DNA construct 
is functional as a conditional expression system for the nucleotide sequence of interest when the 
same cell also comprises a nucleotide sequence coding for either the wild type (tTA) or the mutant 
(rTA) repressor fused to the activating domain of viral protein VP 16 of herpes simplex virus, placed 

30 under the control of a promoter, such as the HCMVIEl enhancer/promoter or the MMTV-LTR. 
Indeed, a preferred DNA construct of the invention comprise both the polynucleotide containing the 
tet operator sequences and the polynucleotide containing a sequence coding for the tTA or the rTA 
repressor. In a specific embodiment, the conditional expression DNA construct contains the 
sequence encoding the mutant tetracycline repressor rTA, the expression of the polynucleotide of 

35 interest is silent in the absence of tetracycline and induced in its presence. 

DNA Constructs Allowing Homologous Recombination: Replacement Vectors 
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A second preferred DNA construct will comprise, from 5*-end to 3 '-end: (a) a first 
nucleotide sequence that is found in the GENSET polypeptide-encoding genomic sequence; (b) a 
nucleotide sequence comprising a positive selection marker, such as the marker for neomycin 
resistance (neo); and (c) a second nucleotide sequence that is found in the GENSET polypeptide- 
S encoding genomic sequence, and is located on the genome downstream the first GENSET 
polypeptide-encoding nucleotide sequence (a). 

In a preferred embodiment, this DNA construct also comprises a negative selection marker 
located upstream of the nucleotide sequence (a) or downstream from the nucleotide sequence (c). 
Preferably, the negative selection marker comprises the thymidine kinase (tk) gene (Thomas et al,, 
10 1986), the hygromycine beta gene (Te Riele et al, 1990), the hprt gene ( Van der Lugt et al, 1991; 
Reid et al, 1990) or the Diphteria toxin A fragment {Dt-A) gene (Nada et al, 1993; Yagi et 
al 1990), which disclosures are hereby incorporated by reference in their entireties. Preferably, the 
positive selection marker is located within a GENSET exon sequence so as to interrupt the sequence 
encoding a GENSET polypeptide. These replacement vectors are described, for example, by 
15 Thomas e/a/.(1986; 1987), Mansourer a/.(1988) and Roller e/«/.(1992). 

The first and second nucleotide sequences (a) and (c) may be indifferently located within a 
GENSET polypeptide-encoding regulatory sequence, an intronic sequence, an exon sequence or a 
sequence containing both regulatory and/or intronic and/or exon sequences. The size of the 
nucleotide sequences (a) and (c) ranges from I to SO kb, preferably from I to 10 kb, more 
20 preferably from 2 to 6 kb and most preferably from 2 to 4 kb. 

DNA Constructs Allowing Homologous Recombination: Cre-LoxP System, 

These new DNA constructs make use of the site specific recombination system of the PI 
phage. The PI phage possesses a recombinase called Cre which interacts specifically with a 34 
base pairs loxP site. The loxP site is composed of two palindromic sequences of 13 bp separated by 

25 a 8 bp conserved sequence (Hoess et al^ 1986), which disclosure is hereby incorporated by 
reference in its entirety. The recombination by the Cre enzyme between two lox? sites having an 
identical orientation leads to the deletion of the DNA fragment. 

The Cre-/oxP system used in combination with a homologous recombination technique has 
been first described by Gu et al (1993, 1994), which disclosures are hereby incorporated by 

30 reference in their entireties. Briefly, a nucleotide sequence of interest to be inserted in a targeted 
location of the genome harbors at least two lox? sites in the same orientation and located at the 
respective ends of a nucleotide sequence to be excised from the recombinant genome. The excision 
event requires the presence of the recombinase (Cre) enzyme within the nucleus of the recombinant 
cell host. The recombinase enzyme may be brought at the desired time either by (a) incubating the 

35 recombinant cell hosts in a culture medium containing this enzyme, by injecting the Cre enzyme 
directly into the desired cell, such as described by Araki et al (1995), which disclosure is hereby 
incorporated by reference in its entirety, or by lipofection of the enzyme into the cells, such as 
described by Baubonis etal (1993), which disclosure is hereby incorporated by reference in its 
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entirety; (b) transfecting the cell host with a vector comprising the Cre coding sequence operably 
linked to a promoter functional in the recombinant cell host, which promoter being optionally 
inducible, said vector being introduced in the recombinant cell host, such as described by Gu e/ al 
(1993) and Sauer et ai (1988), which disclosures are hereby incorporated by reference in their 
5 entireties; (c) introducing in the genome of the cell host a polynucleotide comprising the Cre coding 
sequence operably linked to a promoter functional in the recombinant cell host, which promoter is 
optionally inducible, and said polynucleotide being inserted in the genome of the cell host either by 
a random insertion event or an homologous recombination event, such as described by Gu et 
a/,(1994). 

10 In a specific embodiment, the vector containing the sequence to be inserted in the GENSET 

gene by homologous recombination is constructed in such a way that selectable markers are flanked 
by lox? sites of the same orientation, it is possible, by treatment by the Cre enzyme, to eliminate the 
selectable markers while leaving the GENSET sequences of interest that have been inserted by an 
homologous recombination event. Again, two selectable markers are needed: a positive selection 

1 S marker to select for the recombination event and a negative selection marker to select for the 

homologous recombination event. Vectors and methods using the Cre-toP system are described by 
Zou et al (1994), which disclosure is hereby incorporated by reference in its entirety. 

Thus, a third preferred DNA construct of the invention comprises, from 5*-end to 3'-end: 
(a) a first nucleotide sequence that is comprised in the GENSET genomic sequence; (b) a nucleotide 

20 sequence comprising a polynucleotide encoding a positive selection marker, said nucleotide 

sequence comprising additionally two sequences defining a site recognized by a recombinase, such 
as a lox? site, the two sites being placed in the same orientation; and (c) a second nucleotide 
sequence that is comprised in the GENSET genomic sequence, and is located on the genome 
downstream of the first GENSET nucleotide sequence (a). 

25 The sequences defining a site recognized by a recombinase, such as a lox? site, are 

preferably located within the nucleotide sequence (b) at suitable locations bordering the nucleotide 
sequence for which the conditional excision is sought. In one specific embodiment, two lox? sites 
are located at each side of the positive selection marker sequence, in order to allow its excision at a 
desired time after the occurrence of the homologous recombination event. 

30 In a preferred embodiment of a method using the third DNA construct described above, the 

excision of the polynucleotide fragment bordered by the two sites recognized by a recombinase, 
preferably two loxP sites, is performed at a desired time, due to the presence within the genome of 
the recombinant host cell of a sequence encoding the Cre enzyme operably linked to a promoter 
sequence, preferably an inducible promoter, more preferably a tissue-specific promoter sequence 

35 and most preferably a promoter sequence which is both inducible and tissue-specific, such as 
described by Gu et a/. (1994). 

The presence of the Cre enzyme within the genome of the recombinant cell host may result 
from the breeding of two transgenic animals, the first transgenic animal bearing the GENSET- 
derived sequence of interest containing the lox? sites as described above and the second transgenic 
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entirety; (b) transfecting the cell host with a vector comprising the Cre coding sequence operably 
linked to a promoter functional in the recombinant cell host, which promoter being optionally 
inducible, said vector being introduced in the recombinant cell host, such as described by Gu et al 
(1993) and Sauer et al (1988), which disclosures are hereby incorporated by reference in their 
5 entireties; (c) introducing in the genome of the cell host a polynucleotide comprising the Cre coding 
sequence operably linked to a promoter functional in the recombinant cell host, which promoter is 
optionally inducible, and said polynucleotide being inserted in the genome of the cell host either by 
a random insertion event or an homologous recombination event, such as described by Gu et 
ii/.(1994). 

10 In a specific embodiment, the vector containing the sequence to be inserted in the GENSET 

gene by homologous recombination is constructed in such a way that selectable markers are flanked 
by lox? sites of the same orientation, it is possible, by treatment by the Cre enzyme, to eliminate the 
selectable markers while leaving the GENSET sequences of interest that have been inserted by an 
homologous recombination event. Again, two selectable markers are needed: a positive selection 

1 5 marker to select for the recombination event and a negative selection marker to select for the 

homologous recombination event Vectors and methods using the Cre-/ojcP system are described by 
Zou et al (1994), which disclosure is hereby incorporated by reference in its entirety. 

Thus, a third preferred DNA construct of the invention comprises, from S'-end to 3'-end: 
(a) a first nucleotide sequence that is comprised in the GENSET genomic sequence; (b) a nucleotide 

20 sequence comprising a polynucleotide encoding a positive selection marker, said nucleotide 

sequence comprising additionally two sequences defining a site recognized by a recombinase, such 
as a lox? site, the two sites being placed in the same orientation; and (c) a second nucleotide 
sequence that is comprised in the GENSET genomic sequence, and is located on the genome 
downstream of the first GENSET nucleotide sequence (a). 

25 The sequences defining a site recognized by a recombinase, such as a lox? site, are 

preferably located within the nucleotide sequence (b) at suitable locations bordering the nucleotide 
sequence for which the conditional excision is sought. In one specific embodiment, two lox? sites 
are located at each side of the positive selection marker sequence, in order to allow its excision at a 
desired time after the occurrence of the homologous recombination event. 

30 In a preferred embodiment of a method using the third DNA construct described above, the 

excision of the polynucleotide fi-agment bordered by the two sites recognized by a recombinase, 
preferably two loxP sites, is performed at a desired time, due to the presence within the genome of 
the recombinant host cell of a sequence encoding the Cre enzyme operably linked to a promoter 
sequence, preferably an inducible promoter, more preferably a tissue-specific promoter sequence 

35 and most preferably a promoter sequence which is both inducible and tissue-specific, such as 
described by Gu et a/.(1994). 

The presence of the Cre enzyme within the genome of the recombinant cell host may result 
from the breeding of two transgenic animals, the first transgenic animal bearing the GENSET- 
derived sequence of interest containing the lox? sites as described above and the second transgenic 
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animal bearing the Cre coding sequence operably linked to a suitable promoter sequence, such as 
described by Gu et a/. (1994). 

Spatio-temporal control of the Cre enzyme expression may also be achieved with an 
adenovirus based vector that contains the Cre gene thus allowing infection of cells, or in vivo ^ 
5 infection of organs, for delivery of the Cre enzyme, such as described by Anton and Graham (1995) 
and Kanegae et ai (1995), which disclosures are hereby incorporated by reference in their 
entireties. 

The DNA constructs described above may be used to introduce a desired nucleotide 
sequence of the invention, preferably a GENSET genomic sequence or a GENSET cDNA sequence, 
1 0 and most preferably an altered copy of a GENSET genomic or cDNA sequence, within a 

predetermined location of the targeted genome, leading either to the generation of an altered copy of 
a targeted gene (knock-out homologous recombination) or to the replacement of a copy of the 
targeted gene by another copy sufficiently homologous to allow an homologous recombination 
event to occur (knock-in homologous recombination). 

15 Modifying Gensetpol ypoptwe expression and/ok biological activity 

Modifying endogenous GENSET expression and/or biological activity is expressly 
contemplated by the present invention. 

Screening for compounds that modulate GENSET expression and/or biological activity 

The present invention further relates to compounds able to modulate GENSET expression 
20 and/or biological activity and methods to use these compounds. Such compounds may interact with 
the regulatory sequences of GENSET genes or they may interact with GENSET polypeptides 
directly or indirectly. 

Compounds Interacting With GENSET Repulatorv Sequences 

The present invention also concerns a method for screening substances or molecules that 
25 are able to interact with the regulatory sequences of a GENSET gene, such as for example promoter 
or enhancer sequences in untranscribed regions of the genomic DNA, as determined using any 
techniques known to those skilled in the art including those described in the section entitled 
"Identification of Promoters in Cloned Upstream Sequences, or such as regulatory sequences 
located in untranslated regions of GENSET mRNA. 
30 Sequences within untranscribed or untranslated regions of polynucleotides of the invention 

may be identified by comparison to databases containing known regulatory sequence such as 
transcription start sites, transcription factor binding sites, promoter sequences, enhancer sequences, 
5'UTR and 3'UTR elements (Pesole ei ai, 2000; http://igs-server.cnrs- 
mrs.fr/~gauthere/UTR/index.html). Alternatively, the regulatory sequences of interest may be 
35 identified through conventional mutagenesis or deletion analyses of reporter plasmids using, for 
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instance, techniques described in the section entitled "Identification of Promoters in Cloned 
Upstream Sequences". 

Following the identification of potential GENSET regulatory sequences, proteins which 
interact with these regulatory sequences may be identified as described below. 
5 Gel retardation assays may be performed independently in order to screen candidate 

molecules that are able to interact with the regulatory sequences of the GENSET gene, such as 
described by Fried and Crothers (1981), Gamer and Revzin (1981) and Dent and Latchman (1993), 
the teachings of these publications being herein incorporated by reference. These techniques are 
based on the principle according to which a DNA or mRNA fragment which is bound to a protein 

10 migrates slower than the same unbound DNA or mRNA fragment. Briefly, the target nucleotide 
sequence is labeled. Then the labeled target nucleotide sequence is brought into contact with either 
a total nuclear extract from cells containing regulation factors, or with different candidate molecules 
to be tested. The interaction between the target regulatory sequence of the GENSET gene and the 
candidate molecule or the regulation factor is detected after gel or capillary electrophoresis through 

15 a retardation in the migration. 

Nucleic acids encoding proteins which are able to interact with the promoter sequence of 
the GENSET gene, more particularly a nucleotide sequence selected from the group consisting of 
the polynucleotides of the S' and 3' regulatory region or a fragment or variant thereof, may be 
identified by using a one-hybrid system, such as that described in the booklet enclosed in the 

20 Matchmaker One*Hybrid System kit from Clontech (Catalog Ref no. K1603-1, the technical 

teachings of which are herein incorporated by reference). Briefly, the target nucleotide sequence is 
cloned upstream of a selectable reporter sequence and the resulting polynucleotide construct is 
integrated in the yeast genome (Saccharomyces cerevisiae). Preferably, multiple copies of the 
target sequences are inserted into the reporter plasmid in tandem. The yeast cells containing the 

25 reporter sequence in their genome are then transformed with a library comprising fusion molecules 
between cDNAs encoding candidate proteins for binding onto the regulatory sequences of the 
GENSET gene and sequences encoding the activator domain of a yeast transcription factor such as 
GAM. The recombinant yeast cells are plated in a culture broth for selecting cells expressing the 
reporter sequence. The recombinant yeast cells thus selected contain a fusion protein that is able to 

30 bind onto the target regulatory sequence of the GENSET gene. Then, the cDNAs encoding the 
fusion proteins are sequenced and may be cloned into expression or transcription vectors in vitro. 
The binding of the encoded polypeptides to the target regulatory sequences of the GENSET gene 
may be confirmed by techniques familiar to the one skilled in the art, such as gel retardation assays 
or DNAse protection assays. 

35 Lieands interacting with GENSET polypeptides 

For the purpose of the present invention, a ligand means a molecule, such as a protein, a 
peptide, an antibody or any synthetic chemical compound capable of binding to a GENSET protein 
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or one of its fragments or variants or to modulate the expression of the polynucleotide coding for 
GENSET or a fragment or variant thereof. 

In the ligand screening method according to the present invention, a biological sample or a 
defined molecule to be tested as a putative ligand of a GENSET protein is brought into contact with 
S the corresponding purified GENSET protein, for example the corresponding purified recombinant 
GENSET protein produced by a recombinant cell host as described herein, in order to form a 
complex between this protein and the putative ligand molecule to be tested. 

As an illustrative example, to study the interaction of a GENSET protein, or a fragment 
comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more 

10 preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of a polypeptide selected from the 
group consisting of sequences of SEQ ID NOs: 170-338, 456-560, 785-918 and polypeptides 
encoded by the clone inserts of the deposited clone pool, with drugs or small molecules, such as 
molecules generated through combinatorial chemistry approaches, the microdialysis coupled to 
HPLC method described by Wang et aL (1997) or the affinity capillary electrophoresis method 

IS described by Bush et al. (1997), the disclosures of which are incorporated by reference, can be used. 
In further methods, peptides, drugs, fatty acids, lipoproteins, or small molecules which 
interact with a GENSET protein, or a fragment comprising a contiguous span of at least 6 amino 
acids, preferably at least 8 to 10 amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 
100 amino acids of a polypeptide selected from the group consisting of sequences of SEQ ID 

20 NOs: 1 70-338, 456-560, 785-9 1 8 and polypeptides encoded by the clone inserts of the deposited 
clone pool may be identified using assays such as the following. The molecule to be tested for 
binding is labeled with a detectable label, such as a fluorescent, radioactive, or enzymatic tag and 
placed in contact with immobilized GENSET protein, or a fragment thereof under conditions which 
permit specific binding to occur. After removal of non-specifically bound molecules, bound 

25 molecules are detected using appropriate means. 

Various candidate substances or molecules can be assayed for interaction with a GENSET 
polypeptide. These substances or molecules include, without being limited to, natural or synthetic 
organic compounds or molecules of biological origin such as polypeptides. When the candidate 
substance or molecule comprises a polypeptide, this polypeptide may be the resulting expression 

30 product of a phage clone belonging to a phage-based random peptide library, or alternatively the 
polypeptide may be the resulting expression product of a cDNA library cloned in a vector suitable 
for performing a two-hybrid screening assay. 

A, Candidate ligands obtained from random peptide libraries 

In a particular embodiment of the screening method, the putative ligand is the expression 
35 product of a DNA insert contained in a phage vector (Parmley and Smith, 1988). Specifically, 
random peptide phages libraries are used. The random DNA inserts encode for peptides of 8 to 20 
amino acids in length (Oldenburg et a/., 1 992; Valadon et al,, 1 996; Lucas, 1 994; Westerink, 1 995; 
Felici et al., 1991), which disclosures are hereby incorporated by reference in their entireties. 
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According to this particular embodiment, the recombinant phages expressing a protein that binds to 
an immobilized GENSET protein is retained and the complex formed between the GENSET protein 
and the recombinant phage may be subsequently immunoprecipitated by a polyclonal or a 
monoclonal antibody directed against the GENSET protein. 
S Once the ligand library in recombinant phages has been constructed, the phage population 

is brought into contact with the immobilized GENSET protein. Then the preparation of complexes 
is washed in order to remove the non-specifically bound recombinant phages. The phages that bind 
specifically to the GENSET protein are then eluted by a buffer (acid pH) or immunoprecipitated by 
the monoclonal antibody produced by the hybridoma anti-GENSET, and this phage population is 

10 subsequently amplified by an over-infection of bacteria (for example E. coli). The selection step 
may be repeated several times, preferably 2-4 times, in order to select the more specific 
recombinant phage clones. The last step comprises characterizing the peptide produced by the 
selected recombinant phage clones either by expression in infected bacteria and isolation, 
expressing the phage insert in another host-vector system, or sequencing the insert contained in the 

1 5 selected recombinant phages. 

A Candidate ligands obtained by competition experiments. 

Alternatively, peptides, drugs or small molecules which bind to a GENSET protein or 
fragment thereof comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 
amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of a polypeptide 

20 selected from the group consisting of sequences of SEQ ID NOs: 170-338, 456-560, 785-918 and 
polypeptides encoded by the clone inserts of the deposited clone pool, may be identified in 
competition experiments. In such assays, the GENSET protein, or a fragment thereof, is 
immobilized to a surface, such as a plastic plate. Increasing amounts of the peptides, drugs or small 
molecules are placed in contact with the immobilized GENSET protein, or a fragment thereof, in 

25 the presence of a detectable labeled known GENSET protein ligand. For example, the GENSET 
ligand may be detectably labeled with a fluorescent, radioactive, or enzymatic tag. The ability of 
the test molecule to bind the GENSET protein, or a fragment thereof, is determined by measuring 
the amount of detectably labeled known ligand bound in the presence of the test molecule. A 
decrease in the amount of known ligand bound to the GENSET protein, or a fragment thereof, when 

30 the test molecule is present indicated that the test molecule is able to bind to the GENSET protein, 
or a fragment thereof. 

C. Candidate ligands obtained by affinity chromatography. 

Proteins or other molecules interacting with a GENSET protein, or a fragment thereof 
comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 amino acids, more 
35 preferably at least 1 2, 15, 20, 25, 30, 40, 50, or 1 00 amino acids of a polypeptide selected from the 
group consisting of sequences of SEQ ID NOs: 1 70-338, 456-560, 785-91 8 and polypeptides 
encoded by the clone inserts of the deposited clone pool, can also be found using affinity columns 
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which contain the GENSET protein, or a fragment thereof. The GENSET protein, or a fragment 
thereof, may be attached to the column using conventional techniques including chemical coupling 
to a suitable column matrix such as agarose, Affi Gel® , or other matrices familiar to those of skill 
in art. In some embodiments of this method, the affmity column contains chimeric proteins in 
S which the GENSET protein, or a fragment thereof, is iiised to glutathion S transferase (GST). A 
mixture of cellular proteins or poo! of expressed proteins as described above is applied to the 
affinity column. Proteins or other molecules interacting with the GENSET protein, or a fragment 
thereof, attached to the column can then be isolated and analyzed on 2-0 electrophoresis gel as 
described in Ramunsen et al, (1997), the disclosure of which is incorporated by reference. 
1 0 Alternatively, the proteins retained on the affinity column can be purified by electrophoresis based 
methods and sequenced. The same method can be used to isolate antibodies, to screen phage 
display products, or to screen phage display human antibodies. 

D. Candidate ligands obtained by optical biosensor methods 

Proteins interacting with a GENSET protein, or a fragment comprising a contiguous span of 

I S at least 6 amino acids, preferably at least 8 to 10 amino acids, more preferably at least 12, IS, 20, 
25, 30, 40, SO, or 100 amino acids of a polypeptide selected from the group consisting of sequences 
of SEQ ID NOs: 170-338, 4S6-S60, 785-918 and polypeptides encoded by the clone inserts of the 
deposited clone pool, can also be screened by using an Optical Biosensor as described in Edwards 
and Leatherbarrow (1997) and also in Szabo et al, (1995), the disclosures of which are incorporated 

20 by reference. This technique permits the detection of interactions between molecules in real time, 
without the need of labeled molecules. This technique is based on the surface plasmon resonance 
(SPR) phenomenon. Briefly, the candidate ligand molecule to be tested is attached to a surface 
(such as a carboxymethyl dextran matrix). A light beam is directed towards the side of the surface 
that does not contain the sample to be tested and is reflected by said surface. The SPR phenomenon 

25 causes a decrease in the intensity of the reflected light with a specific association of angle and 
wavelength. The binding of candidate ligand molecules cause a change in the refraction index on 
the surface, which change is detected as a change in the SPR signal. For screening of candidate 
ligand molecules or substances that are able to interact with the GENSET protein, or a fragment 
thereof, the GENSET protein, or a fragment thereof, is immobilized onto a surface. This surface 

30 comprises one side of a cell through which flows the candidate molecule to be assayed. The 
binding of the candidate molecule on the GENSET protein, or a fragment thereof, is detected as a 
change of the SPR signal. The candidate molecules tested may be proteins, peptides, carbohydrates, 
lipids, or small molecules generated by combinatorial chemistry. This technique may also be 
performed by immobilizing eukaryotic or prokaryotic cells or lipid vesicles exhibiting an 

35 endogenous or a recombinantly expressed GENSET protein at their surface. 

The main advantage of the method is that it allows the determination of the association rate 
between the GENSET protein and molecules interacting with the GENSET protein. It is thus 
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possible to select specifically ligand molecules interacting with the GENSET protein, or a fragment 
thereof, through strong or conversely weak association constants. 

E. Candidate ligands obtained through a two-hybrid screening assay. 

The yeast two-hybrid system is designed to study protein-protein interactions in vivo (Fields 
5 and Song, 1989), which disclosure is hereby incorporated by reference in its entirety, and relies 
upon the fiision of a bait protein to the DNA binding domain of the yeast Gal4 protein. This 
technique is also described in the US Patent No. US 5,667,973 and the US Patent No. 5,283,173, the 
technical teachings of both patents being herein incorporated by reference. 

The general procedure of library screening by the two-hybrid assay may be performed as 
10 described by Harper et al (1 993) or as described by Cho et aL (1998) or also Fromont-Racine et al 
(1997), which disclosures are hereby incorporated by reference in their entireties. 

The bait protein or polypeptide comprises, consists essentially of, or consists of a GENSET 
polypeptide or a fragment thereof comprising a contiguous span of at least 6 amino acids, preferably 
at least 8 to 10 amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids 
15 of a polypeptide selected from the group consisting of sequences of SEQ ID NOs: 1 70-33 8, 45 6- 
560, 785-918 and polypeptides encoded by the clone inserts of the deposited clone pool. 

More precisely, the nucleotide sequence encoding the GENSET polypeptide or a fragment 
or variant thereof is fused to a polynucleotide encoding the DNA binding domain of the GAL4 
protein, the fused nucleotide sequence being inserted in a suitable expression vector, for example 
20 pAS2orpM3. 

Then, a human cDNA library is constructed in a specially designed vector, such that the 
human cDNA insert is fused to a nucleotide sequence in the vector that encodes the transcriptional 
domain of the GAL4 protein. Preferably, the vector used is the pACT vector. The polypeptides 
encoded by the nucleotide inserts of the human cDNA library are termed "prey" polypeptides. 
25 A third vector contains a detectable marker gene, such as beta galactosidase gene or CAT 

gene that is placed under the control of a regulation sequence that is responsive to the binding of a * 
complete Gal4 protein containing both the transcriptional activation domain and the DNA binding 
domain. For example, the vector pGSEC may be used. 

Two different yeast strains are also used. As an illustrative but non limiting example the 
30 two different yeast strains may be the followings : 

. Y190, the phenotype of which is (MATa, Leu2-3, 1 12 ura3-l2, trpl-901, his3-D200, 
ade2-101, ga!4Dgall80D URA3 GAL-LacZ, LYS G^I-H1S3, cyh'); 

- Y187, the phenotype of which is (MATa gal4 gal80 his3 trpl-901 ade2-101 ura3-52 leu2- 
3, -1 1 2 URA3 GAL-lacZmef), which is the opposite mating type of Y 1 90. 
35 Briefly, 20 ^g of pAS2/GENSET and 20 \xg of pACT-cDNA libraiy are co-transformed 

into yeast strain Y190. The transformants are selected for growth on minimal media lacking 
histidine, leucine and tryptophan, but containing the histidine synthesis inhibitor 3-AT (50 mM). 
Positive colonies are screened for beta galactosidase by filter lift assay. The double positive 
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colonies (His*, beta-gar) are then grown on plates lacking histidine, leucine, but containing 
tryptophan and cycloheximide (10 mg/ml) to select for loss of pAS2/GENSET plasmids but 
retention of pACT-cDNA library plasmids. The resuhing Y190 strains are mated with Y187 strains 
expressing GENSET or non-related control proteins; such as cyclophilin B, iamin, or SNFl, as Gal4 
5 fusions as described by Harper e( al ( 1 993) and by Bram et al ( 1 993), which disclosures are hereby 
incorporated by reference in their entireties, and screened for beta galactosidase by filter lift assay. 
Yeast clones that are beta gal- after mating with the control Gal4 fusions are considered false 
positives. 

In another embodiment of the two-hybrid method according to the invention, interaction 
1 0 between the GENSET or a fragment or variant thereof with cellular proteins may be assessed using 
the Matchmaker Two Hybrid System 2 (Catalog No. K 1604-1, Clontech). As described in the 
manual accompanying the kit, the disclosure of which is incorporated herein by reference, nucleic 
acids encoding the GENSET protein or a portion thereof, are inserted into an expression vector such 
that they are in frame with DNA encoding the DNA binding domain of the yeast transcriptional 
1 5 activator GAL4. A desired cDNA, preferably human cDN A, is inserted into a second expression 
vector such that they are in frame with DNA encoding the activation domain of GAL4. The two 
expression plasmids are transformed into yeast and the yeast are plated on selection medium which 
selects for expression of selectable markers on each of the expression vectors as well as GAL4 
dependent expression of the HIS3 gene. Transformants capable of growing on medium lacking 
20 histidine are screened for GAM dependent lacZ expression. Those cells which are positive in both the 
histidine selection and the lacZ assay contain interaction between GENSET and the protein or peptide 
encoded by the initially selected cDNA insert. 

Compounds Modulating GENSET biological activity 

Another method of screening for compounds that modulate GENSET expression and/or 
25 biological activity is by measuring the effects of test compounds on specific biological activity, e.g. 
a GENSET biological activity in a host cell. In one embodiment, the present invention relates to a 
method of identifying an agent which alters GENSET biological activity, wherein a nucleic acid 
construct comprising a nucleic acid which encodes a mammalian GENSET polypeptide is 
introduced into a host cell. The host cells produced are maintained under conditions appropriate for 
30 expression of the encoded mammalian GENSET polypeptides, whereby the nucleic acid is 

expressed. The host cells are then contacted with a compound to be assessed (an "agent," or "test 
agent"), and the properties of the cells is assessed. Detection of a change in any GENSET 
polypeptide-associated property in the presence of the agent indicates that the agent alters GENSET 
activity. In a particular embodiment, the invention relates to a method of identifying an agent 
35 which is an activator of GENSET activity, wherein detection of an increase of any GENSET 
polypeptide-associated property in the presence of the agent indicates that the agent activates 
GENSET activity. In another particular embodiment, the invention relates to a method of 
identifying an agent which is an inhibitor of GENSET activity, wherein detection of a decrease of 
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any GENSET polypeptide-associated property in the presence of the agent indicates that the agent 
inhibits GENSET activity. 

in a particular embodiment, a high throughput screen can be used to identify agents that 
activate (enhance) or inhibit GENSET activity (See e.g., PCT publication WO 98/45438, which 
S disclosure is hereby incorporated by reference in its entirety). For example, the method of 
identifying an agent which alters GENSET activity can be performed as follows. A nucleic acid 
construct comprising a polynucleotide which encodes a mammalian GENSET polypeptide is 
introduced into a host cell to produce recombinant host cells. The recombinant host cells are then 
maintained under conditions appropriate for expression of the encoded mammalian GENSET 

1 0 polypeptide, whereby the nucleic acid is expressed. The compound to be assessed is added to the 
recombinant host cells; the resulting combination is referred to as a test sample. A detectable, 
GENSET polypeptide-associated property of the cells is detected. A control can be used in the 
methods of detecting agents which alter GENSET activity. For example, the control sample 
includes the same reagents but lacks the compound or agent being assessed; it is treated in the same 

1 S manner as the test sample. 

Methods of Screening for Compounds Modulating GENSET Expression and/or Activity 

The present invention also relates to methods of screening compounds for their ability to 
modulate (e.g. increase or inhibit) the activity or expression of GENSET. More specifically, the 
present invention relates to methods of testing compounds for their ability either to increase or to 
20 decrease expression or activity of GENSET. The assays are performed in vitro or in vivo. 

In vitro methods 

In vitro, cells expressing GENSET polypeptides are incubated in the presence and absence 
of the test compound. By determining the level of GENSET expression in the presence of the test 
compound or the level of GENSET activity in the presence of the test compound, compounds can 

25 be identified that suppress or enhance GENSET expression or activity. Alternatively, constructs 
comprising a GENSET regulatory sequence operably linked to a reporter gene (e.g. iuciferase, 
chloramphenicol acetyl transferase, LacZ, green fluorescent protein, etc.) can be introduced into 
host cells and the effect of the test compounds on expression of the reporter gene detected. Cells 
suitable for use in the foregoing assays include, but are not limited to, cells having the same origin 

30 as tissues or cell lines in which the polypeptide is known to be expressed using the data from Tables 
III, IV, orV. 

Consequently, the present invention encompasses a method for screening molecules that 
modulate the expression of a GENSET gene, said screening method comprising the steps of: 

a) cultivating a prokaryotic or an eukaryotic cell that has been transfected with a nucleotide 
35 sequence encoding a GENSET protein or a variant or a fragment thereof, placed under the control 

of its own promoter; 

b) bringing into contact said cultivated cell with a molecule to be tested; 
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c) quantifying the expression of said GENSET protein or a variant or a fragment thereof in 
the presence of said molecule. 

Using DNA recombination techniques well known by the one skill in the art, the GENSET 
protein encoding DNA sequence is inserted into an expression vector, downstream from its 
S promoter sequence. As an illustrative example, the promoter sequence of the GENSET gene is 
contained in the untranscribed region of the GENSET genomic DNA. 

The quantification of the expression of a GENSET protein may be realized either at the 
mRNA level (using for example Northen blots, RT-PCR, preferably quantitative RT-PCR with 
primers and probes specific for the GENSET mRNA of interest) or at the protein level (using 
10 polyclonal or monoclonal antibodies in immunoassays such as ELISA or RIA assays, Western blots, 
or immunochemistry). 

The present invention also concerns a method for screening substances or molecules that 

are able to increase, or in contrast to decrease, the level of expression of a GENSET gene. Such a 

method may allow the one skilled in the art to select substances exerting a regulating effect on the 
15 expression level of a GENSET gene and which may be useful as active ingredients included in 

pharmaceutical compositions for treating patients suffering from disorders associated with abnormal 

levels of GENSET products. 

Thus, another part of the present invention is a method for screening a candidate molecule 

that modulates the expression of a GENSET gene, this method comprises the following steps: 
20 a) providing a recombinant cell host containing a nucleic acid, wherein said nucleic acid 

comprises a GENSET 5' regulatory region or a regulatory active fragment or variant thereof, 

operably linked to a polynucleotide encoding a detectable protein; 

b) obtaining a candidate molecule; and 

c) determining the ability of said candidate molecule to modulate the expression levels of 
25 said polynucleotide encoding the detectable protein. 

In a further embodiment, said nucleic acid comprising a GENSET 5' regulatory region or a 
regulatory active fragment or variant thereof, includes the 5'UTR region of a GENSET cDNA 
selected from the group comprising of the 5'UTRs of the sequences of SEQ ID NOs: 1 - 1 69, 339- 
4SS, 561-784, sequences of clones inserts of the deposited clone pool, regulatory active fragments 
30 and variants thereof. In a more preferred embodiment of the above screening method, said nucleic 
acid includes a promoter sequence which is endogenous with respect to the GENSET 5'UTR 
sequence. In another more preferred embodiment of the above screening method, said nucleic acid 
includes a promoter sequence which is exogenous with respect to the GENSET 5'UTR sequence 
defmed therein. 

35 Preferred polynucleotides encoding a detectable protein are polynucleotides encoding beta 

galactosidase, green fluorescent protein (GFP) and chloramphenicol acetyl transferase (CAT). 

The invention further relates to a method for the production of a pharmaceutical 
composition comprising a method of screening a candidate molecule that modulates the expression 
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of a GENSET gene and furthermore mixing the identified molecule with a pharmaceutical ly 
acceptable carrier. 

The invention also pertains to kits for the screening of a candidate substance modulating the 
expression of a GENSET gene. Preferably, such kits comprise a recombinant vector that allows the 
5 expression of a GENSET 5' regulatory region or a regulatory active fragment or a variant thereof, 
operably linked to a polynucleotide encoding a detectable protein or a GENSET protein or a 
fragment or a variant thereof. More preferably, such kits include a recombinant vector that 
comprises a nucleic acid including the 5'UTR region of a GENSET cDNA selected from the group 
comprising the 5'UTRs of the sequences of SEQ ID NOs:l-169, 339-455, 561-784, sequences of 
10 clone inserts of the deposited clone pool, regulatory active fragments and variants thereof, being 
operably linked to a polynucleotide encoding a detectable protein. 

For the design of suitable recombinant vectors useful for performing the screening methods 
described above, it will be referred to the section of the present specification wherein the preferred 
recombinant vectors of the invention are detailed, 
1 5 Another object of the present invention comprises methods and kits for the screening of 

candidate substances that interact with a GENSET polypeptide, fragments or variants thereof. By 
their capacity to bind covalently or non-covalently to a GENSET protein, fragments or variants 
thereof, these substances or molecules may be advantageously used both in vitro and in vivo. 

In vitro, said interacting molecules may be used as detection means in order to identify the 
20 presence of a GENSET protein in a sample, preferably a biological sample. 

A method for the screening of a candidate substance that interact with a GENSET 
polypeptide, fragments or variants thereof, said methods comprising the following steps: 

a) providing a polypeptide comprising, consisting essentially of, or consisting of a 
GENSET protein or a fragment comprising a contiguous span of at least 6 amino acids, preferably 

25 at least 8 to 1 0 amino acids, more preferably at least 1 2, 1 5, 20, 25, 30, 40, 50, or 100 amino acids 
of a polypeptide selected from the group consisting of sequences of SEQ ID NOs: 170-338, 456- 
560, 785-918 and polypeptides encoded by the clone inse^rts of the deposited clone pool; 

b) obtaining a candidate substance; 

c) bringing into contact said polypeptide with said candidate substance; 

30 d) detecting the complexes formed between said polypeptide and said candidate substance. 

The invention further relates to a method for the production of a pharmaceutical 
composition comprising a method for the screening of a candidate substance that interact with a 
GENSET polypeptide, fragments or variants thereof and furthermore mixing the identified 
substance with a pharmaceutically acceptable carrier. 

35 The invention further concerns a kit for the screening of a candidate substance interacting 

with the GENSET polypeptide, wherein said kit comprises: 

a) a polypeptide comprising, consisting essentially of, or consisting of a GENSET protein 
or a fragment comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 
amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of a polypeptide 
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selected from the group consisting of sequences of SEQ ID NOs: 170-338, 456-560, 785-918 and 
polypeptides encoded by the clone inserts of the deposited clone pool; and 

b) optionally means useful to detect the complex formed between said polypeptide or a 
variant thereof and the candidate substance. 
S In a preferred embodiment of the kit described above, the detection means comprises a 

monoclonal or polyclonal antibody binding to said GENSET protein or fragment or variant thereof. 

In vivo methods 

Compounds that suppress or enhance GENSET expression can also be identified using in 
vivo screens. In these assays, the test compound is administered (e.g. IV, IP, IM, orally, or 

10 otherwise), to the animal, for example, at a variety of dose levels. The effect of the compound on 
GENSET expression is determined by comparing GENSET levels, for example in tissues known to 
express the gene of interest using, for example the data obtained in Tables III, IV, or V, and using 
Northern blots, immunoassays, PGR, etc., as described above. Suitable test animals include, but are 
not limited to, rodents (e.g., mice and rats), primates, and rabbits. Humanized mice can also be 

1 5 used as test animals, that is mice in which the endogenous mouse protein is ablated (knocked out) 
and the homologous human protein added back by standard transgenic approaches. Such mice 
express only the human form of a protein. Humanized mice expressing only the human GENSET 
can be used to study in vivo responses to potential agents regulating GENSET protein or mRNA 
levels. As an example, transgenic mice have been produced carrying the human apoE4 gene. They 

20 are then bred with a mouse line that lacks endogenous apoE, to produce an animal model carrying 
human proteins believed to be instrumental in development of Alzheimer's pathology. Such 
transgenic animals are useful for dissecting the biochemical and physiological steps of disease, and 
for development of therapies for disease intervention (Loring, et al, 1 996) (incorporated herein by 
reference in its entirety). 

25 Uses for compounds modulating GENSET expression and/or biological activity 

Using in vivo (or in vitro) systems, it may be possible to identify compounds that exert a 
tissue specific effect, for example, that increase GENSET expression or activity only in tissues of 
interest, such as the adrenal gland, bone marrow, brain, cerebellum, colon, fetal brain, fetal kidney, 
fetal liver, heart, hypertrophic prostate, kidney, liver, lung, lymph ganglia, lymphocytes, muscle, 

30 ovary, pancreas, pituitary gland, placenta, prostate, salivary gland, spinal cord, spleen, stomach, 
intestine, substantia nigra, testis, thyroid, umbilical cord, and uterus. Screening procedures such as 
those described above are also useful for identifying agents for their potential use in 
pharmacological intervention strategies. Agents that enhance GENSET gene expression or 
stimulate its activity may thus be used to induce any phenotype associated with a GENSET gene, or 

35 to treat disorders resulting from a deficiency of a GENSET polypeptide activity or expression. 
Compounds that suppress GENSET polypeptide expression or inhibit its activity can be used to 
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treat any disease or condition associated with increased or deleterious GENSET polypeptide activity 
or expression. 

Also encompassed by the present invention is an agent which interacts with a GENSET 
gene or polypeptide directly or indirectly, and inhibits or enhances GENSET polypeptide expression 

S and/or function. In one embodiment, the agent is an inhibitor which interferes with a GENSET 
polypeptide directly (e.g., by binding the GENSET polypeptide) or indirectly (e.g., by blocking the 
ability of the GENSET polypeptide to have a GENSET biological activity). In a particular 
embodiment, an inhibitor of a GENSET protein is an antibody specific for the GENSET protein or a 
functional portion of the GENSET protein; that is, the antibody binds a GENSET polypeptide. For 

10 example, the antibody can be specific for a polypeptide encoded by one of the nucleic acid 

sequences of human GENSETs (SEQ ID NOs: 1-169, 339-455, 561-784), a mammalian GENSET 
nucleic acid, or portions thereof Alternatively, the inhibitor can be an agent other than an antibody 
(e.g., small organic molecule, protein or peptide) which binds the GENSET polypeptide and blocks 
its activity. For example, the inhibitor can be an agent which mimics the GENSET polypeptide 

15 structurally, but lacks its function. Alternatively, it can be an agent which binds to or interacts with 
a molecule which the GENSET polypeptide normally binds to or interacts with, thus blocking the 
GENSET polypepetide from doing so and preventing it from exerting the effects it would normally 
exert. 

In another embodiment, the agent is an enhancer (activator) of a GENSET polypeptide 

20 which increases the activity of the GENSET polypeptide (increases the effect of a given amount or 
level of GENSET), increases the length of time it is effective (by preventing its degradation or 
otherwise prolonging the time during which it is active) or both either directly or indirectly. For 
example, GENSET polynucleotides and polypeptides can be used to identify drugs which increase 
or decrease the ability of GENSET polypeptides to induce GENSET biological activity, which 

25 drugs are useful for the treatment or prevention of any disease or condition associated with a 
GENSET biological activity. 

The GENSET sequences of the present invention can also be used to generate nonhuman 
gene knockout animals, such as mice, which lack a GENSET gene or transgenically overexpress a 
GENSET gene. For example, such GENSET gene knockout mice can be generated and used to 

30 obtain further insight into the function of the GENSET gene as well as assess the specificity of 
GENSET activators and inhibitors. Also, over expression of the GENSET gene (e.g., a human 
GENSET gene) in transgenic mice can be used as a means of creating a test system for GENSET 
activators and inhibitors (e.g., against a human GENSET polypeptide). In addition, the GENSET 
gene can be used to clone the GENSET promoter/enhancer in order to identify regulators of 

35 GENSET gene transcription. GENSET gene knockout animals include animals which completely 
or partially lack the GENSET gene and/or GENSET activity or ftjnction. Thus the present 
invention relates to a method of inhibiting (partially or completely) a GENSET biological activity in 
a mammal (e.g., a human), the method comprising administering to the mammal an effective 
amount of an inhibitor of a GENSET polypeptide or polynucleotide. The invention also relates to a 
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method of enhancing a GENSET biological activity in a mammal, the method comprising 
administering to the mammal an effective amount of an enhancer of a GENSET polypeptide or 
polynucleotide. 

Inhibiting GENSET gene expression 

S Therapeutic compositions according to the present invention may comprise advantageously 

one or several GENSET oligonucleotide fragments as an antisense tool or a triple helix tool that 
inhibits the expression of the corresponding GENSET gene. 

Antisense Approach 

In antisense approaches, nucleic acid sequences complementary to an mRNA are 

10 hybridized to the mRNA intracellularly, thereby blocking the expression of the protein encoded by 
the mRNA. The antisense nucleic acid molecules to be used in gene therapy may be either DNA or 
RNA sequences. Preferred methods using antisense polynucleotide according to the present 
invention are the procedures described by Sczakiel et a/.(199S), which disclosure is hereby 
incorporated by reference in its entirety. 

1 5 Preferably, the antisense tools are chosen among the polynucleotides (1 5-200 bp long) that 

are complementary to GENSET mRNA, more preferably to the 5 'end of the GENSET mRNA. In 
another embodiment, a combination of different antisense polynucleotides complementary to 
different parts of the desired targeted gene are used. 

Other preferred antisense polynucleotides according to the present invention are sequences 

20 complementary to either a sequence of GENSET mRNAs comprising the translation initiation 
codon ATG or a sequence of GENSET genomic DNA containing a splicing donor or acceptor site. 

Preferably, the antisense polynucleotides of the invention have a 3' polyadenylation signal 
that has been replaced with a self-cleaving ribozyme sequence, such that RNA polymerase II 
transcripts are produced without poly(A) at their 3' ends, these antisense polynucleotides being 

25 incapable of export from the nucleus, such as described by Liu et a/.(I994), which disclosure is 
hereby incorporated by reference in its entirety. In a preferred embodiment, these GENSET 
antisense polynucleotides also comprise, within the ribozyme cassette, a histone stem-loop structure 
to stabilize cleaved transcripts against 3'-5' exonucleolytic degradation, such as the structure 
described by Eckner et a/.(1991), which disclosure is hereby incorporated by reference in its 

30 entirety. 

The antisense nucleic acids should have a length and melting temperature sufficient to 
permit formation of an intracellular duplex having sufficient, stability to inhibit the expression of the 
GENSET mRNA in the duplex. Strategies for designing antisense nucleic acids suitable for use in 
gene therapy are disclosed in Green et al, (1986) and Izant and Weintraub, (1984), the disclosures 
35 of which are incorporated herein by reference. 

In some strategies, antisense molecules are obtained by reversing the orientation of the 
GENSET coding region with respect to a promoter so as to transcribe the opposite strand from that 
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which is normally transcribed in the cell. The antisense molecules may be transcribed using in vitro 
transcription systems such as those which employ T7 or SP6 polymerase to generate the transcript. 
Another approach involves transcription of GENSET antisense nucleic acids in vivo by operably 
linking DNA containing the antisense sequence to a promoter in a suitable expression vector. 
5 Alternatively, oligonucleotides which are complementary to the strand normally transcribed 

in the cell may be synthesized in vitro. Thus, the antisense nucleic acids are complementary to the 
corresponding mRNA and are capable of hybridizing to the mRNA to create a duplex. In some 
embodiments, the antisense sequences may contain modified sugar phosphate backbones to increase 
stability and make them less sensitive to RNase activity. Examples of modifications suitable for use 

10 in antisense strategies include 2' 0-methyl RNA oligonucleotides and Protein-nucleic acid (PNA) 
oligonucleotides. Further examples are described by Rossi et al^ (1991), which disclosure is hereby 
incorporated by reference in its entirety. 

Various types of antisense oligonucleotides complementary to the sequence of the GENSET 
cDNA or genomic DNA may be used. In one preferred embodiment, stable and semi-stable 

1 5 antisense oligonucleotides described in International Application No. PCT WO94/23026, hereby 
incorporated by reference, are used. In these molecules, the 3' end or both the 3' and 5' ends are 
engaged in intramolecular hydrogen bonding between complementary base pairs. These molecules 
are better able to withstand exonuclease attacks and exhibit increased stability compared to 
conventional antisense oligonucleotides. 

20 In another preferred embodiment, the antisense oligodeoxy nucleotides against herpes 

simplex virus types 1 and 2 described in International Application No. WO 95/04141, hereby 
incorporated by reference, are used. 

In yet another preferred embodiment, the covalently cross-linked antisense oligonucleotides 
described in International Application No. WO 96/31523, hereby incorporated by reference, are 

25 used. These double- or single-stranded oligonucleotides comprise one or more, respectively, inter- 
or intra-oligonucleotide covalent cross-linkages, wherein the linkage consists of an amide bond 
between a primary amine group of one strand and a carboxyl group of the other strand or of the 
same strand, respectively, the primary amine group being directly substituted in the 2* position of 
the strand nucleotide monosaccharide ring, and the carboxyl group being carried by an aliphatic 

30 spacer group substituted on a nucleotide or nucleotide analog of the other strand or the same strand, 
respectively. 

The antisense oligodeoxynucleotides and oligonucleotides disclosed in International 
Application No. WO 92/1 8522, incorporated by reference, may also be used. These molecules are 
stable to degradation and contain at least one transcription control recognition sequence which binds 
35 to control proteins and are effective as decoys therefor. These molecules may contain "hairpin" 
structures, "dumbbell" structures, "modified dumbbell" structures, "cross-linked" decoy structures 
and "loop" structures. 

In another preferred embodiment, the cyclic double-stranded oligonucleotides described in 
European Patent Application No. 0 572 287 A2, hereby incorporated by reference are used. These 
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ligated oligonucleotide "dumbbells" contain the binding site for a transcription factor and inhibit 
expression of the gene under control of the transcription factor by sequestering the factor. 

Use of the closed antisense oligonucleotides disclosed in International Application No. WO 
92/19732, hereby incorporated by reference, is also contemplated. Because these molecules have 
S no free ends, they are more resistant to degradation by exonucleases than are conventional 

oligonucleotides. These oligonucleotides may be multifunctional, interacting with several regions 
which are not adjacent to the target mRNA. 

The appropriate level of antisense nucleic acids required to inhibit gene expression may be 
determined using in vitro expression analysis. The antisense molecule may be introduced into the 

10 cells by diffusion, injection, infection or transfection using procedures known in the art. For 
example, the antisense nucleic acids can be introduced into the body as a bare or naked 
oligonucleotide, oligonucleotide encapsulated in lipid, oligonucleotide sequence encapsidated by 
viral protein, or as an oligonucleotide operably linked to a promoter contained in an expression 
vector. The expression vector may be any of a variety of expression vectors known in the art, 

1 5 including retroviral or viral vectors, vectors capable of extrachromosomal replication, or integrating 
vectors. The vectors may be DN A or RN A. 

The antisense molecules are introduced onto cell samples at a number of different 
concentrations preferably between 1x10'*^ to IxlO^M. Once the minimum concentration that can 
adequately control gene expression is identified, the optimized dose is translated into a dosage 

20 suitable for use in vivo. For example, an inhibiting concentration in culture of 1x10'^ translates into 
a dose of approximately 0.6 mg/kg body weight. Levels of oligonucleotide approaching 100 mg/kg 
bodyweight or higher may be possible after testing the toxicity of the oligonucleotide in laboratory 
animals. It is additionally contemplated that cells from the vertebrate are removed, treated with the 
antisense oligonucleotide, and reintroduced into the vertebrate, 

25 In a preferred application of this invention, the polypeptide encoded by the gene is first 

identified, so that the effectiveness of antisense inhibition on translation can be monitored using 
techniques that include but are not limited to antibody-mediated tests such as RIAs and ELISA, 
functional assays, or radiolabeling. 

An alternative to the antisense technology that is used according to the present invention 

30 comprises using ribozymes that will bind to a target sequence via their complementary 

polynucleotide tail and that will cleave the corresponding RNA by hydrolyzing its target site 
(namely "hammerhead ribozymes"). Briefly, the simplified cycle of a hammerhead ribozyme 
comprises (1) sequence specific binding to the target RNA via complementary antisense sequences; 
(2) site-specific hydrolysis of the cleavable motif of the target strand; and (3) release of cleavage 

35 products, which gives rise to another catalytic cycle. Indeed, the use of long-chain antisense 

polynucleotide (at least 30 bases long) or ribozymes with long antisense arms are advantageous. A 
preferred delivery system for antisense ribozyme is achieved by covaiently linking these antisense 
ribozymes to lipophilic groups or to use liposomes as a convenient vector. Preferred antisense 
ribozymes according to the present invention are prepared as described by Rossi e( ah (1991) and 
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Sczakiel et al. (1995), the specific preparation procedures being referred to in said articles being 
herein incorporated by reference. 

Triple Hehx Approach 

The GENSET genomic DNA may also be used to inhibit the expression of the GENSET 
S gene based on intracellular triple helix formation. 

Triple helix oligonucleotides are used to inhibit transcription from a genome. They are 
particularly useful for studying alterations in cell activity when it is associated with a particular 
gene. The GENSET cDNAs or genomic DNAs of the present invention or, more preferably, a 
fragment of those sequences, can be used to inhibit gene expression in individuals having diseases 
10 associated with expression of a particular gene. Similarly, a portion of the GENSET genomic DNA 
can be used to study the effect of inhibiting GENSET gene transcription within a cell. 
Traditionally, homopurine sequences were considered the most useful for triple helix strategies. 
However, homopyrimidine sequences can also inhibit gene expression. Such homopyrimidine 
oligonucleotides bind to the major groove at homopurine:homopyrimidine sequences. Thus, both 
1 5 types of sequences firom the GENSET genomic DNA are contemplated within the scope of this 
invention. 

To carry out gene therapy strategies using the triple helix approach, the sequences of the 
GENSET genomic DNA are first scanned to identify 10-mer to 20-mer homopyrimidine or 
homopurine stretches which could be used in triple-helix based strategies for inhibiting GENSET 
. 20 expression. Following identification of candidate homopyrimidine or homopurine stretches, their 
efficiency in inhibiting GENSET expression is assessed by introducing varying amounts of 
oligonucleotides containing the candidate sequences into tissue culture cells which express the 
GENSET gene. 

The oligonucleotides can be introduced into the cells using a variety of methods known to 
2S those skilled in the art, including but not limited to calcium phosphate precipitation, DEAE- 
Dextran, electroporation, liposome-mediated transfection or native uptake. 

Treated cells are monitored for altered cell function or reduced GENSET expression using 
techniques such as Northern blotting, RNase protection assays, or PGR based strategies to monitor 
the transcription levels of the GENSET gene in cells which have been treated with the 
30 oligonucleotide. The cell functions to be monitored are predicted based upon the homologies of the 
target gene corresponding to the cDNA from which the oligonucleotide was derived with known 
gene sequences that have been associated with a particular function. The cell functions can also be 
predicted based on the presence of abnormal physiology within cells derived from individuals with 
a particular inherited disease, particularly when the cDNA is associated with the disease using 
35 techniques described in the section entitled ''Identification of genes associated with hereditary 
diseases or drug response". 
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The oligonucleotides which are effective in inhibiting gene expression in tissue culture cells 

may then be introduced in vivo using the techniques and at a dosage calculated based on the in vitro 

results, as described in the section entitled "Antisense Approach". 

In some embodiments, the natural (beta) anomers of the oligonucleotide units can be 
S replaced with alpha anomers to render the oligonucleotide more resistant to nucleases. Further, an 

intercalating agent such as ethidium bromide, or the like, can be attached to the 3' end of the alpha 

oligonucleotide to stabilize the triple helix. For information on the generation of oligonucleotides 

suitable for triple helix formation see Griffin et al. (1989), which is hereby incorporated by this 

reference. 

1 0 Treating GENSET gene-related disorders 

The present invention further relates to methods, uses of GENSET polypeptides and 
polynucleotides, and uses of modulators of GENSET polypeptides and polynucleotides, for treating 
diseases/disorders associated with GENSET genes by increasing or decreasing GENSET gene 
activity and/or expression. These methodologies can be effected using compounds selected using 

1 S screening protocols such as those described herein and/or by using the gene therapy and antisense 
approaches described in the art and herein. Gene therapy can be used to effect targeted expression 
of GENSET genes in any tissue, e.g. a tissue associated with the disease or condition to be treated. 
The GENSET coding sequence can be cloned into an appropriate expression vector and targeted to 
a particular cell type(s) to achieve efficient, high level expression. Introduction of the GENSET 

20 coding sequence into target cells can be achieved, for example, using particle mediated DNA 

delivery, (Haynes, 1996 and Maurer, 1999), direct injection of naked DNA, (Levy et al, 1996; and 
Feigner, 1996), or viral vector mediated transport (Smith et al, 1996, Stone et al, 2000; Wu and 
Atai, 2000), each of which disclosures are hereby incorporated by reference in their entireties. 
Tissue specific effects can be achieved, for example, in the case of vims mediated transport by 

25 using viral vectors that are tissue specific, or by the use of promoters that are tissue specific. For 
instance, any tissues-specific promoter may be used to achieve specific expression, for example 
albumin promoters (liver specific; Pinkert et al., 1987 Genes Dev. 1:268-277), lymphoid specific 
promoters (Calame et al., 1988 Adv. Immunol. 43:235-275), promoters of T-cell receptors (Winoto 
et al., 1989 EMBO J. 8:729-733) and immunoglobulins (Banerji et al., 1983 Cell 33:729-740; 

30 Queen and Baltimore 1983 Cell 33:741-748), neuron-specific promoters (e.g. the neurofilament 
promoter; Byrne et al., 1989 Proc. Natl. Acad. Sci. USA 86:5473-5477), pancreas-specific 
promoters (Ediunch et al., 1985 Science 230:912-916) or mammary gland-specific promoters (milk 
whey promoter, U.S. Pat. No. 4,873,316 and European Application Publication No. 264, 166). 
Developmentally-regulated promoters can also be used, such as the murine homeobox promoters 

35 (Kessel et al., 1 990 Science 249:374-379) or the alpha-fetoprotein promoter (Campes et al., 1 989 
Genes Dev. 3:537-546). 

Combinatorial approaches can also be used to ensure that the GENSET coding sequence is 
activated in the target tissue (Butt and Karathanasis, 1995; Miller and Whelan, 1997), which 
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disclosures are hereby incorporated by reference in their entireties. Antisense oligonucleotides 
complementary to GENSET mRNA can be used to selectively diminish or ablate the expression of 
the protein, for example, at sites of inflammation. More specifically, antisense constructs or 
antisense oligonucleotides can be used to inhibit the production of GENSET in high expressing 
S cells such as those cited in the third column of Table V. Antisense mRNA can be produced by 
transfecting into target cells an expression vector with the GENSET gene sequence, or a portion 
thereof, oriented in an antisense direction relative to the direction of transcription. Appropriate 
vectors include viral vectors, including retroviral, adenoviral, and adeno-associated viral vectors, as 
well as nonviral vectors. Tissue specific promoters can be used, as described supra. Alternatively, 

10 antisense oligonucleotides can be introduced directly into target cells to achieve the same goal. 
(See also other delivery methodologies described herein in connection with gene therapy.). 
Oligonucleotides can be selected/designed to achieve a high level of specificity (Wagner et al^ 
1996), which disclosure is hereby incorporated by reference in its entirety. The therapeutic 
methodologies described herein are applicable to both human and non-human mammals (including 

IS cats and dogs). 

Pharmaceutical and physiologically acceptable compositions 

The present invention also relates to pharmaceutical or physiologically acceptable 
compositions comprising, as active agent, the polypeptides, nucleic acids or antibodies of the 
invention. The invention also relates to compositions comprising, as active agent, compounds 

20 selected using the above-described screening protocols. Such compositions include the active agent 
in combination with a pharmaceutical or physiologically acceptable carriers. In the case of naked 
DNA, the "carrier" may be gold particles. The amount of active agent in the composition can vary 
with the agent, the patient and the effect sought. Likewise, the dosing regimen can vary depending 
on the composition and the disease/disorder to be treated. 

2S Therefore, the invention related to methods for the production of pharmaceutical 

composition comprising a method for selecting an active agent, compound, substance or molecule 
using any of the screening method described herein and furthermore mixing the identified active 
agent, compound, substance or molecule with a pharmaceutical ly acceptable carrier. 

The pharmaceutical compositions utilized in this invention may be administered by any 

30 number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 
intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. In addition to the active ingredients, these 
pharmaceutical compositions may contain suitable pharmaceutically acceptable carriers comprising 
excipients and auxiliaries which facilitate processing of the active compounds into preparations 

35 which can be used pharmaceutically. Further details on techniques for formulation and 

administration may be found in the latest edition of Remington's Pharmaceutical Sciences (Maack 
Publishing Co. Easton, Pa). 
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Pharmaceutical compositions for oral administration can be formulated using 
pharmaceuticaliy acceptable carriers well known in the art in dosages suitable for oral 
administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, 
pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by the 
5 patient. 

Pharmaceutical preparations for oral use can be obtained through a combination of active 
compounds with solid excipient, suiting mixture is optionally grinding, and processing the mixture 
of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable 
excipients are carbohydrate or protein fillers, such as sugars, including lactose, sucrose, mannitol, or 

10 sorbitol; starch from com, wheat, rice, potato, or other plants; cellulose, such as methyl cellulose, 
hydroxypropylmethyUcellulose, or sodium carboxymethylcellulose; gums including arabic and 
tragacanth; and proteins such as gelatin and collagen. If desired, disintegrating or solubilizing 
agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, alginic acid, or a salt 
thereof, such as sodium alginate. 

15 Dragee cores may be used in conjunction with suitable coatings, such as concentrated sugar 

solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, 
polyethylene glycol, and/or titaniumdioxide, lacquer solutions, and suitable organic solvents or 
solvent mixtures. DyestufTs or pigments may be added to the tablets or dragee coatings for product 
identification or to characterize the quantity of active compound, i.e., dosage. 

20 Pharmaceutical preparations which can be used orally include push-fit capsules made of 

gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. 
Push-fit capsules can contain active ingredients mixed with a filler or binders, such as lactose or 
starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft 
capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, 

25 liquid, or liquidpolyethylene glycol with or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration may be formulated in 
aqueous solutions, preferably in physiologically compatible buft^ers such as Hanks solution. 
Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions may contain 
substances which increase the viscosity of the suspension, such as sodium carboxymethylcellulose, 

30 sorbitol, or dextran. Additionally, suspensions of the active compounds may be prepared as 
appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils 
such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. 
Optionally, the suspension may also contain suitable stabilizers or agents which increase the 
solubility of the compounds to allow for the preparation of highly concentrated solutions. 

35 For topical or nasal administration, penetrants appropriate to the particular barrier to be 

permeated are used in the formulation. Such penetrants are generally known in the art. 

The pharmaceutical compositions of the present invention may be manufactured in a 
manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. 
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The pharmaceutical composition may be provided as a salt and can be formed with many 
acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, succinic, etc. 
Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding free 
base forms. In other cases, the preferred preparation may be a lyophilized powder which may 
S contain any or all of the following: 1-50 mM histidine, 0.1%-2% sucrose, and 2-7% mannitol, at a 
pH range of 4.5 to S.S, that is combined with buifer prior to use. 

After pharmaceutical compositions have been prepared, they can be placed in an 
appropriate container and labeled for treatment of an indicated condition. For administration of a 
GENSET polypeptide, such labeling would include amount, frequency, and method of 
10 administration. 

Pharmaceutical compositions suitable for use in the invention include compositions wherein 
the active ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

For any compound, the therapeutically effective dose can be estimated initially either in cell 

15 culture assays, e.g., of neoplastic cells, or in animal models, usually mice, rabbits, dogs, or pigs. 
The animal model may also be used to determine the appropriate concentration range and route of 
administration. Such information can then be used to determine useful doses and routes for 
administration in humans. 

A therapeutically efTective dose refers to that amount of active ingredient, for example a 

20 GENSET polypeptide or fragments thereof, antibodies specific to GENSET polypeptides, agonists, 
antagonists or inhibitors of GENSET polypeptides, which ameliorates the symptoms or condition. 
Therapeutic efficacy and toxicity may be determined by standard pharmaceutical procedures in cell 
cultures or experimental animals, e.g., ED50 (the dose therapeutically effective in 50% of the 
population) and LD50 (the dose lethal to 50% of the population). The dose ratio between 

25 therapeutic and toxic effects is the therapeutic index, and it can be expressed as the ratio, 

LD50/ED50. Pharmaceutical compositions which exhibit large therapeutic indices are preferred. 
The data obtained from cell culture assays and animal studies is used in formulating a range of 
dosage for human use. The dosage contained in such compositions is preferably within a range of 
circulating concentrations that include the ED50 with little or no toxicity. The dosage varies within 

30 this range depending upon the dosage form employed, sensitivity of the patient, and the route of 
administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject that requires treatment. Dosage and administration are adjusted to provide sufficient levels 
of the active moiety or to maintain the desired effect. Factors which may be taken into account 
35 include the severity of the disease state, general health of the subject, age, weight, and gender of the 
subject, diet, time and frequency of administration, drug combination(s), reaction sensitivities, and 
tolerance/response to therapy. Long-acting pharmaceutical compositions maybe administered every 
3 to 4 days, every week, or once every two weeks depending on half-life and clearance rate of the 
particular formulation. 
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Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up to a total dose of 
about 1 g, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
S inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 



Uses of Genset Sequences: Computer-Rela ted Embodiments 

As used herein the term "cDNA codes of SEQ ID NOs:l-169, 339-455, 561-784" 

10 encompasses the nucleotide sequences of SEQ ID NOs:l-169, 339-455, 561-784 and of clones 
inserts of the deposited clone pool, fragments thereof, nucleotide sequences homologous thereto, 
and sequences complementary to all of the preceding sequences. The fragments include fragments 
ofSEQ IDNOs:M69, 339-455, 561-784 comprising at least 8, 10, 12, 15, 18,20,25,28, 30,35, 
40, 50, 75, 100, 150, 200, 300, 400, 500, 1000 or 2000 consecutive nucleotides of SEQ IDNOs:!- 

15 169, 339-455, 561-784. Preferably the fragments include polynucleotides described herein as 
encoding polypeptides having a biological activity. Homologous sequences and fragments of SEQ 
ID NOs: 1-169, 339-455, 561-784 refer to a sequence having at least 99%, 98%, 97%, 96%, 95%, 
90%, 85%, 80%, or 75% identity to these sequences. Identity may be determined using any of the 
computer programs and parameters described herein, including BLAST2N with the default 

20 parameters or with any modified parameters. Homologous sequences also include RNA sequences 
in which uridines replace the thymines in the cDNA codes of SEQ ID NOs: I -169, 339-455, 561- 
784, The homologous sequences may be obtained using any of the procedures described herein or 
may result from the correction of a sequencing error as described above. It will be appreciated that 
the cDNA codes of SEQ ID NOs: 1-169, 339-455, 561-784 can be represented in the traditional 

25 single character format (see, e.g. the inside back cover of Stryer, 1 995) or in any other format which 
records the identity of the nucleotides in a sequence. 

As used herein the term "polypeptide codes of SEQ ID NOs: 1 70-33 8, 456-560, 785-9 1 8" 
encompasses the polypeptide sequences of SEQ ID NOs:170-338, 456-560, 785-918 which are 
encoded by the cDNAs of SEQ ID NOs: 1-169, 339-455, 561-784, the polypeptide sequences 

30 encoded by the clone inserts of the deposited clone pool, polypeptide sequences homologous 

thereto, or fragments of any of the preceding sequences. Homologous polypeptide sequences refer 
to a polypeptide sequence having at least 99%, 98%, 97%, 96%, 95%, 90%, 85%, 80%, 75% 
identity to one of the polypeptide sequences of SEQ ID NOs: 170-338, 456-560, 785-91 8. Identity 
may be determined using any of the computer programs and parameters described herein, including 

35 FASTA with the default parameters or with any modified parameters. The homologous sequences 
may be obtained using any of the procedures described herein or may result from the correction of a 
sequencing error as described above. The polypeptide fragments comprise at least 5, 6, 8, 10, 12, 
15, 20, 25, 30, 35, 40, 50, 60, 75, 100, 1 50 or 200 consecutive amino acids of the polypeptides of 
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SEQ [DNOs:170O38, 456-560, 785-918. Preferably, the fragments include polypeptides described 
herein as having a biological activity, or fragments comprising at least 5, 10, 1 5, 20, 25, 30, 35, 40, 
50, 75, 100, or 150 consecutive amino acids of polypeptides described herein as having a biological 
activity. It will be appreciated that the polypeptide codes of the SEQ ID NOs: 170-338, 456-560, 
5 785-9 1 8 can be represented in the traditional single character format or three letter format (see, the 
inside back cover of Stryer, 1995) or in any other format which relates the identity of the 
polypeptides in a sequence. 

It will be appreciated by those skilled in the art that the nucleic acid codes of the invention 
and polypeptide codes of the invention can be stored, recorded, and manipulated on any medium 

10 which can be read and accessed by a computer. As used herein, the words "recorded" and "stored" 
refer to a process for storing information on a computer medium. A skilled artisan can readily 
adopt any of the presently known methods for recording information on a computer readable 
medium to generate manufactures comprising one or more of the nucleic acid codes of the 
invention, or one or more of the polypeptide codes of the invention. Another aspect of the present 

1 5 invention is a computer readable medium having recorded thereon at least 2, 5, 1 0, 1 5, 20, 25, 30, or 
50 nucleic acid codes of the invention. Another aspect of the present invention is a computer 
readable medium having recorded thereon at least 2, 5, 10, 1 5, 20, 25, 30, or 50 polypeptide codes 
of the invention. 

Computer readable media include magnetically readable media, optically readable media, 
20 electronically readable media and magnetic/optical media. For example, the computer readable 
media may be a hard disk, a floppy disk, a magnetic tape, CD-ROM, Digital Versatile Disk (DVD), 
Random Access Memory (RAM), or Read Only Memory (ROM) as well as other types of other 
media known to those skilled in the art. 

Embodiments of the present invention include systems, particularly computer systems 
25 which store and manipulate the sequence information described herein. One example of a computer 
system 100 is illustrated in block diagram form in Figure 1 . As used herein, **a computer system" 
refers to the hardware components, software components, and data storage components used to 
analyze the nucleotide sequences of the nucleic acid codes of the invention or the amino acid 
sequences of the polypeptide codes of the invention. In one embodiment, the computer system 1 00 
30 is a Sun Enterprise 1000 server (Sun Microsystems, Palo Alto, CA). The computer system 100 
preferably includes a processor for processing, accessing and manipulating the sequence data. The 
processor 105 can be any well-known type of central processing unit, such as the Pentium III from 
Intel Corporation, or similar processor from Sun, Motorola, Compaq or International Business 
Machines. 

35 Preferably, the computer system 1 00 is a general purpose system that comprises the 

processor 105 and one or more internal data storage components 1 10 for storing data, and one or 
more data retrieving devices for retrieving the data stored on the data storage components. A 
skilled artisan can readily appreciate that any one of the currently available computer systems are 
suitable. 
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In one particular embodiment, the computer system 100 includes a processor 105 connected 
to a bus which is connected to a main memory 1 1 5 (preferably implemented as RAM) and one or 
more internal data storage devices 1 10, such as a hard drive and/or other computer readable media 
having data recorded thereon. In some embodiments, the computer system 100 further includes one 
S or more data retrieving device 1 1 8 for reading the data stored on the internal data storage devices 
110. 

The data retrieving device 1 1 8 may represent, for example, a floppy disk drive, a compact 
disk drive, a magnetic tape drive, etc. In some embodiments, the internal data storage device 1 10 is 
a removable computer readable medium such as a floppy disk, a compact disk, a magnetic tape, etc. 
10 containing control logic and/or data recorded thereon. The computer system 1 00 may 

advantageously include or be programmed by appropriate software for reading the control logic 
and/or the data from the data storage component once inserted in the data retrieving device. 

The computer system 100 includes a display 120 which is used to display output to a 
computer user. It should also be noted that the computer system 100 can be linked to other 
IS computer systems 125a-c in a network or wide area network to provide centralized access to the 
computer system 100. 

Software for accessing and processing the nucleotide sequences of the nucleic acid codes of 
the invention or the amino acid sequences of the polypeptide codes of the invention (such as search 
tools, compare tools, and modeling tools etc.) may reside in main memory 1 IS during execution. 

20 In some embodiments, the computer system 100 may further comprise a sequence comparer 

for comparing the above*described nucleic acid codes of the invention or the polypeptide codes of 
the invention stored on a computer readable medium to reference nucleotide or polypeptide 
sequences stored on a computer readable medium. A "sequence comparer" refers to one or more 
programs which are implemented on the computer system 100 to compare a nucleotide or 

25 polypeptide sequence with other nucleotide or polypeptide sequences and/or compounds including 
but not limited to peptides, peptidomimetics, and chemicals stored within the data storage means. 
For example, the sequence comparer may compare the nucleotide sequences of nucleic acid codes 
of the invention or the amino acid sequences of the polypeptide codes of the invention stored on a 
computer readable medium to reference sequences stored on a computer readable medium to 

30 identify homologies, motifs implicated in biological function, or structural motifs. The various 
sequence comparer programs identified elsewhere in this patent specification are particularly 
contemplated for use in this aspect of the invention. 

Figure 2 is a flow diagram illustrating one embodiment of a process 200 for comparing a 
new nucleotide or protein sequence with a database of sequences in order to determine the 

35 homology levels between the new sequence and the sequences in the database. The database of 
sequences can be a private database stored within the computer system 1 00, or a public database 
such as GENBANK, PIR OR SWISSPROT that is available through the Internet. 
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The process 200 begins at a start state 201 and then moves to a state 202 wherein the new 
sequence to be compared is stored to a memory in a computer system 100. As discussed above, the 
memory could be any type of memory, including RAM or an internal storage device. 

The process 200 then moves to a state 204 wherein a database of sequences is opened for 
5 analysis and comparison. The process 200 then moves to a state 206 wherein the first sequence 
stored in the database is read into a memory on the computer. A comparison is then performed at a 
state 210 to determine if the first sequence is the same as the second sequence. It is important to 
note that this step is not limited to performing an exact comparison between the new sequence and 
the first sequence in the database. Well-known methods are known to those of skill in the art for 
1 0 comparing two nucleotide or protein sequences, even if they are not identical. For example, gaps 
can be introduced into one sequence in order to raise the homology level between the two tested 
sequences. The parameters that control whether gaps or other features are introduced into a 
sequence during comparison are normally entered by the user of the computer system. 

Once a comparison of the two sequences has been performed at the state 2 10, a 
1 5 determination is made at a decision state 2 1 0 whether the two sequences are the same. Of course, 
the term "same" is not limited to sequences that are absolutely identical. Sequences that are within 
the homology parameters entered by the user will be marked as "same" in the process 200. 

If a determination is made that the two sequences are the same, the process 200 moves to a 
state 214 wherein the name of the sequence from the database is displayed to the user. This state 
20 notifies the user that the sequence with the displayed name fulfills the homology constraints that 
were entered. Once the name of the stored sequence is displayed to the user, the process 200 moves 
to a decision state 2 1 8 wherein a determination is made whether more sequences exist in the 
database. If no more sequences exist in the database, then the process 200 terminates at an end state 
220. However, if more sequences do exist in the database, then the process 200 moves to a state 
25 224 wherein a pointer is moved to the next sequence in the database so that it can be compared to 
the new sequence. In this manner, the new sequence is aligned and compared with every sequence 
in the database. 

It should be noted that if a determination had been made at the decision state 212 that the 
sequences were not homologous, then the process 200 would move immediately to the decision 
30 state 2 1 8 in order to determine if any other sequences were available in the database for 
comparison. 

Accordingly, one aspect of the present invention is a computer system comprising a 
processor, a data storage device having stored thereon a nucleic acid code of the invention or a 
polypeptide code of the invention, a data storage device having retrievably stored thereon reference 
35 nucleotide sequences or polypeptide sequences to be compared to the nucleic acid code of the 
invention or polypeptide code of the invention and a sequence comparer for conducting the 
comparison. The sequence comparer may indicate a homology level between the sequences 
compared or identify motifs implicated in biological function and structural motifs in the nucleic 
acid code of the invention and polypeptide codes of the invention or it may identify structural 
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motifs in sequences which are compared to these nucleic acid codes and polypeptide codes. In 
some embodiments, the data storage device may have stored thereon the sequences of at least 2, 5, 
10, 15, 20, 25, 30, or 50 of the nucleic acid codes of the invention or polypeptide codes of the 
invention. 

S Another aspect of the present invention is a method for determining the level of homology 

between a nucleic acid code of the invention and a reference nucleotide sequence, comprising the 
steps of reading the nucleic acid code and the reference nucleotide sequence through the use of a 
computer program which determines homology levels and determining homology between the 
nucleic acid code and the reference nucleotide sequence with the computer program. The computer 

10 program may be any of a number of computer programs for determining homology levels, including 
those specifically enumerated herein, including BLAST2N with the default parameters or with any 
modified parameters. The method may be implemented using the computer systems described 
above. The method may also be performed by reading 2, 5, 10, 15, 20, 25, 30, or 50 of the above 
described nucleic acid codes of the invention through the use of the computer program and 

15 determining homology between the nucleic acid codes and reference nucleotide sequences. 

Figure 3 is a flow diagram illustrating one embodiment of a process 250 in a computer for 
determining whether two sequences are homologous. The process 250 begins at a start state 252 and 
then moves to a state 254 wherein a first sequence to be compared is stored to a memory. The 
second sequence to be compared is then stored to a memory at a state 256. The process 250 then 

20 moves to a state 260 wherein the first character in the first sequence is read and then to a state 262 
wherein the first character of the second sequence is read. It should be understood that if the 
sequence is a nucleotide sequence, then the character would normally be either A, T, C, G or U. If 
the sequence is a protein sequence, then it should be in the single letter amino acid code so that the 
first and sequence sequences can be easily compared. 

25 A determination is then made at a decision state 264 whether the two characters are the 

same. If they are the same, then the process 250 moves to a state 268 wherein the next characters in 
the first and second sequences are read. A determination is then made whether the next characters 
are the same. If they are, then the process 250 continues this loop until two characters are not the 
same. If a determination is made that the next two characters are not the same, the process 250 

30 moves to a decision state 274 to determine whether there are any more characters either sequence to 
read. 

If there are no more characters to read, then the process 250 moves to a state 276 wherein 
the level of homology between the first and second sequences is displayed to the user. The level of 
homology is determined by calculating the proportion of characters between the sequences that 
35 were the same out of the total number of sequences in the first sequence. Thus, if every character in 
a first 100 nucleotide sequence aligned with a every character in a second sequence, the homology 
level would be 100%. 

Alternatively, the computer program may be a computer program which compares the 
nucleotide sequences of the nucleic acid codes of the present invention, to reference nucleotide 
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sequences in order to determine whether the nucleic acid code of the invention differs from a 
reference nucleic acid sequence at one or more positions. Optionally such a program records the 
length and identity of inserted, deleted or substituted nucleotides with respect to the sequence of 
either the reference polynucleotide or the nucleic acid code of the invention. In one embodiment, 
S the computer program may be a program which determines whether the nucleotide sequences of the 
nucleic acid codes of the invention contain one or more single nucleotide polymorphisms (SNP) 
with respect to a reference nucleotide sequence. These single nucleotide polymorphisms may each 
comprise a single base substitution, insertion, or deletion. 

Another aspect of the present invention is a method for determining the level of homology 

10 between a polypeptide code of the invention and a reference polypeptide sequence, comprising the 
steps of reading the polypeptide code of the invention and the reference polypeptide sequence 
through use of a computer program which determines homology levels and determining homology 
between the polypeptide code and the reference polypeptide sequence using the computer program. 
Accordingly, another aspect of the present invention is a method for determining whether a 

IS nucleic acid code of the invention differs at one or more nucleotides from a reference nucleotide 
sequence comprising the steps of reading the nucleic acid code and the reference nucleotide 
sequence through use of a computer program which identifies differences between nucleic acid 
sequences and identifying differences between the nucleic acid code and the reference nucleotide 
sequence with the computer program. In some embodiments, the computer program is a program 

20 which identifies single nucleotide polymorphisms. The method may be implemented by the 

computer systems described above and the method illustrated in Figure 3, The method may also be 
performed by reading at least 2, 5, 10, 1 5, 20, 25, 30, or 50 of the nucleic acid codes of the 
invention and the reference nucleotide sequences through the use of the computer program and 
identifying differences between the nucleic acid codes and the reference nucleotide sequences with 

25 the computer program. 

In other embodiments the computer based system may further comprise an identifier for 
identifying features within the nucleotide sequences of the nucleic acid codes of the invention or the 
amino acid sequences of the polypeptide codes of the invention. An "identifier*' refers to one or 
more programs which identifies certain features within the above-described nucleotide sequences of 

30 the nucleic acid codes of the invention or the amino acid sequences of the polypeptide codes of the 
invention. In one embodiment, the identifier may comprise a program which identifies an open 
reading frame in the cDNAs codes of the invention. 

Figure 4 is a flow diagram illustrating one embodiment of an identifier process 300 for 
detecting the presence of a feature in a sequence. The process 300 begins at a start state 302 and 

35 then moves to a state 304 wherein a first sequence that is to be checked for features is stored to a 
memory 1 15 in the computer system 100. The process 300 then moves to a state 306 wherein a 
database of sequence features is opened. Such a database would include a list of each feature's 
attributes along with the name of the feature. For example, a feature name could be "Initiation 
Codon" and the attribute would be "ATG". Another example would be the feature name 
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"TAATAA Box" and the feature attribute would be "TAATAA". An example of such a database is 
produced by the University of Wisconsin Genetics Computer Group (www.gcg.com). 

Once the database of features is opened at the state 306, the process 300 moves to a state 
308 wherein the first feature is read from the database. A comparison of the attribute of the first 
5 feature with the first sequence is then made at a state 3 10. A determination is then made at a 
decision state 3 1 6 whether the attribute of the feature was found in the first sequence. If the 
attribute was found, then the process 300 moves to a state 3 1 8 wherein the name of the found 
feature is displayed to the user. 

The process 300 then moves to a decision state 320 wherein a determination is made 

10 whether move features exist in the database. If no more features do exist, then the process 300 
terminates at an end state 324. However, if more features do exist in the database, then the process 
300 reads the next sequence feature at a state 326 and loops back to the state 3 10 wherein the 
attribute of the next feature is compared against the first sequence. 

It should be noted, that if the feature attribute is not found in the first sequence at the 

1 S decision state 3 1 6, the process 300 moves directly to the decision state 320 in order to determine if 
any more features exist in the database. 

In another embodiment, the identifier may comprise a molecular modeling program which 
determines the 3-dimensionaI structure of the polypeptides codes of the invention. Such programs 
may use any methods known to those skilled in the art including methods based on homology- 

20 modeling, fold recognition and ab initio methods as described in Sternberg et al, 1999, which 
disclosure is hereby incorporated by reference in its entirety. In some embodiments, the molecular 
modeling program identifies target sequences that are most compatible with profiles representing 
the structural environments of the residues in known three-dimensional protein structures. (See, 
e.g., Eisenberg et al., U.S. Patent No. 5,436,850 issued July 25, 1995, which disclosure is hereby 

25 incorporated by reference in its entirety). In another technique, the known three-dimensional 
structures of proteins in a given family are superimposed to define the structurally conserved 
regions in that family. This protein modeling technique also uses the known three-dimensional 
structure of a homologous protein to approximate the structure of the polypeptide codes of the 
invention. (See e.g., Srinivasan, et aL, U.S. Patent No. 5,557,535 issued September 17, 1996, which 

30 disclosure is hereby incorporated by reference in its entirety). Conventional homology modeling 
techniques have been used routinely to build models of proteases and antibodies. (Sowdhamini et 
al.y (1997)). Comparative approaches can also be used to develop three-dimensional protein models 
when the protein of interest has poor sequence identity to template proteins. In some cases, proteins 
fold into similar three-dimensional structures despite having very weak sequence identities. For 

35 example, the three-dimensional structures of a number of helical cytokines fold in similar three- 
dimensional topology in spite of weak sequence homology. 

The recent development of threading methods now enables the identification of likely 
folding patterns in a number of situations where the structural relatedness between target and 
template(s) is not detectable at the sequence level. Hybrid methods, in which fold recognition is 



223 



wo 02/083898 PCT/IBO 1/0091 4 

performed using Multiple Sequence Threading (MST), structural equivalencies are deduced from 
the threading output using a distance geometry program DRAGON to construct a low resolution 
model, and a full-atom representation is constructed using a molecular modeling package such as 
QUANTA. 

S According to this 3-step approach, candidate templates are first identified by using the 

novel fold recognition algorithm MST, which is capable of performing simultaneous threading of 
multiple aligned sequences onto one or more 3-D structures. In a second step, the structural 
equivalencies obtained from the MST output are converted into interresidue distance restraints and 
fed into the distance geometry program DRAGON, together with auxiliary information obtained 

10 from secondary structure predictions. The program combines the restraints in an unbiased manner 
and rapidly generates a large number of low resolution model confirmations. In a third step, these 
low resolution model confirmations are converted into full-atom models and subjected to energy 
minimization using the molecular modeling package QUANTA. (See e.g., Aszodi et al., (1997)). 
The results of the molecular modeling analysis may then be used in rational drug design 

1 S techniques to identify agents which modulate the activity of the polypeptide codes of the invention. 
Accordingly, another aspect of the present invention is a method of identifying a feature 
within the nucleic acid codes of the invention or the polypeptide codes of the invention comprising 
reading the nucleic acid code(s) or the polypeptide code(s) through the use of a computer program 
which identifies features therein and identifying features within the nucleic acid code(s) or 

20 polypeptide code(s) with the computer program. In one embodiment, computer program comprises 
a computer program which identifies open reading frames. In a further embodiment, the computer 
program identifies linear or structural motifs in a polypeptide sequence. In another embodiment, 
the computer program comprises a molecular modeling program. The method may be performed by 
reading a single sequence or at least 2, 5, 10, 1 5, 20, 25, 30, or 50 of the nucleic acid codes of the 

25 invention or the polypeptide codes of the invention through the use of the computer program and 
identifying features within the nucleic acid codes or polypeptide codes with the computer program. 

The nucleic acid codes of the invention or the polypeptide codes of the invention may be 
stored and manipulated in a variety of data processor programs in a variety of formats. For 
example, they may be stored as text in a word processing file, such as MicrosoftWORD or 

30 WORDPERFECT or as an ASCII file in a variety of database programs familiar to those of skill in 
the art, such as DB2, SYBASE, or ORACLE. In addition, many computer programs and databases 
may be used as sequence comparers, identifiers, or sources of reference nucleotide or polypeptide 
sequences to be compared to the nucleic acid codes of the invention or the polypeptide codes of the 
invention. The following list is intended not to limit the invention but to provide guidance to 

35 programs and databases which are useful with the nucleic acid codes of the invention or the 

polypeptide codes of the invention. The programs and databases which may be used include, but 
are not limited to: MacPattem (EMBL), DiscoveryBase (Molecular Applications Group), GeneMine 
(Molecular Applications Group), Look (Molecular Applications Group), MacLook (Molecular 
Applications Group), BLAST and BLAST2 (NCBI), BLASTN and BLASTX (Altschul et al, 1990), 
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FASTA (Pearson and Lipman, 1988), FASTDB (Brutlag etal, 1990), Catalyst (Molecular 
Simulations Inc.), Catalyst/SHAPE (Molecular Simulations Inc.), Cerius2.DBAccess (Molecular 
Simulations Inc.), HypoGen (Molecular Simulations Inc.), Insight II, (Molecular Simulations Inc.), 
Discover (Molecular Simulations Inc.), CHARMm (Molecular Simulations Inc.), Felix (Molecular 
S Simulations Inc.), DelPhi, (Molecular Simulations Inc.), QuanteMM, (Molecular Simulations Inc.), 
Homology (Molecular Simulations Inc.), Modeler (Molecular Simulations Inc.), ISIS (Molecular 
Simulations Inc.), Quanta/Protein Design (Molecular Simulations Inc.)» WebLab (Molecular 
Simulations Inc.), WebLab Diversity Explorer (Molecular Simulations Inc.), Gene Explorer 
(Molecular Simulations Inc.), SeqFold (Molecular Simulations Inc.), the EMBL/Swissprotein 

10 database, the MDL Available Chemicals Directory database, the MDL Drug Data Report data base, 
the Comprehensive Medicinal Chemistry database, Derwents's World Drug Index database, the 
BioByteMasterFile database, the Genbank database, and the Genseqn database. Many other 
programs and data bases would be apparent to one of skill in the art given the present disclosure. 
Motifs which may be detected using the above programs include sequences encoding 

15 leucine zippers, helix-tum-helix motifs, glycosylation sites, ubiquitination sites, alpha helices, and 
beta sheets, signal sequences encoding signal peptides which direct the secretion of the encoded 
proteins, sequences implicated in transcription regulation such as homeoboxes, acidic stretches, 
enzymatic active sites, substrate binding sites, and en^matic cleavage sites. 

Conclusion 

20 As discussed above, the GENSET polynucleotides and polypeptides of the present 

invention or fragments thereof can be used for various purposes. The polynucleotides can be used 
to express recombinant protein for analysis, characterization or therapeutic use; as markers for 
tissues in which the corresponding protein is preferentially expressed (either constitutively or at a 
particular stage of tissue differentiation or development or in disease states); as molecular weight 

25 markers on Southern gels; as chromosome markers or tags (when labeled) to identify chromosomes 
or to map related gene positions; as a reagent (including a labeled reagent) in assays designed to 
quantitatively determine levels of GENSET expression in biological samples; to compare with 
endogenous DNA sequences in patients to identify potential genetic disorders; as probes to 
hybridize and thus discover novel, related DNA sequences; as a source of information to derive 

30 PCR primers for genetic flngerprinting; for selecting and making oligomers for attachment to a 
"gene chip" or other support, including for examination for expression patterns; to raise anti-protein 
antibodies using DNA immunization techniques; and as an antigen to raise anti-DNA antibodies or 
elicit another immune response. Where the polynucleotide encodes a protein which binds or 
potentially binds to another protein (such as, for example, in a receptor-ligand interaction), the 

35 polynucleotide can also be used in interaction trap assays (such as, for example, that described in 
Gyuris et al^ (1993) to identify polynucleotides encoding the other protein with which binding 
occurs or to identify inhibitors of the binding interaction. 
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The proteins or polypeptides provided by the present invention can similarly be used in 
assays to determine biological activity, including in a panel of multiple proteins for high-throughput 
screening; to raise antibodies or to elicit another immune response; as a reagent (including the 
labeled reagent) in assays designed to quantitatively determine levels of the protein (or its receptor) 
S in biological fluids; as markers for tissues in which the corresponding protein is preferentially 
expressed (either constitutively or at a particular stage of tissue differentiation or development or in 
a disease state); and, of course, to isolate correlative receptors or ligands. Where the protein binds 
or potentially binds to another protein (such as, for example, in a receptor-ligand interaction), the 
protein can be used to identify the other protein with which binding occurs or to identify inhibitors 
1 0 of the binding interaction. Proteins involved in these binding interactions can also be used to screen 
for peptide or small molecule inhibitors or agonists of the binding interaction. 

Any or all of these research utilities are capable of being developed into reagent grade or kit 
format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in the art. 
1 S References disclosing such methods include without limitation "Molecular Cloning; A Laboratory 
Manual", 2d ed., Cole Spring Harbor Laboratory Press, Sambrook, J., E.F. Fritsch and T. Maniatis 
eds., 1989, and "Methods in Enzymology; Guide to Molecular Cloning Techniques", Academic 
Press, Berger and Kimmel eds., 1987, which disclosures are hereby incorporated by reference in 
their entireties. 

20 Polynucleotides and proteins of the present invention can also be used as nutritional sources 

or supplements. Such uses include without limitation use as a protein or amino acid supplement, 
use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In such cases 
the protein or polynucleotide of the invention can be added to the feed of a particular organism or 
can be administered as a separate solid or liquid preparation, such as in the form of powder, pills, 

25 solutions, suspensions or capsules. In the case of microorganisms, the protein or polynucleotide of 
the invention can be added to the medium in or on which the microorganism is cultured. 

Although this invention has been described in terms of certain preferred embodiments, 
other embodiments which will be apparent to those of ordinary skill in the art in view of the 
disclosure herein are also within the scope of this invention. Accordingly, the scope of the 

30 invention is intended to be deflned only by reference to the appended claims. 



Examples 

Preparation of Antibody Compositions to the GENSET protein 

Substantially pure protein or polypeptide is isolated from transfected or transformed cells 
35 containing an expression vector encoding the GENSET protein or a portion thereof. The 

concentration of protein in the final preparation is adjusted, for example, by concentration on an 
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Amicon filter device, to the level of a few micrograms/ml. Monoclonal or polyclonal antibody to the 
protein can then be prepared as follows: 

A. Monoclonal Antibody Production bv Hvbridoma Fusion 

Monoclonal antibody to epitopes in the GENSET protein or a portion thereof can be 
S prepared from murine hybridomas according to the classical method of Kohler and Milstein, (1 97S) 
or derivative methods thereof Also see Harlow and Lane. (1988). 

Briefly, a mouse is repetitively inoculated with a few micrograms of the GENSET protein 
or a portion thereof over a period of a few weeks. The mouse is then sacrificed, and the antibody 
producing cells of the spleen isolated. The spleen cells are fused by means of polyethylene glycol 

10 with mouse myeloma cells, and the excess unfused cells destroyed by growth of the system on 
selective media comprising aminopterin (HAT media). The successfully fused cells are diluted and 
aliquots of the dilution placed in wells of a microtiter plate where growth of the culture is 
continued. Antibody-producing clones are identified by detection of antibody in the supernatant 
fluid of the wells by immunoassay procedures, such as ELISA, as originally described by Engvall, 

15 (1 980), which disclosure is hereby incorporated by reference in its entirety, and derivative methods 
thereof. Selected positive clones can be expanded and their monoclonal antibody product harvested 
for use. Detailed procedures for monoclonal antibody production are described in Davis, et aL 
(1986) Section 21-2. 

20 B. Polyclonal Antibody Production bv Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes in the GENSET 
protein or a portion thereof can be prepared by immunizing suitable non-human animal with the 
GENSET protein or a portion thereof, which can be unmodified or modified to enhance 
immunogenicity. A suitable non-human animal is preferably a non-human mammal is selected, 

25 usually a mouse, rat, rabbit, goat, or horse. Alternatively, a crude preparation which has been 
enriched for GENSET concentration can be used to generate antibodies. Such proteins, fragments 
or preparations are introduced into the non-human mammal in the presence of an appropriate 
adjuvant (e.g. aluminum hydroxide, RIBI, etc.) which is known in the art. In addition the protein, 
fragment or preparation can be pretreated with an agent which will increase antigenicity, such 

30 agents are known in the art and include, for example, methylated bovine serum albumin (mBSA), 
bovine serum albumin (BSA), Hepatitis B surface antigen, and keyhole limpet hemocyanin (KLH). 
Serum from the immunized animal is collected, treated and tested according to known procedures. 
If the serum contains polyclonal antibodies to undesired epitopes, the polyclonal antibodies can be 
purified by immunoaffinity chromatography. 

35 EfTective polyclonal antibody production is affected by many factors related both to the 

antigen and the host species. Also, host animals vary in response to site of inoculations and dose, 
with both inadequate or excessive doses of antigen resulting in low titer antisera. Small doses (ng 
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level) of antigen administered at multiple intradermal sites appears to be most reliable. Techniques 
for producing and processing polyclonal antisera are Icnown in the art. An effective immunization 
protocol for rabbits can be found in Vaitukaitis etal, (1971), which disclosure is hereby 
incorporated by reference in its entirety. 

5 Booster injections can be given at regular intervals, and antiserum harvested when antibody 

titer thereof, as determined semi-quantitatively, for example, by double immunodiffusion in agar 
against known concentrations of the antigen, begins to fall. See, for example, Ouchterlony et aL^ 
(1973), which disclosure is hereby incorporated by reference in its entirety. Plateau concentration 
of antibody is usually in the range of 0.1 to 0.2 mg/ml of serum (about 12 uM). Affmity of the 

10 antisera for the antigen is determined by preparing competitive binding curves, as described, for 
example, by Fisher (1980), which disclosure is hereby incorporated by reference in its entirety. 

Antibody preparations prepared according to either the monoclonal or the polyclonal 
protocol are useful in quantitative immunoassays which determine concentrations of antigen- 
bearing substances in biological samples; they are also used semi-quantitatively or qualitatively to 

1 5 identify the presence of antigen in a biological sample. The antibodies may also be used in 

therapeutic compositions for killing cells expressing the protein or reducing the levels of the protein 
in the body. 
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13256 


NUC153 


PRT322 


NUC445 


165123 


13256 


NUC154 


PRT323 


NUC445 


165643 


13256 






NUC445 


1 42964; 1 502 1 4;223536;245008 


13267 


NUC155 


PRT324 


NUC446 


488818 


13285 






NUC694 


193487;238191;248387 


26638 






NUC695 


645819 


26710 






NUC696 


600909;608784;61 1758;614721; 

625933;625983;626308;627299; 
62748 1 ;628 1 47;628753 .63 1 655; 
633039;63337 1 ;633760;634553; 
642966 


26726 






NUC697 


421115 


26786 






haJC698 


500702480 


26982 






NUC699 


638872 


27084 


NUC156 


PRT325 


NUC447 


242080 


27084 






NUC447 


128161;186671;210505;211578; 
2 1 4909;22 1 663 ;222 1 0 1 ;223000; 
224361 ;226849;242326;242424; 
243662;24491 3.24791 2 



242 
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27273 






NUC700 


525674;60!556 


27301 






NUC701 


135037 


27336 






NUC702 


500742735 


27361 






NUC703 


205346;530902 


27374 






NUC704 


643006 








NIJC705 

1^ \J / V™/ 


129706;223196 


97/CO7 

z i\)y I 








500701900 


97X77 








99497 


984 n 

^OH 1 J 








135042 








TMi ir7nQ 


150011*201848 


ZOJ 10 








50072 1 700- 500729093 -5007301 
52 


29120 


NUC157 


PRT326 


NUC448 


488444 


29469 






NUC711 


638852 


29472 






NUC712 


637812 


29557 






NUC713 


188208 


29673 






NUC714 


650606 


29814 






NUC715 


813496 


30218 






NUC716 


241681;242553;589203 


30446 






NUC717 


105288 


30477 






NUC718 


500724995;500758517 


30583 






NUC719 


117932;194613;2250I3;331614 


30719 






NUC720 


637363 


31356 






NUC721 


106998 


31422 


NUC158 


PRT327 


NUC449 


500732587 


31554 






NUC722 


222161 


31627 






NUC723 


173050 


31726 






NUC724 


393750 


31744 






NUC725 


176380 


31790 






NUC726 


625728 


32102 






NUC727 


500762549 


32473 


NUC159 


PRT328 


NUC450 


183902 


32475 


NUCI60 


PRT329 


NUC451 


635993 


32962 






NUC728 


500740719 


33130 






NUC729 


165852; 165888 


33712 


NUC161 


PRT330 


NUC452 


398703 


35005 






NUC730 


124493 


35185 


NUC83 


PRT252 


NUC403 


589785 


35258 






NUC731 


224898 


35326 






NUC732 


145027 


35597 






NUC733 


237630 


35912 






NUC734 


637431 


35984 






NUC735 


117238 


36122 






NUC736 


226039 


37337 






NUC737 


143508;196052,221995 


38112 






NUC738 


129444 


38220 






NUC739 


608709 


3831 1 






NUC740 


600921 


38631 






NUC741 


194909 


38749 






NUC742 


163588 


38890 


NUC162 


PRT331 


NUC453 


500742815 


38890 


NUC163 


PRT332 


NUC453 


500735594 


38890 


NUC164 


PRT333 


NUC453 


500737569 


38890 


NUC165 


PRT334 


NUC453 


500730242 


38890 


NUC166 


PRT335 


NUC453 


500766374 


38890 


NUC167 


PRT336 


NUC453 


500711885 


40163 






NUC743 


244540 



243 



wo 02/083898 PCT/IBO 1/009 14 



40975 






NUC744 


1 A<AA 1 
1 UOUO 1 


4099 1 






XTI ir^iA c 


lUoUOl 


42896 






lNUC/40 


1 171 1 A 
1 J / 1 lU 


43190 






XII li^lAf 

NUL747 




44053 






XII 1/^1 AO 


486460;509697;519210;528872; 
585226;589902;601550 


45091 






NUC749 


238559 


45179 






NUC750 


132269 


45274 


NUC84 


PRT253 


NUC404 


1000867870 


46679 


NUC85 


PRT254 


NUC405 


140265 


47171 






NUC751 


420959 


48024 


NUC168 


PRT337 


NUC454 


113448 


48548 






NUC752 


227400 


48603 






NUC753 


200687;244886 


48670 






NUC754 


164887 


48671 






NUC755 


221136 


48823 






NUC756 


525888 


48901 






NUC757 


525775 


49018 






NUC758 


229481 


49034 






NUC759 


237358 


49133 






NUC760 


224706;582974 


49140 






NUC761 


636146 


49261 






NUC762 


186091 


49387 






NUC763 


212526 


49416 






NUC764 


530211 


49426 






NUC765 


313I50;313151 


49493 






NUC766 


213393 


49640 






NUC767 


626919 


49863 






NUC768 


I81361;382057;631692 


49871 






NUC769 


181361;382057;631692 


50015 






NUC770 


203070;331013 


50049 






NUC771 


196104 


50112 






NUC772 


118123:335719 


50185 






NUC773 


489405:496185 


50241 






NUC774 


145339; 156 186:244350 


50353 






NUC775 


119282 


50763 






NUC776 


500701668 


50982 






NUC777 


637372 


51130 






NUC778 


180402 


51212 






NUC779 


238923 


51346 






NUC780 


646118 


51380 


NUC169 


PRT338 


NUC455 


523002 


51400 






NUC781 


231102 


51796 






NUC782 


210281 


51954 






NUC783 


141971;183117;205855;50224l 


52076 






NUC784 


625028 



TABLE II 



SEO ID NO. 
in orioritv 
aoDlication 


Chromosomal location 


51 


3q26.2 
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SEO ID NO. 
in Driorltv 
apDitcation 


Chromosomal location 




Zq J 1 


AQ1 


1 ipi i.z 


J 05 


OA— lO 


<'71 




/yo 




1 1 C 1 

i 1 5 1 


00*1 1 1 0 
zzq 1 1 .2 


1590 


V 

A 




A 


moo 
2932 


op23 




Zl 


3326 


15 


3804 


I6pl3.3 


4172 


1, 15ql3-ql4 


4340 


1 


4609 


4pl3 


4647 


t 1 *.oo 1 1 ^01 1 

I lq23, 1 lq23.3 


4694 


12pl3.2 


4733 


1 A 1 

14q32.1 


4855 


1 — 00 1 —Oil 

Iq23-lq24 


4908 


1 o«» 1 1 
12pl3 


501 1 


7q32, 7q32-36 


CA>I A 

5040 


Qm07 

op23 


C AO A 

5089 


A^t A 

4ql 1 


5167 


1 0 
12 


5278 


1 A^ 100 

i9ql3.2 


5563 


1 0 
1,2 


5947 


1 1 ^OT 

1 lq23 


5974 


4q28, 4q28*q3 1 


^A 1 1 

601 1 


A 

4 


6290 


0-» 1 0 

2pl3 


6322 


o» 1 

2pl3 


6345 


16q24.3 


^ il AA 

6400 


IAmOC OiC 

I0q25*26 


6892 


A 

4 


/oy / 


zpl 1 , 4qzo 


8166 


1 /I^IO 

14q32 


8666 


1 £ 
lo 


o67l 


to 


9/55 


3 


1UU44 


I2q, lo 




iqi 1.1, iqzi 


lU5o4 


3p213 


t/\'7AA 


0 


1 1 A1 O 

1 1U19 


1 1 _ 1 1 0 00— 1 1 
1 ipi 1.2, 22qi3 


1 1342 


"7^01 

7C121 


1 mm 
12907 


0_'>iC 1 

7q36.1 




1 < 
1 J 


13285 


2q23-24, llq23.2-24.2 


27697 


12pll.2 


28517 


2q31 


28518 


2q31 


29673 


16pl3.3 


29814 


11 



245 



i 



wo 02/083898 PCT/IBO 1/009 14 



SEO ID NO 


^hrnmnconnfll locflfinn 


in orioritY 




aDDlication 




31422 


llql3 


31627 


11 


32962 


9q32-33 


33712 


3p21.3,3p21.31 


35185 


2 


38631 


15 


48823 


13q34-qter, 17 


49018 


7q22 


49133 


lq32 


49387 


17q21 


50015 


5q31 


51380 


Xq28 



TABLE HI 



SEO ID NO in 


Ti<<iii^ Distribution 


37 


1:1 1 


51 


A:55 B:4 C:l E:l F:30 G:19 H:9 


179 


F:16 K:4 


180 


F:5 K:l 


183 


H:6 


201 


C:l 


326 


1:3 


362 


K:2 


440 


A:7 


452 


F:l G:5 


483 


G:4 


500 


1:1 


505 


A:2 


528 


K:2 


573 


F:3K:1 


574 


A:l 


587 


H:l 1:3 


588 


F:6 K:2 


593 


A:2C:15F:2 G:14 


599 


A:8 F:7 K:3 


603 


K:l 


621 


C:l 


628 


F:l K:3 


653 


B:6 F:3 G:l 1:1 K:7 


670 


K:ll 


678 


F:l K:2 


693 


B:7 G:7 


703 


F:9 K:4 


746 


A:9H:l K:2 


770 


G:4 


775 


F:l K:13 


796 


G:l 


812 


A:13F:1 H:! 


940 


F:8 


988 


F:8K:1 


996 


K:4 
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SEO ID NO. in 


Tissue Distribution 


1036 


A:13 1:11 


1064 


A:12 B:130 C:16 D:7 F:l G:I6 


1 1 C 1 

1 151 


A:9 


1 190 


B:4 C:l H:7 1:6 


1458 


J:55 


1590 


A: I 


1853 


A:l G:l H:l 


1904 


H:l 


2028 


A:8 


2173 


8:1 C:3 D:l G:l 


2368 


K:4 


2553 


8:7 D:5 H:39 


2556 


8:7 D:5 H:39 


2658 


D:l . 


2690 


K:2 


2755 


H:l 


2800 


F:8 K:6 


2843 


A:8G:1K:3 


2852 


F:l 


2932 


D:l 


2955 


K:l 


3078 


H:2 


3280 


F:4 K:3 


3326 


H:2 


3387 


H:7 


3439 


1:5 


3501 


A:10C:1 


3633 


A:6 H:3 


3678 


K:2 


3714 


F:4 K:l 


3796 


F:l K:l 


3801 


K:2 


3804 


8:1 C:10D:1 0:21:11 


3892 


A:9 


3985 


8:1 C:5 D:l H:l J:6 


4005 


F:6 1:2 K:2 


4063 


A:2 8:10 C:4 


4088 


K:6 


4111 


A:ll 


4126 


F:3 K:l 


4172 


F:3 K:2 


4261 


H:2 


4340 


A:3 G:3 1:1 


4436 


F:3 


4609 


D:10 


4647 


D:3 


4660 


0:1 


4664 


8:48 C:2 H:2 1:3 


4678 


C:5 0:17 G:4 


4682 


G:l 


4687 


0:5 F:2 H:l 


4690 


D:3 F:l 1:1 K:i 


4694 


1:3 


4696 


1:1 


4733 


0:1 


4807 


8:11 H:] 


4809 


1:1 


4830 


K:l 
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SEO ID NO. in 


Tissue Distribution 


4855 


H:l 


4900 


All 


4908 


A:16 


4943 


F:6 K:2 


4947 


K:3 


4976 


J:2 


5000 


A:2 B:14C:17D:5 F:l G:9 H:3 1:3 


5002 


B:2 


5005 


C:31 F:l J:2 K:2 


5011 


H:5 1:2 


5040 


8:1 C:2 D:ll G:l J:l 


5058 


K:l 


5071 


F:l K:I 


5089 


1:1 


5117 


H:7 


5141 


F:2 K:l 


5162 


A:l 8:8 


5167 


1:1 


5178 


G:l 


5192 


F:14 K:3 


5214 


A lO 8-19 C*5 


5230 


A-3 8-6 C*9 D'l F*l G-4 H-3 V2 


5240 


A:l 


5250 


A:3 B:39 C:29 D:3 H:3 1:1 K:l 


5262 


A*3 B'39C'34 D*3 H*3 M K:I 


5270 


8:8 


5278 


D:l 


5358 


G:l 


5453 


A:2 C:6 


5494 


A:l 0:1 


5499 


A:3 


5533 


A:2 8:7 C:l 


5563 


D:l F:l 


5609 


G:3 


5657 


G:l 


5691 


K:l 


5748 


A:3 8:1 C:4 D:l G:30H:1 


5806 


A:2 8:7 C:l F:l 1:3 K:3 


5816 


B:5 


5824 


H:3 


5861 


8:16 C:27 D:13 F:7 H:21 1:12 K:4 


5885 


A:l 


5913 


K:l 


5947 


D:2 


5966 


G:ll 


5970 


C:4 


5974 


D:12 


5983 


A:5 


5985 


8:2 C:5 H:l 1:1 K:2 


6011 


8:1 


6080 


8:1 1:1 


6081 


1:9 


6108 


F:5 


6159 


8:1 


6231 


A:l 


6238 


A:15B:5G:14H:3 1:2 


6252 


G:7 


6283 


F:l K:l 
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PCiV/ iLf In 


1 issue LFisiriuuilun 




R**) 




P'l 

V/. 1 




Al 
M. 1 




r . J 1 rv. i J 


WJ*T J 


A'O n* 1 




A-1 R-9 




R-0 H'1 
D,y n. 1 




R-l C'} 




1^-1 

Ix. 1 


OH 1 O 




UH J 1 


A-1 R'1 K'\ 




r . 1 


66'^6 


/\.o 0. J / x^. 1 1 u. 1 r.i w.t n.i j.i 




F*l 1^*9 


UOOO 


A'l R'49 n*l n*! T'^ I^'*? 


o /z / 


P*i6 n*^ V'lO'^ 




N.. 1 


OoOD 


A •! f • 1 




R'9'IP'I H'9 1*1 M9 




0. 1 


7041 
/Uf 1 






vJ.Z 




A'^ 


7^77 


p* 1 


7A07 


Pk'lO P*l 


77 17 


P'l V«l 

r.j K.J 


0UU7 


R*7 r«»o 14' 1 
D.Z S^.Z ti. 1 


fin75t 
oU fo 


c.< won 






Sti AA 
0 100 


A '7 ri'S r:*<9 


fl7/%0 


rl.l 




A'l P'9 H'l H»9 M 

A. 1 r.z ij*j ri.z 1. 1 




P*l V'A 


oDOO 


DM H'l r:'i6 1^*1 

D,H LI. 1 w.lO iV. 1 


00 / 1 


n*i If '9 




14*1 
n. 1 


8Q68 
07O0 


A*i9 r;*'^ H' 1 


07y*# 




0907 


n* 1 

U. I 


y^i. 1 


if • 1 




If -4 


7**UO 


n. in n>l M*')0 
D. lU LI. J n.j7 


0407 


D. lU LI. J n. J7 


0668 


P*0 If '41 


0670 


R'1 P*1 lf'1 
0,1 r . 1 IX. 1 




If 


0868 

70U0 


P-0 lf'1 
r .7 rv. I 




A'10 R-10 D' 14 F'9 fi'94 H'S If '4 




D.l p.7 l/.l 
D. 1 r.z Pw.J 


10S96 


R'l l-i*'^ 
D.J ri.j 


10^84 


A -7 


10650 


K:3 


10739 


A:l B:l G:3 J:I 


10743 


C:l J:l 


10744 


A:6B:3 G:2 1:3 


10761 


H:l 


10880 


D:l G:l 


10942 


A:30 
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SEO ID NO. in 


Tissue Distribution 


I 1019 


r:4 


1 1278 


!• 1 

1:1 


1 1 1/1'^ 
i 1342 


r:i K.:2 


1 1 j02 


r:l N.:4 


1 f £.00 


r:2 K.:12 


1 1 11 Q 


A:2j d:j7 C7 u.\ r. 1 0, 1 K: 1 




A:d 




o:3 


12043 


A:o 


12046 


r:l H:l 


12098 


A:4 0:8 H:2 1:1 


12202 


K:2 


12220 


C:2 


12243 


A:l B:132 C:38 L):31 r:6 0:ll 


12263 


H:4 


122/0 


A: 18 


1 'tvl AA 

12490 


A. A B.I 1 /^.O/^.ii 

A:9 B:l 1 C:8 G:4 


12604 


A:6 B:14 C:22 D:l F:l G:4 H:4 1:1 


12657 


8:1 C:6J:2 


12788 


K:2 


12901 


G:2 


12907 


1:6 


13013 


D:5 


13202 


A:2 8:14 C:9 D:l 0:3 H:3 J:3 K:l 


13229 


A:3 


13256 


F:2 K:5 


13267 


1:8 


13285 


8:1 C:42 D:5 1:1 J: 10 


26638 


D:2 


26710 


A:90 8:48 G:27 


26726 


C:4 


26786 


A:24 


26982 


A.I . 1 

A:l G:l 


27084 


F:16 K:83 


27273 


C:3 


2 /iUl 


A.I TS.I U.l 

A:3 B:3 H:2 


27336 


B:l G:l 


27361 


F :2 


27374 


13:1 u:l 


27627 


F:l K:l 


27697 


A.I 1 

A:l C:l 


27877 


U. 1 1. 1 A 

H:l 1:19 


2o4 1 J 


H:l 


2o5 17 


K:2 


2o3 lo 


G:3 


OAI '^A 

29120 


1:1 


29469 


A:2 G:2 


29472 


A . 1 

A:l 


OAC 

29557 


C:l 


29673 


D:l 


AO I A 

29814 


A:l 


1 0 


r./ JS.A 


30446 


B:l C:l H:10 


30477 


G:7 


30583 


K:5 


30719 


A: I 8:9 1:3 


31356 


H:2 


31422 


G:l 
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SEO ID NO. in 


Tissue Distribution 


31554 


K:l 


31627 


C:l 


31726 


C:l 


31744 


F:l 


31790 


A:25 


32102 


G:l 


32473 


B:l 


32475 


B:l 


32962 


G:l 


33130 


K:3 


33712 


B:1 C:] 


35005 


A:l 


35185 


C:l 


35258 


K:l 


35326 


3:1 F:3 0:1 H:l 1:J K:3 


35597 


F:l K:l 


35912 


A:l 


35984 


F:l 


36122 


K:l 


37337 


F:l K:2 


38112 


K:l 


38220 


A:] 


3831 1 


A:l 


38631 


K:l 


38749 


H:l 


38890 


0:21 


40163 


K:2 


40975 


B:8D:1 H:39 


40991 


B:8 DA H:39 


42896 


H:l 


43190 


K:l 


44053 


C:ll D:l H:l 1:2 J:5 


45091 


J:l 


45179 


H:l 


45274 


C:4 H:2 


46679 


D:3 F:l H:l 


47171 


C:l 


48024 


B:4C:5 


48548 


F:l 


48603 


F:41K:19 


48670 


K:l 


48671 


K:2 


48823 


C:65 


48901 


B:l 0:2 1:2 


49018 


B:2 C:2 F:2 J:2 


49034 


D:l F:2 H:l 1:3 K:2 


49133 


F:4 K:2 


49140 


A:l C:l 


49261 


F:3 K:23 


49387 


F:2 K:l 


49416 


B:18C:1 


49426 


J:51 


49493 


C:l F:l 1:6 


49640 


A:5 F:l 


49863 


A:6C:2I:1 


49871 


A:6C:21:1 


50015 


K:2 


50049 


K:2 



251 



wo 02/083898 



PCT/IBOl/00914 



SEO ID NO. in 


Tissue Distribution 


50112 


K:3 


50185 


1:6 


50241 


F:3 H:l 


50353 


F:l K:l 


50763 


A:14 


50982 


A:2 


51130 


K:2 


51212 


J:2 


51346 


A:2 B:l D:! 


51380 


B:l F:l 


51400 


F:3 


51796 


F:l K:2 


51954 


B:13C:1 D:l F:2 K:l 


52076 


A:2 


TABLE IV 



SEQ ID NO. in 

priority 

application 


Tissue source 


37 


adenocarcinoina(2), carcinoid(2), testis(I), tonsil(l) 


51 


adipose tissue, white(4), cerebellum(3), cochlea(l), colon tumor rer+(2), dorsal root 
ganglion(I), hippocampus(l), kidney(l), liver(l), malignant melanoma, metastatic to lymph 
node(l)» muscle(2), normal leg muscle(l), parathyroid tumor(l)» pectoral muscle (after 
mastectomyX 1 X placenta( 1 ), substantia nigraC 1 ), total brain( 1 ) 


201 


pancreas(l) 


326 


ovarian tumor(l), uterus(t) 


440 


brain cortex(lX carcinoid tumor(l) 


483 


pbl(l), adenocarcinoma(l), astrocytoma(l), ovarian tumor(l), schizophrenic brain s-U 
frontal lobe(l) 


500 


colon(4), colon tumor rer+(l), pooled germ cell tumors(l) 


505 


total brain(l) 


528 


brain(2). cerebellum(l), colon(l), ovarian tumor(6) 


573 


melanoma (mewo cell line)(I) 


587 


germinal center b cell(l), lymphoma(l), parathyroid tumor(l) 


593 


ovarian tumor( 1 ), placenta( 1 ) 


621 


p]acenta.(2) 


653 


2 pooled tumors (clear cell type)(2), anaplastic oligodendroglioma(2), glioblastoma 
(pooled)(2) 


678 


ovarian tumor(l), prostate(l) 


693 


brain(l), placenta(l) 


703 


small ceil carcinoma(2) 


746 


small cell carcinoma(l) 


770 


colon(l), frontal lobe(1), human pancreatic islets(l), normal leg musc!e(l), ovarian tumor(!), 
pancreatic islet(2), senescent fibroblastd), total brain(l) 


775 


anaplastic oligodendroglioma(1), frontal lobe(l) 


796 


adrenal adenoma(l), adrenal gland(l), breast tumor(3), placenta(2) 


812 


heart(l), brain(l), frontal lobe(3), neuroepithelial cells(l), retina(l), small cell carcinoma(lX 
total brain(l) 


988 


germinal center b cell(l} 


1064 


2 pooled tumors (clear cell type)(l), e wing's sarcoma(2}, adenocarcinoma(5), anaplastic 
oligodendroglioma(5), breast tumor(l), carcinoid(4), colon(3), colon tumor(l), colon tumor 
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rer+(2), frontal lobe(4), germinal center b cel!(7), kidney tumor(l), lung tumor(l), metastatic 
prostate bone lesion(2), ovarian tumor(6), parathyroid tumor(7), pectoral muscle (after 
mastectomy)(12), placenta(l), pooled germ cell tumors(4), senescent fibroblast{2), squamous 
cell carcinoma from base of tongue(l), tumor, 5 pooled (see description)(l) 


1151 


anaplastic oligodendroglioma(4), carcinoid(l), colon tumor rer+(l), medulloblastoma(l), 
normal prostate(l), parathyroid tumor(1), tumor(l} 


1190 


colon(l) 


1853 


2 pooled high-grade transitional cell tumors(l)» 2 pooled tumors (clear cell type)(]) 


2173 


2 pooled tumors (clear cell typeX2), b-cell, chronic lymphotic leukemia(2), cd34+, cd38- 
from normal bone marrow donor(6), adenocarcinoma(l), alveolar rhabdomyosarcoma(l), 
anaplastic oligodendroglioma(3), breast(l), carcinoid(l), cerebellum(2), cochlea(3), 
colon(ll), colon tumor rer+(2), dorsal root ganglion(l), early stage papillary serous 
carcinoma(]), epithelium (cell line)(t), follicular lymphoma(lX frontal lobe(15), germinal 
center b-cells(3), invasive adenocarcinoma(5), kidney(l), kidney tumor(2), larynx(l), 
liver(l), lung carcinoma( 1 ), lymphoma(l), meningioma(l), moderately differentiated 
adenocarcinoma(2), moderately-differentiated adenocarcinoma( 1 ), muscle( 1 ), normal 
prostate(6), normal prostatic epithelial cells(l), oligodendroglioma(3), ovarian tumor(4), 
papillary serous carcinoma(l), pectoral muscle (after mastectomy)(23), pooled germ cell 
tumors(2), prostate(l), stem cell 34+/38-K2), thyroid(l), tumor, 5 pooled (see description)(4), 
two pooled squamous cell carcinomas(2) 


2553 


brain(2), tumor(l) 


2556 


brain(2), tumor(l) 


2755 


frontal lobe(l) 


2843 


b-cell, chronic lymphotic leukemia(2), anaplastic oligodendroglioma(5), carcinoid(l), 
placenta( 1 ) 


2852 


frontal lobe(3) 


2932 


bone marrow(14), brain(l), hematopoietic from ami patient(l), liver(2), normal cortical 
stroma(l) 


3078 


frontal lobe(2) 


3280 


b-cell, chronic lymphotic leukemia(l), germinal center b cell(l), leiomyosarcoma(l), pooled 
germ cell tumors(l) 


3326 


muscle(3), normal leg muscle(l), parathyroid tumor(l) 


3387 


carcinoid(I), pooled germ cell tumors(4) 


3439 


2 pooled tumors (clear cell type)(l), b-cell, chronic lymphotic leukemia(3), ewing's 
sarcoma(l), anaplastic oligodendroglioma(l), colon(2), glioblastoma (pooIed)(l), mantle cell 
lymphoma(l), parathyroid tumor(l), senescent fibroblast(l), squamous cell carcinoma(l) 


3501 


anaplastic oligodendroglioma(l). breast(l), carcinoid(2), colon(l), colon tumor rer+(l), 
epithelium (cell Iine)(l), kidney tumor(l), ovarian tumor(l), parathyroid tumor(2), pooled 
germ cell tumors(4), senescent fibroblast(l), squamous cell carcinoma(l), testis(l), tumor(2) 


3633 


cerebellum(l), muscle(l), retina(4), small eel! carcinoma(l), total brain(3) 


3714 


2 pooled tumors (clear cell type)(l), colon(l) 


3804 


heart(l), anaplastic oligodendroglioma(l), carcinoid(l), colon tumor(2), germinal center b 
cell(2), kidney tumor(l), lung tumor(3), lymphoid(2), moderately differentiated 
adenocarcinoma(l), muscle(l), ovarian tumor(3), pectoral muscle (after mastectomy )( 1 ), 
squamous cell carcinoma(3) 




DlOOa^l )f lOiai vTaln{Z) 


3985 


anaplastic oligodendroglioma(lX breast(l) 


4005 


2 pooled tumors (clear cell type)(l), brain(2), germinal center b cell(l), muscle(l) 


4063 


cerebral cortex(]), brain(l), carcinoid(2), cerebellum(l), schizophrenic brain s-U frontal 
lobe(1), senescent fibroblast( 1 ), total brain(2) 


4111 


anaplastic oligodendroglioma(l), cerebellum(l), colon(l) 


4172 


retina(3) 


4340 


heart(l), blood(l), bram(2), colon(6), frontal lobe(9), mvasive tumor (cell lme)(l), 
melanocyte(l), neuroepithelial cetls(l), senescent fibroblast(l), small cell carcinoma(2), total 

brain(3) 


4436 


b-cell, chronic lymphotic leukemia(l), bulk tumor(l), colon(24), early stage papillary serous 
carcinoma(3), lung carcinoma(l), pancreatic cancer(l), pooled germ cell tumors(l) 


4609 


blood(l) 


4647 


colon(l), Iiver(l) • 
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4660 


adenocarcinoma(l) 


4664 


2 pooled tumors (clear cell type)(5), adenocarcinoma(3), brain(l), breast(l), colon(4), colon 
tumor rer+(l), frontal lobe(5), liver(l), neuroepithelial cells(l), normal prostate(l), ovarian 
tumor(l), ovary(l), total brain(l), tumor(l) 


4678 


colon tumor, rer+(2) 


4682 


2 pooled tumors (clear cell type)(4), anaplastic oligodendroglioma(2), breast(3), carcmoid(lX 
glioblastoma (pooledXl), pooled germ cell tumors(l) 


4687 


2 pooled tumors (clear cell type)(l), carcinoid(3), colon(3)» normal prostate(l), pooled germ 
cell tumors(l) 


4690 


carcinoid(l), colon(3), normal prostate(lX pooled gemi cell tumor$(l) 


4694 


colon tumor, rer+(l) 


4733 


colon(3), liver(14), pancreatic i$let(l) 


4807 


2 pooled tumors (clear cell type)(l), alveolar rhabdomyosarcoma(l), carcinoid{3), co]on(2), 
normal prostate(2X normal prostatic epithelial cells(l), ovarian tumor(23), prostate(2), serous 
adenocarcinoma(l), total brain(2), tumor(I), tumor, 5 pooled (see description)(4) 


4809 


2 pooled tumors (clear cell type)(l), alveolar rhabdomyosarcoma(l), carcinoid(3), colon(2), 
normal prostate(2), normal prostatic epithelial cells(lX ovarian tumor(28), prostate(2)i serous 
adenocaFcinoma(l), total brain(2), tumor(lX tumor, 5 pooled (see description)(5) 


4855 


b-cell, chronic lymphotic leukemia(l), carcinoid(l) 


4900 


2 pooled tumors (clear cell type)(l), b-cell, chronic lymphotic leukemia(l), colon(l), liver(l), 
low-grade prostatic neoplasia(l), normal prostate(l), pooled germ cell tumors(l) 


4908 


alveolar rhabdomyosarcoma(l), colon(l), hemopoietic system(l), liver(l}, malignant ascitic 
efFusion( 1 ), moderately-differentiated adenocarcinoma( 1 ), neuroepithelial cells(] ), 
parathyroid tumor(l), synovial membrane(t), uterus(l) 


4947 


parathyroid tumor(l), testis(2) 


5000 


2 pooled tumors (clear cell type)(3), b-cell, chronic lymphotic leukemia(2), anaplastic 
oligodendroglioma(5), breast(4), carcinoid(2X colon tumor, rer+(l), germ cell tumor(l), 
germinal center b cell(6), invasive prostate tumor(l), lung tumor(l), metastatic prostate bone 
lesion(2), normal prostate(l), parathyroid tumor(2), pectoral muscle (after mastectomy)(3), 
pooled germ cell tumors(l), senescent fibroblast(4), stroma(l), thyroid(l), tumor, 5 pooled 
(see descrlption)(2) 


5002 


pectoral muscle (after mastectomy )( 1 ), senescent fibroblast( 1 ) 


5005 


anaplastic oligodendroglioma(l), germinal center b cell(l), lobullar carcinoma in situ(l), 
lymphoma(l), oligodendroglioma(l), pooled germ cell tumors(8) 


5011 


breast cancer(l), normal prostate(5), seminal vesicles(I) 


5040 


bone marrow(18), brain(l)» hematopoietic from ami patient()), liver(2), normal cortical 
stroma(l) 


5089 


parotid gland(l) 


5117 


adenocarcinoma(2), breast(2), carcinoid(l), colon(3X colon tumor rer+(l), epithelium (cell 
line)(3), moderately differentiated adenocarcinoma(l ), moderately-differentiated 
adenocarcinoma(2), normal prostate(3), normal prostatic epithelial cells(2), placenta(l), 
prostate(3), small cell carcinoma(l), squamous cell carcinoma(2), squamous cell carcinoma 
from base of tongue(l) 


5162 


hippocampus(l), schizophrenic brain s-1 ] frontal lobe(l) 


5167 


colon(3), colon tumor rer+(2), pooled germ cell tumors(l) 


5214 


kidney(l), ovarian tumor(l), pituitar gland(l), total brain(2) 


5230 


2 pooled tumors (clear cell type)(5), placenta(l), breast(l), breast tumor(l), carcinoid(2), 
colon(l), colon carcinoma(l), colon mucosa(l), colon tumor rer+(4), germinal center b 
celi(l), kidney tumor(l), normal prostate(5), papillary serous carcinoma(l), parathyroid 
tumor(3), pectoral muscle (after mastectomy)(3), pooled germ cell tumors(2), senescent 
ribroblast(l), squamous cell carcinoma from base of tongue(l) 


5240 


bone(l), colon(2), frontal lobe(2), glioblastoma (pooled)(l), liver(l), ovarian tumor(l), 
parathyroid tumor(l), tumor, 5 pooled (see description)(l) 


5250 


2 pooled tumors (clear cell type)(l), anaplastic oligodendroglioma(2), brain(5), carcinoid(l), 
colon(2), colon tumor rer-»-(I), pooled genm cell tumors(l) 


5262 


2 pooled tumors (clear cell type)(l), anaplastic oligodendroglioma(2), brain(5X carcinoid(l), 
colon(2), colon tumor reri-(l), pooled germ cell tumors(l) 


5270 


ovarian tumor(l) 


5278 


2 pooled tumors (clear cell type)(7), anaplastic oligodendroglioma(7), breast(lX breast 
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tumor(4), carcinoid(3), colon(l), colon tumor rer+(2), glioblastoma (pooled)(]}, liver(3), 
pooled germ cell tumors(6), thyroid(2) 


5358 


2 pooled tumors (clear cell type)(l), anaplastic oligodendroglioma(2), brain(2), colon(2), 
germinal center b cell(4), melanocyte(I), normal prostate(3), parathyroid tumor(2), pectoral 
muscle (after mastectomy)(3), pooled germ cell tumors(4), senescent fibroblast(2) 


5494 


2 pooled tumors (clear cell type)(2), cd34+, cd38- from normal bone marrow donor(2), 
anaplastic oligodendroglioma(2), brain(2), colon(2), colon tumor rer+(l), early stage 
papillary serous carcinoma(l), germinal center b cell(3), glioblastoma (pooled)(3), 
moderately-differentiated adenocarcinoma(l), normal prostate(l)» normal prostatic epithelial 
celts(l), omentum(]), ovarian tumor(4), ovary(1), parathyroid tumor(3), pectoral muscle 
(after mastectomy)(]), pooled germ cell tumors(l), senescent ribroblast(2), stem cell 
34+/38+(l), synovial membrane(l), synovial sarcoma(l), tumor(l) 


5499 


2 pooled tumors (clear cell type)(l), brain(2X pancreatic islet(2) 


5533 


2 pooled tumors (clear cell type)(2), anaplastic oligodendroglioma(3X breast(l), carcinoid(l), 
germinal center b cell(2), glioblastoma (pooled)(2), pooled germ cell tumors(2}, senescent 
flbroblast(l), small cell carcinoma(2) 


5563 


colon(3), kidney(l), liver(l), neuroepithelial cells(l), normal prostatic epithelial cells(l), 
ovarian tumor(l), senescent fibroblast(l), total brain{l) 


5691 


2 pooled high-grade transitional cell tumors(l), 2 pooled tumors (clear cell type)(3), 
adenocarcinoma(3), anaplastic oligodendrog]ioma(2), carcinoid(3), colon(2), germinal center 
b cell(l), glioblastoma (pooled)(3), medulloblastoma(l), ovarian tumor(l), parathyroid 
tumor(2), pectoral muscle (after mastectomy )(2)» prostate(l)» three pooled meningiomas(l), 
total brain(l) 


5748 


bone(2), anaplastic oligodendrogIioma(5), breast(l), carcinoid(l), colon tumor rer+(l), 
frontal lobe(2), germinal center b cell(l), glioblastoma (pooled)(l), ovarian tumor(4), 
parathyroid tumor(2), pooled germ cell tumors(l), senescent fibroblast(l), tumor, 5 pooled 
(see description)(l) 


5806 


2 pooled tumors (clear cell type)(4), female, 19 years old, normal leg muscle(2), alveolar 
rhabdomyosarcoma(l), anaplastic oligodendroglioma(4), breast(l), carcinoid(3), germinal 
center b cell(l), glioblastoma (pooled)(2), normal prostate(l) 


5816 


anaplastic oligodendroglioma(l), frontal lobe(4) 


5824 


2 pooled tumors (clear cell type)(2), b-cell, chronic lymphotic leukemia(2), 
adenocarcinoma(l), anaplastic oligodendroglioma(3), blood(l), breast(l), carcinoid(9), 
cerebellum(l), colon(3), fibrotheoma( 1 ), follicular lymphoma(l), germinal center b cell(2), 
glioblastoma (pooled)(2), kidney tumor(l), low-grade prostatic neoplasia(2), normal prostatic 
epithelial cells(l), ovarian tumor(2), parathyroid tumor(7), pooled germ cell tumors(l), 
senescent fibrobIast(6), thyroid(l) 


5861 


adipose tissue, white(2)» bone marrow from femur(l), carcinoid(l), epithelium(I), normal 
prostatic epithelial cells(l), pectoral muscle (after mastectomy )( 1 ) 


5885 


2 pooled tumors (clear cell type)(l), female, 19 years old, normal leg muscle(l), anaplastic 
oligodendroglioma(5), bone marrow stroma(l), colon tumor rer-i-(l), germinal center b 
cell(5), glioblastoma (pooled)(l), kidney tumor(l), melanocyte(2), moderately differentiated 
adenocarcmoma(l), moderately-differentiated adenocarcinoma(lX normal prostate(I), 
normal prostatic epithelial cells(2), ovarian tumor(2X parathyroid tumor(l), senescent 
fibroblast(l), three pooled menmgiomas(]), tumoral ) 


5947 


2 pooled tumors (clear cell type)(4), adenocarcinoma(l), colon tumor(3), intestine(l), liver(l) 


5970 


b-cell, chronic lymphotic leukemia(l), carcinoid(l), germinal center b celi(2), pooled germ 
cell tumors(3) 


5974 


blood(5), bone marrow(l), liver(4), reticulocyte(l) 


5983 


2 pooled tumors (clear cell type)(l), adenocarcinoma(l), carcinoid(l), germinal center b 
cell(4), medulloblastoma(l), pooled germ cell lumors(l), tumor, 5 pooled (see description)(l) 


5985 


2 pooled tumors (clear cell type)(3), b-cell, chronic lymphotic leukemia(2), anaplastic 
oligodendroglioma(5), brea$t(4X carcinoid(2), colon tumor, reM-(l), germ cell tumor(l), 
germinal center b cell(6), invasive prostate tumor(l), lung tumor(l), metastatic prostate bone 
lesion(2), normal prostate(l), parathyroid tumor(2), pectoral muscle (after mastectomy )(3), 
pooled germ cell tumors(l), senescent fibroblast(4), stroma(l), thyroid(l), tumor, 5 pooled 
(see description)(2) 


6011 


brain(5), carcinoid(l), frontal lobe(l). lungcarcinoma(l), retina(l) 


6080 


2 pooled tumors (clear cell type)(8), ewing's sarcoma(2), heart(lX adenocarcinoma(]), 
alveolar rhabdomyosarcoma(6), anaplastic oligodendroglioma(5), aorta(l), breast(4). bulk 
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germ cell seminoma(2), colon(3), germinal center b cell(2), glioblastoma (pooled)(l), 
kidney(2), lung carcinoma(l), metastatic prostate bone lesion(4), nonmal prostate(l), normal 
prostatic epithelial cells(l), oligodendroglioma(l), ovary(2), parathyroid tumor(5), pectoral 
muscle (after mastectomy)(19), pooled germ cell tumors(l), prostate(l), senescent 
fibroblast(4), tumor, 5 pooled (see description)(l) 


6081 


brain(l), germinal center b cell(l), testis(l) 


6108 


2 pooled tumors (clear cell type)(l), carcinoid(l), germinal center b cell(l), parathyroid 
tumor(l) 


6159 


2 pooled tumors (clear cell type)(2), ewing's sarcoma(l), schwannoma tumor(l), adipose 
tissue, white(lX adrenal adenoma(3), amygdala(IX anaplastic oligodendroglioma(l), 
astrocytoma(2), bone marrow stroma(3), borderline ovarian carcinoma(l), cochlea(3), colon 
tumor(l), epithelium (cell line)(4), frontal lobe(3), germinal center b cell(l), human 
pancreatic islets(l), kidney tumor(3), larynx(l), liver(l), lung carcinoma(l), lung tumor(l), 
normal leg muscle(l), oIigodendroglioma(l), ovarian tumor(8), parathyroid tumor(l), 
pectoral muscle (after mastectomy)(8), prostate tumor(l), senescent fibrobIast(3), small cell 
carcinoma(4) 


6231 


adenocarcinoma(2), breast(3), colon(4), colon tumor(2), endometrioid ovarian metastasis( I ), 
epithelium (cell line)(22), frontal lobe(l), germ cell tumor(3), invasive tumor (cell line)(13), 
moderately differentiated adenocarcinoma(l), normal prostatic epithelial cells(l), ovarian 
tumor(4), ovary(l), pancreatic islet(l), placenta(trophoblast)(l), pooled germ cell tumors(2), 
squamous cell carcinoma(l), synovial sarcoma(l), tumor(3), tumor, 5 pooled (see 
description)(2)» two pooled squamous cell carcinomas(2) 


6238 


bone(l), colon(2), ovarian tumor(2), small cell carcinoma(2), total brain(l) 


6252 


2 pooled tumors (clear cell type)(l), ewing's sarcoma(6), adenocarcinoma(l), alveolar 
rhabdomyosarcoma^), anaplastic oligodendroglioma(2), brain(l), breast(2), breast tumor(l), 
carcinoid(2), cochlea(2), germ cell tumor(l), glioblastoma (pooled)(l), kidney(l), kidney 
tumor(6), liposarcoma(l), lung carclnoma(l), lymphoma(l), metastatic prostate bone 
lesion(6), muscle(l), normal prostate(l), normal prostatic epithelial cells(l), ovarian 
tumor(2), ovary(4), parathyroid tumor(lX pectoral muscle (after mastectomy)(5), pooled 
germ cell tumors(4), prostate(l), renal cell tumor(lX senescent fibroblast(3), stem cells(l), 
tumor. 5 pooled (see description)(4) 


6290 


2 pooled tumors (clear cell type)(l), b-cell, chronic lymphotic leukemia(l), heart(3), 
lymphoma(l), adenocarcinoma(l), adipose tissue, white(l), brain(3), carcinoid(5), 
cerebellum(5), colon(8), epithelium (cell line)(l), frontal lobe(l), germinal center b-cells(l), 
kidney tumor(l), medulloblastoma(l), melanoma (mewo cell line)(l), normal prostate(l), 
omentum(l), ovarian tumor(l), pancreatic islet(l), placenta(3), pooled frontal lobe(l), 
retina(l), retinal fovaea(l), schizophrenic brain s-11 frontal lobe(l), senescent fibroblast(l), 
small cell carcinoma(3), synovial membrane(l), total brain(2) 


6322 


b-cell, chronic lymphotic leukemia(I), heart(4), lymphoma(l), adipose tissue, white(l), 
brain(3), carcinoid(4), cerebellum(5), colon(8), epithelium (cell line)(l), frontal lobe(l), 
germinal center b-cells(l), kidney tumor(l), melanoma (mewo cell line)(l), omentum(l), 
ovarian tumor(l), pancreatic islet(l), placenta(3), pooled frontal lobe(l), retinal fovaea(l), 
schizophrenic brain s-11 frontal lobe(l), senescent fibroblast(l), small cell carcinoma(3), 
synovial membrane(l). total brain(2) 


6329 


b-cell, chronic lymphotic leukemia(3), heart(3), lymphoma(2), adenocarcinoma(6), adipose 
tissue, white(l), adrenal adenoma(2), anaplastic oligodendroglioma(2), bone marrow 
stroma(l), brain(2), breast(l), carcinoid(7), cerebellum(5), colon(8). epithelium (cell line)(l), 
frontal lobe(l), germ cell tumor(2), germinal center b-cells(l), kidney tumor(l), larynx(l), 
lung tumor(l), medulloblastoma(l), melanoma (mewo cell line)(l), metastatic melanoma to 
bowel(l), moderately differentiated adenocarcinoma(l), normal prostate(2). omentum(l), 
ovarian tumor(l), pancreatic islet(2), papillary serous ovarian metastasis(l), parathyroid 
tumor(l), pectoral muscle (after mastectomy )( I), placenta(2), pooled frontal lobe(l), pooled 
germ cell tumors(3), prostate(l), retinal fovaea(l), schizophrenic brain s-ll frontal lobe(l), 
senescent fibroblast(3), small cell carcinoma(5), squamous cell carcinoma(l), synovial 
membrane(l), total brain(2), tumor(l), tumor, 5 pooled (see description)(l), two pooled 
squamous cell carcinomas(l) 


6334 


2 pooled tumors (clear cell type)(3), b-cell, chronic lymphotic leukemia(2), anaplastic 
oligodendroglioma(9), glioblastoma (pooled)(3), kidney(l), normal prostate(3), ovarian 
tumor(l), senescent fibroblast(2) 


6345 


testis(3) 
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6350 


2 pooled tumors (clear cell type)(2), b-cell, chronic lymphotic leukemia(l), bone(l), 
adenocarcinoma(l), anaplastic oligodendroglioma(lO), breast(l), breast tumor(l), colon(2), 
glioblastoma (pooledX6), kidney(l), lung carcinoma(l), medulloblastoma(l), metastatic 
prostate bone lesion(l), normal prostate(l), pectoral muscle (after mastectomy)(4), pooled 
germ cell tumors(2), senescent fibroblast(2X squamous cell carcinoma(l), testis(l), tumor(lX 
tumor, 5 pooled (see description)(l) 


6358 


2 pooled tumors (clear ceil type)(2), anaplastic oligodendrogl!oma(4), bram(4), carcinoia(2), 
colon(3), colon tumor rer+(l), germinal center b cell(4), glioblastoma (pooled)(2), normal 
prostate(l), normal prostatic epithelial cells(l), parathyroid tumor(2}, pectoral muscle (after 
mastectomyXl), senescent fibroblast(3) 


6384 


epithelium(l), meningioma(l), parathyroid tumor(2), senescent ribrobIast(l) 


6400 


blood(2), brain(2), colon(6), parathyroid tumor(I), tumor(l) 


6431 


adenocarcinoma(l) 


6453 


pooled germ cell tumors(3} 


6636 


2 pooled tumors (clear cell type)(lX ewing's sarcoma(2), adenocarcinoma(3), anaplastic 
oligodendroglioma(3), breast tumor(l), carcinoid(4), coIon(2), colon tumor(l), colon tumor 
rer+(2), frontal lobe(3), germinal center b cell(6), kidney tumor(l), ovarian tumor(4), 
parathyroid tumor(4), pectoral muscle (after mastectomy)(10), placenta(l), pooled germ cell 
tumors(2), senescent fibroblast(l), squamous cell carcinoma from base of tongue(l), tumor, 5 
pooled (see description)(l) 


6688 


b-cell, chronic lymphotic leukemia(3), alveolar rhabdomyosarcoma( 1 ), anaplastic 
oligodendroglioma(l), breast(2), carcinoid(l), colon(l), colon tumor(l), four pooled pituitary 
adenomas(l), germinal center b cell(2), glioblastoma (pooled)(l), kidney tumor(2), 
moderately-differentiated adenocarcinoma(l), muscle(2), pectoral muscle (after 
masteciomy)(l), pooled germ cell tumors(3), senescent fibrobIast(3), synovia! sarcoma(l), 
testis(l), tumor, 5 pooled (see description)(3) 


6727 


frontal !obe(l), schizophrenic brain s-l 1 frontal lobe(l) 


6865 


frontal lobe(4), germinal center b cell(l), muscle(l), ovarian tumor(2), pectoral muscle (after 
mastectomy)(3), senescent flbroblast(l), small cell carcinoma(l), thyroid(l) 


6892 


2 pooled tumors (clear cell type)(5). adenocarcinoma(l), anaplastic oligodendroglioma(5), 
brain(5), breast(3), breast tumor(l), carcinoid(5), cerebellum(l), colon(3), colon tumor 
rer+(2), frontal lobe(5), germinal center b cell(3), glioblastoma (pooled)(2), moderately- 
differentiated adenocarcinoma(l), normal prostate(3), ovary(2), parathyroid tumor(3), 
pectoral muscle (after mastectomy)(l), placenta(l), pooled germ cell tumors(5), senescent 
fibroblast(3), tumor(l), tumor, 5 pooled (see descriptionXO 


7000 


anaplastic oligodendroglioma(2), frontal lobe(2) 


7041 


germinal center b ceil(l), total brain(2) 


7533 


2 pooled tumors (clear cell typeXO, b-cell, chronic lymphotic leukemia(l), schwannoma 
tumor(l), anaplastic oligodendroglioma(l), astrocytoma(l), breast(l), cochlea(l), colon 
tumor rer+(2), germ cell tumor(l), germinal center b cell(2), glioblastoma (pooled)(3). 
hepatoma(4), medulloblastoma(l), metastatic prostate bone lesion(]), moderately- 
difTerentiated adenocarcinoma(l), ovarian tumor(l), prostate tumor(l), senescent 
fibroblast(l), squamous cell carcinoma(l), three pooled meningiomas(l) 


7535 


b-cell, chronic lymphotic leukemia(l), schwannoma tumor(l), anaplastic 
oligodendroglioma(l), astrocytoma( 1 ), breast(l), cochlea(l), colon tumor rer+(2), germ cell 
tumor(l), germinal center b cell(2), glioblastoma (pooledX3), hepatoma(4), 
medulloblastoma(l), metastatic prostate bone lesion(l), moderately-differentiated 
adenocarcinoma(i), ovarian tumor(1), senescent fibroblast(l)« squamous cell carcinoma(l) 


7697 


colon(l), mvasive adenocarcinoma(3), liver(l), moderately-differentiated adenocarctnoma(i) 


8009 


alveolar rhabdomyosarcoma(2), anaplastic oligodendroglioma(l), carcmoid(9), colon(l), 
colon tumor rer+(l), germinal center b cell(2), glioblastoma (pooledXl), normal prostate(l), 
ovary(2), pooled germ cell tumors(2), thyroid(2), tumor(l) 


8078 


frontal lobe(2) 


8079 


b-cell, chronic lymphotic leukemia(2), bone(1), anaplastic oligodendroglioma(l), colon(l), 
pooled germ cell tumors{3) 


8097 


colon(l} 


8166 


2 pooled tumors (clear cell typeX5X adenocarcinoma(6), anaplastic oligodendroglioma(3), 
breast tumor(l), carcinoid(l), colon(l), epithelium (cell lineX1)« glioblastoma (pooled)(2), 
lung tumor(2), metastatic melanoma to bowel(t), moderately-differentiated 
adenocarcinoma(l), normal prostate(l), ovary(l), papillary serous ovarian metastasis(l), 
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parathyroid tumor(l), pooled germ cell tumors(4), renal cell njmor(l), squamous cell 
carcinoma(5X synovial sarcoma(l), tumor(3), tumor, 5 pooled (see description)(3) 


8341 


2 pooled tumors (clear cell type)(l), anaplastic oligodendroglioma(2), germinal center b 
cetl(3), glioblastoma (pooIed)(lX normal prostate(l), oligodendroglioma(l), pectoral muscle 
(after mastectomy)(2), tumor(l) 


8534 


germinal center b cell(4) 


8666 


2 pooled high-grade transitional cell tumors(l), 2 pooled tumors (clear cell typeX5), b-cell, 
chronic lymphotic leukemia(2), adenocarcinoma(3), alveolar rhabdomyosarcoma(3), 
amygdala(l), anaplastic oligodendroglioma(3), blood(2), breast(l), carcinoid(3), colon(3). 
germinal center b cell(4X muscle(l), normal prostate(l), ovarian tumor(6), parathyroid 
tumor(2), pectoral muscle (after mastectomy )(4), pheochromocytoma(l), placenta(l), 
senescent fibroblast(l 1), two pooled squamous cell carcinomas(l) 


8671 


2 pooled high-grade transitional cell tumors(l), 2 pooled tumors (clear cell type)(3), b-cell, 
chronic lymphotic leukemia(l), adenocarcinoma(6), alveolar rhabdomyosarcoma(3), 
amygdala(2), anaplastic oligodendroglioma(2), blood(2), breast(2X carcinoid(l), colon(2), 
germmal center b cell(5), liver cancer(l), metastatic prostate bone lesion(]), muscle(]), 
normal prostate(l), ovarian tumor(5), parathyroid tumor(2), pectoral muscle (after 
mastectomy)(5), p1acenta(2), pooled germ cell tumors(tX renal cell tumor(l), senescent 
fibroblast(8) 


8968 


b-cell, chronic lymphotic leukem]a(lX neuroepithelial cells(l), total brain(]) 


9406 


brain(2), colon(l), tumor(l) 


9407 


brain(2), colon(l), tumor(l) 


9668 


2 pooled tumors (clear cell type)(8), adenocarcinoma(l), anaplastic oligodendroglioma(l), 
breast carcinoma in situ(3X cerebellum(l), oligodendroglioma(l), papillary serous 
carcinoma(2X parathyroid tumoK26), pooled germ cell tumors(l) 


9679 


ovarian tumor(3), parathyroid tumor( 1 ), uterus( 1 ) 


9755 


2 pooled tumors (clear cell type)(l), adenocarcinoma(l), carcinoid(l), ovarian tumor(l), 
pooled germ cell tumors(3X senescent fibroblast(2), total brain(2) 


9868 


brain(l), senescent flbrobla$t(l) 


10044 


2 pooled high-grade transitional cell tumors(l), 2 pooled tumors (clear cell type)(3), b-cell» 
chronic lymphotic leukemia(l), adenocarcinoma(6), alveolar rhabdomyosarcoma(3), 
amygdala(2), anaplastic oligodendroglioma(2), blood(l), breast(2), carcinoid(I), colon(2), 
germinal center b eel 1(5), liver cancer(l), metastatic prostate bone lesion(l), normal 
prostate(I), osteosarcoma(l), ovarian tumor(5), parathyroid tumor(2), pectoral muscle (after 
mastectomy)(5), placenta(2), pooled germ cell tumors(lX renal cell tumor(l), senescent 
fibroblast(8) 


10322 


2 pooled tumors (clear cell type)(9), b-cell, chronic lymphotic leukemia(2), anaplastic 
o!igodendroglioma(4), breast(l), carcinoid(3), colon(3), colon tumor rer+(l), germinal center 
b cell(l), glioblastoma (pooled)(4), kidney tumor(l), low-grade prostatic neoplasia(l), 
metastatic melanoma to bowel(l), normal prostate(2), ovary bulk tumor(l), parathyroid 
tumor(3), pectoral muscle (after mastectomy)(3), pooled germ cell tumors(2), senescent 
flbrobIast(2), small cell carcinoma(2), synovial $arcoma(2), two pooled squamous cell 
carcinomas(3) 


10526 


2 pooled tumors (clear cell type)(2), ewing's sarcoma(4), adenocarcinoma(l), adipose tissue, 
white(l), alveolar rhabdomyosarcoma(3), anaplastic oligodendroglioma(5), brain(l), 
breast(4), breast tumor(l), carcinoid(2), coIon(lO), epithelium (cell line)(l), fi*ontal lobe(7), 
germ cell tumor(l), glioblastoma (pooled)(l), invasive prostate tumor(l), kidney tumor(8), 
lymphoma(4), metastatic prostate bone lesion(2), mu5cte(3)» normal prostate(2), normal 
prostatic epithelial cells(3), ovarian tumor(l), ovary(2), parathyroid tumor(4), pectoral 
muscle (after mastectomy)(26), placenta(l), pooled germ cell tumors(6), prostate(9), 
senescent flbrobIast(6), small cell carclnoma(l), tumor. 5 pooled (see description)(2) 


10584 


2 pooled tumors (clear cell type)(2), b-cell, chronic lymphotic leukemia(4), anaplastic 
oligodendrogIioma(6), carcinoid(2), colon(3), colon tumor rer+(4), colon tumor, rer+(2), 
ovarian tumor(4) 


10650 


pooled germ cell tumors(2) 


10739 


2 pooled tumors (clear cell type)(5), ewing's sarcoma(l), liver(l), alveolar 
rhabdomyosarcoma( 1 ), anaplastic oligodendroglioma(7), breast(3), breast tumor(l), 
carcinoid(9), colon(5), colon tumor rer+(3), fi-ontal lobe(8), germinal center b cell(3), 
glioblastoma (pooled)(l 1), invasive prostate tumor(l), kidney(l), metastatic prostate bone 
lesion(l), moderately-differentiated adenocarcinoma(l), normal prostate(2), normal prostatic 
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epithelial cells(l), parathyroid tunior(4), pectoral muscle (after mastectomy)(6), placenta(l), 
pooled Rcrm cell tumorsdX tumor(l), tumor, 5 pooled (see description)(l) 


10743 


2 pooled tumors (clear cell type)(5), ewing's sarcoma(l), liver(l), schwannoma tumor(l), 
alveolar rhabdomyosarcoma(l), anaplastic o!igodendroglioma(7), breast(3), breast tumor(l), 
carcinoid(lO), colon(5), colon tumor rer+(3), germinal center b cell(3), glioblastoma 
(pooled)(ll), invasive prostate tumor(l), kidney(l), liver(l), metastatic prostate bone 
lesion(l), moderately-differentiated adenocarcinoma(l), muscle(l), normal prostate(2), 
normal prostatic epithelial ce!ls(l), ovarian tumor(4), parathyroid tumor(4), pectoral muscle 
(after mastectomy)(6), placenta(l), pooled germ cell tumors(2), thyroid(l), tumor(l), tumor, 
5 pooled (see description)(l) 


10744 


heart(2), four pooled pituitary adenomas(l), frontal lobe(3), glioblastoma (pooled)(l), 
liverd), moderately-differentiated adenocarcinoma(l). peripheral blood(l), retina(l) 


10880 


2 pooled tumors (clear cell type)(3), ewing's sarcoma(l), adenocarcinoma(3), brain(l), 
carcinoid(8), germinal center b cell(l), glioblastoma (pooled)(5), kidney tumor(l), normal 
prostate(3), oligodendroglioma(l), parathyroid tumor(ll), senescent fibroblast(4), three 
pooled meningiomas(l), total brain(3) 


10942 


2 pooled tumors (clear cell type)(8), adenocarcinoma(l), anaplastic oligodendroglioma(l), 
breast carcinoma in silu(3), cerebellum(l), oligodendroglioma(l), papillary serous 
carcinoma(2), parathyroid tumor(26), pooled germ cell tumors(l) 


11019 


epiderniis(l), ewing's sarcoma(2), heart(l), schwannoma tumor(I), adenocarcinoma(5), 
alveolar rhabdomyosarcoma(3), amygdala(l), brain(4), carcinoid(l), colon mucosa(l), 
endometrioid ovarian metastasis(l), epithelium (cell line)(2), frontal lobe(l), germ cell 
nimor(3), heart(l), invasive prostate tumor(2), kidney(2), kidney tumor(2), liposarcoma(l), 
liver(24), mantle cell lymphoma(3), medulloblastoma(l), metastatic prostate bone lesion(6), 
moderately-differentiated adenocarcinoma(l), muscle(l), normal prostatic epithelial celis(5), 
ovary(l), papillary serous ovarian metastasis(l), parathyroid tumor(l), placenta(2), pooled 
germ cell tumors(2), prostate(l), thyroid(l), tumor(l), tumor, 5 pooled (see descriptionXl), 
uterus(4) 


11278 


2 pooled tumors (clear cell type)(4), bone(4), heart(2), anaplastic oligodendroglioma(4), 
carcinoid(3), colon tumor rer+(l), epithelium (cell Iine)(l), frontal lobe(12), kidney(2), 
liposarcoma(l), liver(6), lung carcinoma(l), muscle(6), normal prostate(2), ovary(2), 
parathyroid tumor(6), pectoral muscle (after mastectomy)(3), schizophrenic brain s-1 1 frontal 
iobe(l), senescent fibroblast(6), small cell carcinoma(4), synovial membrane(l) 


11342 


2 pooled tumors (clear cell type)(3), blood(l), germinal center b cell(4), normal 
epithelium(l), pooled germ cell tumors(l), tumor, 5 pooled (see description)( 1 ) 


11735 


adenocarcinoma(l), breast(2), colon(l), frontal lobe(2), placenta(l), pooled germ cell 
tumors(8) 


12039 


testis(l) 


12043 


spleen(l) 


12048 


2 pooled tumors (clear cell type)(l), anaplastic oligodendroglioma(l), brain(2), colon tumor 
rer+(2), spleen(2) 


12098 


astrocytoma(l), ovarian tumor(l) 


12202 


2 pooled tumors (clear cell type)(2), adenocarcinoma(l), cochlea(l), germinal center b 
cell(2), pituitaryd) 


12243 


alveolar rhabdomyosarcoma(2), anaplastic oligodendrogIioma(lX carcinoid(9), colon(l), 
colon tumor rer+(l), germinal center b cell(2). glioblastoma (pooled)(2), metastatic prostate 
bone lesion(l), normal prostate(2), ovary(2), pooled germ cell tumors(2), thyroid(2), 
tumord) 


12263 


anaplastic oligodendroglioma(l), carcinoid(l), germinal center b cell(2), invasive 
adenocarcinoma(l), liver and spleen(2), papillary serous carcinoma(l) 


12490 


2 pooled tumors (clear cell type)(3), b-cell, chronic lymphotic leukemia(l), ewing's 
sarcoma(l), adenocarcinoma(2), anaplastic oligodendroglioma(l), germinal center b cell(2), 
kidney tumor(l), larynx(l), mantle cell lymphoma(l), medulloblastoma(l), melanocyte(2), 
moderately-differentiated adenocarcinoma(l), ovarian tumor(12), parathyroid tumor(2), 
pooled germ cell tumors(4), prostate(2), tumor, 5 pooled (see description)(2) 


12604 


2 pooled tumors (clear cell type)(12), b-cell, chronic lymphotic leukemia(4), bone(l), ewing's 
sarcoma(l), adenocarcinoma(l), anaplastic oligodendroglioma(lO), carcinoid(3), colon(3), 
colon tumor rerf(l), germinal center b cell(l), glioblastoma (pooled)(3), invasive prostate 
tumor(l), kidney(l), larynx(l), muscle(l), normal leg muscle(l), normal prostate(3), ovarian 
tumor(l), parathyroid tumor(12), pectoral muscle (after mastectomy)(28). pooled germ cell 
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tumors(3), prostate(3), renal cell tumor(l)» senescent ribroblast(12), skeletal muscle(l) 


12657 


adenocarcinoma(lX carcinoid(2)» colon tumor, rer+(2) 


12788 


2 pooled tumors (clear cell typeX3X adenocarcinoma(l), adrenal adenoma(l), alveolar 
rhabdomyosarcoma(l), carcinoid(l), colon(I), colon tumor rer+(l), germinal center b cell(l), 
glioblastoma (pooled)(2), ovary(l), parathyroid tumor(l), senescent fibroblast(2), tumor, 5 
pooled (see description)(l) 


12901 


anaplastic oligodendroglioma(2), carcinoid(l), germinal center b cell(6), invasive tumor (ceil 
line)(l), parathyroid tumor(l) 


12907 


adipose tissue, white(l), carcinoid(l), parathyroid tumor(2), pooled frontal lobe(l), tumor, 5 
pooled (see description)(l) 


13013 


invasive tumor (cell line)(l), parathyroid tumor(3) 


13202 


2 pooled tumors (clear cell type)(l), b-cell, chronic lymphotic leukemia(3), breast(l), breast 
tumor(l), carcinoid(2), colon tumor reM-(l), colon tumor, rer+(l), germinal center b cell(l), 
kidney tumor(1), lung carcinoma(l), lung tumor(I), meningioma(2), moderately 
differentiated adenocarcinoma(l), normal prostate(3), ovanan tumor(l), squamous cell 
carcinoma(l), tumor, 5 pooled (see description)(l) 


13229 


cochlea(l), frontal cortex(l), pooled germ cell tumors(l) 


13256 


2 pooled tumors (clear cell type)(2), adenocarcinoma(2), anaplastic oligodendroglioma(l), 
parathyroid tumor(l), pooled germ cell tumors(3) 


13285 


2 pooled tumors (clear cell type)(6), carcinoid(l), colon tumor rer+(l), kidney(l), liver(l) 


26710 


2 pooled tumors (clear cell type)(l), pooled germ cell tumors(l) 


27273 


anaplastic oligodendroglioma(l), cerebeilum(l), germinal center b cell(2), moderately- 
difTerentiated adenocarcinoma(l), pectoral muscle (after mastectomy)(2) 


27301 


astrocytoma(2), ovarian tumor(l) 


27336 


2 pooled tumors (clear cell type)(2), b-cell, chronic lymphotic Ieukemia(6), 
adenocarcinoma(l), anaplastic oligodendroglioma(9), carcinoid(6), colon(l), colon tumor 
rer+(l), colon tumor, rer+(6), frontal lobe(l), germinal center b cell(2), glioblastoma 
(pooled)(1), pooled germ cell tumors(4), senescent r!broblast(2) 


27361 


colon(l) 


27374 


bK:ell, chronic lymphotic leukemia(l), pooled germ cell tumors(5) 


27627 


2 pooled tumors (clear cell type)(l), anaplastic oligodendroglioma(l), frontal lobe(l), 
germinal center b cell(l), glioblastoma (pooled)(2), ovarian tumord) 


27697 


skeletal musc]e(2), thyroid(l) 


27877 


carcinoid(l), colon(l), germinal center b cen(l) 


29469 


frontal lobe(4), germinal center b cell(l), ovarian tumor(2), pectoral muscle (after 
mastectomy)(3), senescent fibroblast(l), small cell carcinoma(l), thyroid(l) 


29557 


ewing's sarcoma( 1 ), larynx( 1 ), medulloblastoma(l ), moderately-differentiated 
adenocarcinoma(l), ovarian tumor(7), parathyroid tumor(2), pooled germ cell tumor5(l), 
prostate(l) 


29673 


heart(l), colon tumor(l), kidney tumor(l), lung tumor(3), lymphoid(2), moderately 
differentiated adenocarcinoma(l), muscle(l), ovarian tumor(3), squamous cell carcinoma(!) 


29814 


brain(2), colon(l), pancreatic islet(l) 


30218 


small cell carcinoma(2) 


30446 


adenocarcinoma(2), adrenal adenoma(3), colon tumor(l), glioblastoma (pooled)(2), kidney 
tumor(l), ovarian tumor(l), parathyroid tumor(l), prostate(l), small cell carcinoma(l) 


30583 


carcinoid( 1 ), ovarian tumor(l), pooled germ cell tumors(2) 


30719 


2 pooled tumors (clear cell type)(3), b-cell, chronic lymphotic leukemia(l), breast(2). 
colon(l), colon tumor, rer+(l), germinal center b cell(l), glioblastoma (pooled)(3), Hver(2), 
normal prostate(2), pooled germ cell tumors(]0) 


31356 


pooled germ cell tumors(l) 


31422 


2 pooled tumors (clear cell type)(l), b-cell, chronic lymphotic leukemia(2), bone(l), 
anaplastic oligodendroglioma(l 1), bone marrow stroma(l), brain(2), breast(l), carcinoid(20), 
colon(6). colon tumor, reH-(I), germinal center b cell(2), glioblastoma (pooledX3), kidney 
tumor(l), larynx(l), medulloblastoma(l), normal prostate(l), ovarian tumor(l), parathyroid 
tumor(l), pectoral muscle (after mastectomy)(2), pooled germ cell tumors(2), senescent 
fibroblast(5), tumor(l), tumor, 5 pooled (see description)(3) 


31554 


2 pooled tumors (clear cell type)(l), carcinoid(l), germinal center b cell(4) 


31627 


2 pooled tumors (clear cell type)(l), alveolar rhabdomyosarcoma(l), anaplastic 
oligodendroglioma{2), frontal lobe(7), normal prostate(l). oligodendrofilioma(l). parathyroid 
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tumor(l), pectoral muscle (after mastectomy)(3X pooled germ cell tumors(l) 


31744 


adenocarcinoma(l ) 


31790 


ovarian tumor(l), synovial membrane(l) 


32102 


2 pooled tumors (clear cell type)(l), b-cell, chronic lymphotic leukemia(2), heart(3), 
lymphoma(2), adenocarcinoma(6), adipose tissue, white(l), anaplastic oligodendroglioma(5), 
brain(2), carcinoid(9), cerebellum(5), colon(7), epithelium (cell line)(l), frontal lobe(4), germ 
cell tumor(l), glioblastoma (pooledX3), juvenile granulosa tumor(lX kidney tumor(2), 
medulloblastoma(l), melanoma (mewo cell HneXl), metastatic melanoma to bowel(l), 
moderately-differentiated adenocarcinoma(l), normal epithelium(l), normal prostate(3), 
omentum(l), ovarian tumor(2), pancreatic islet(l), parathyroid tumor(4), pectoral muscle 
(after mastectomy)(2), placenta(l), pooled frontal lobe(l), pooled germ cell tumors(l), retinal 
fovaea(l), schizophrenic brain s-11 frontal lobe(l), senescent fibroblast(5), small cell 
carcinoma(4), synovial membrane(l), total brain(2), tumor, 5 pooled (see descriptionXlX two 
pooled squamous cell carcinomas(2) 


32473 


2 pooled tumors (clear cell type)(3), heart(l), adipose tissue, white(l), alveolar 
rhabdomyosarcoma(3), anaplastic oligodendroglioma(3), breast(2), colon(2), colon tumor 
reN-(l), invasive prostate tumor(l), kidney(l), metastatic prostate bone lesion(l), normal 
prostate(l), pectoral muscle (after mastectomyX22), pooled germ cell tumors(2), senescent 
fibroblastd) 


32475 


2 pooled tumors (clear cell typeX3), heart(l), adipose tissue, white(l), alveolar 
rhabdomyosarcoma(3), anaplastic oligodendroglioma(3), brea$t(2), colon(2), colon tumor 
rer+(l), kidney(l), metastatic prostate bone lesion(l), normal prostate(l), pectoral muscle 
(after mastectomyX22), pooled germ cell tumors(2), senescent fibroblast(l) 


33712 


2 pooled tumors (clear cell typeXO, adenocarcinoma(2), anaplastic oIigodendroglioma(4), 
brain frontal cortex(2), breast(l), germinal center b cel!(4), juvenile granulosa tumor(l), 
normal prostatic epithelial cells(2), ovarian tumor(l), three pooled meninfiiomas(2) 


35005 


bone(l), pooled germ cell tumors(l) 


35185 


spinal cord( 1 ) 


35326 


brain(l), breast(2), carcinoid(l), colon(l), germinal center b cell(2), human pancreatic 
islets(l), Iiver(l), melanocyte(l), ovarian tumor(l), placenta(l), total brain(l) 


37337 


anaplastic oligodendroglioma(2), colon(2), glioblastoma (pooledX3), juvenile granulosa 
tumor(l), pooled germ cell tumors(3), tumor, 5 pooled (see description)(l), uterus(2) 


38220 


2 pooled tumors (clear cell typeXO, b-cell, chronic lymphotic leukemia(l), 
adenocarcinoma(2), breast(l), carcinoid(l), frontal lobe(l), germinal center b cell(2), normal 
prostate(l), normal prostatic epithelial cells(l), pooled germ cell tumors(3), total brain(2) 


38311 


bone(l), frontal lobe(l), germinal center b cell(l), moderately differentiated 
adenocarcinoma(]), placenta(1) 


38631 


brain(l) 


38749 


2 pooled tumors (clear cell type)(4), breast(l), cochlea(l), colon tumor reH-(l), liver(l), 
moderately differentiated adenocarcinoma(l), normal prostate(l), pancreas (with no medical 
abnormalitiesXl), parathyroid tumor(l), prostate(I) 


40975 


brain(2), tumor(l) 


40991 


brain(2), tumor(l) 


44053 


2 pooled tumors (clear cell type)(l), bone(l), total brain(l) 


45179 


colon(l), metastatic prostate bone lesion(l) 


45274 


2 pooled tumors (clear cell typeX6), cd34+, cd38- from normal bone marrow donor(l), 
ewing's sarcoma(l), heart(l), lung(l), adenocarcinoma(8), adrenal adenoma(l), alveolar 
rhabdomyosarcoma(3), anaplastic oligodendroglioma(5), borderline ovarian carcinoma(17), 
brain(3), breast(4), breast carcinoma in situ(20), bronchioalveolar carcinoma(4), carcinoid(4), 
cerebellum(l), colon(4), colon tumor rer+(3), early stage papillary serous carcinoma(3), 
glioblastoma (pooledX2), invasive adenocarcinoma(8), invasive carcinoma(2), lobullar 
carcinoma in situ(2), low-grade prostatic neoplasia(l), lung carcinoma(]), lung tumor(l), 
normal prostate(2), normal prostatic epithelial cells(l), oligodendroglioma(3), ovarian 
tumor(2), papillary serous carcinoma(18), papillary serous ovarian metastasis(l), parathyroid 
tumor(5), pectoral muscle (after mastectomy)(7), pooled germ cell tumors(2), senescent 
fibroblast(3), stem cell 34+/38+(2), thymus(l) 


46679 


2 pooled tumors (clear cell typeX4), anaplastic oligodendroglioma(1), pectoral muscle (after 
mastectomyXIX pooled germ cell tumors(l) 


48024 


2 pooled tumors (clear cell typeX3), heart(l), adipose tissue, white(l), alveolar 
rhabdomyosarcoma(3), anaplastic oligodendroglioma(3), breast(2), colon(2), colon tumor 
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rer+(l), invasive prostate tumor(lX kidney(l), metastatic prostate bone lesion(l}, normal 
prostate(l), pectoral muscle (after mastectomy)(22), pooled germ cell tumors(2), senescent 
fibroblast(l) 


48548 


liver(4) 


48823 


2 pooled tumors (clear cell type)(l), bone(l), alveolar rhabdomyosarcoma(2), breast(l), 
epithelium (cell line)(l), glioblastoma (pooled)(l), lung(l), lung tumor(l), parathyroid 
tumor(l), pectoral muscle (after mastectomy)(l), renal cell tumor(l) 


48901 


germinal center b celt(lX pooled germ cell tumors(l) 


49018 


anaplastic oligodendroglioma(l), bram(l), breast(l), frontal lobe (see aescription)(l), 
testis(5) 


49034 


adipose tissue, white(l), pancreas (with no medical abnormalities)(l) 


49133 


retina(l) 


49140 


2 pooled tumors (clear cell type)(lX adenocarcinoma(lX carcinoid(6), colon tumor rer+(l), 
glioblastoma (pooled)(2), moderately difTerentiated adenocarcmoma(l), normal prostate(lX 
pancreatic islet(2), parathyroid tumor(3) 


49387 


colon(l), pooled germ cell tumors(l) 


49416 


anaplastic oligodendroglioma(]}, carcinoid(3), cerebellum(2), frontal lobe(2)» germinal center 
b cell(2), parathyroid tumor(l), relina(l), tumor(I) 


49493 


neuroepithelial cells(l), ovarian tumor(4), thyroid(l) 


49640 


2 pooled tumors (clear cell typeXl)> anaplastic oligodendroglioma(l}, brain(2), colon tumor 
reH-(l), spleen(2) 


49863 


b-cell, chronic lymphotic leukemia(l), carcinoid(l), germinal center b cell(2), pooled genn 
cell tumors(3) 


49871 


b-cell, chronic lymphotic leukemia(l), carcinoid(l), germinal center b cell(2), pooled germ 
cell tumors(3) 


50185 


total brain(l) 


50763 


cerebral cortex(3), brain(2), colon(l), colon tumor rer+(l), ovarian tumor(l)» schizophrenic 
Drain S' 1 1 ironiai looe^ i } 




z pooieo lUmOis ^ciear ceii typey^z^, o*ceii, cnronic iympnoiic \KwsK>\v\\c\j)'t v>aii/iiiuiu^i^» 
cochlea( 1 ), germinal center b cell(2) 


51130 


adenocarcinoma(l) 


51212 


placenta.(2) 


51346 


bone(lX small cell carcinoma(2) 


51380 


placenta(4) 


51400 


liverd) 


51954 


germinal center b cell(l), total brain(l) 


52076 


normal prostatic epithelial cells(l) 



TABLE V 



SEQ ID NO. in 

priority 

application 


Low frequency expression 


High frequency expression 


37 


salivary gland 




51 


fetal brain, fetal kidney 


brain, salivary gland, liver 


179 


liver 




180 


liver 




183 


prostate 




326 


salivary gland 




362 


testis 




440 


brain 




452 


placenta 




483 


placenta 




500 


salivary gland 




528 


testis 
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573 


liver 




587 


dQllVuijr ^JUlIU 




588 


liver 




593 


fetal kidnev Dlacenta 




599 


liver 




628 


testis 




653 


testis 




670 


testis 




693 


nlacentfl 

via wviiuit 




703 


liver 




746 


Km in 




770 






775 






812 


brdin 




940 


liver 






liver 




00^ 
yyyj 


tPCtlQ 






oallVoiy gioilU, Ulolll 




1 \/0*T 


Krsiin livpr nmctntP 
L/IUIil, IIVCJ) Jpi\JaUltw 


foffll hrain 


1 151 

I I J I 


Ul Hill 




1 1Q0 


odllVaijr ^laliu, pivJauilC 






olUUtawll/ UllColUlv 




1 yUH 








br&in 




ZJOO 


testis 






pi UoiaiC 






piUdiaiv 






testis 




7755 


pi UololC 




7Rnn 


11 V Vl ) ICOlld 






brsin 




7Q'?9 


Iwlcll liVvl 




lfl7R 

JU f o 


pi uaUEiiv 






Mvpr 

11 VCI 






pi udioic 




J J o / 


nrostate 

pi K/JlCliV 






Gfilivnrv olnnH 




3501 


brsin 




3633 


brsin 




■5678 

J\J t o 






5714 


liver 










3804 


^alivarv (zland fetal kidnev 

scull vCUjr j^lUliUy AV^Ul IVlWIlVJr 




3892 


brsin 




3985 


ctnmach/intestine 




4005 


liver 






fpt jil hrsiin 

ICiai Ulalll 




•tvOO 


tpctic 

lColl> 




41 1 1 

Hill 


brBin 




4126 


liver 




4172 


liver 




4261 


prostate 




4436 


liver 




4609 


fetal liver 




4647 


fetal liver 




4660 


fetal liver 
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4664 


fetal brain 




4678 


fetal liver 




4687 


fetal liver 




4690 


fetal liver 




4694 


salivary gland 




4696 


salivary gland 




4733 


fetal liver 




4807 


fetal brain 




4809 


salivary gland 




4855 


prostate 




4908 


brain 




4943 


liver 




4947 


testis 




4976 


stomach/intestine 




5000 


fetal kidney 




5005 


fetal kidney 




5011 


prostate 




5040 


fetal liver 




5089 


salivary gland 




5117 


prostate 




5141 


liver 




5162 


fetal brain 




5167 


salivary gland 




5192 


liver 




5214 


fetal brain 




5250 


fetal kidney, fetal brain 




5262 


brain 


fetal kidney, fetal brain 


5270 


fetal brain 




5278 


fetal liver 




5453 


fetal kidney 




5494 


fetal liver 




5499 


brain 




5533 


fetal brain 




5563 


fetal liver 




5609 


placenta 




5748 


placenta 




5816 


fetal brain 




5824 


prostate 




5861 


fetal liver, prostate, fetal kidney, 
salivary gland 




5947 


fetal liver 




5966 


placenta 




5970 


fetal kidney 




5974 


fetal liver 




5983 


brain 




5985 


fetal kidney 




6080 


salivary gland 




6081 


salivary gland 




6108 


liver 




6238 


placenta, brain 




6252 


placenta 




6334 


liver 




6345 


brain 




6358 


fetal brain 




6636 


fetal brain 




6688 


fetal brain 
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6727 


testis 




6865 


salivary gland 




6892 


stomach/intestine, fetal brain 




7041 


liver 




7533 


placenta 




7535 


brain 




7697 


fetal liver 




7712 


testis 




8078 


testis 




8097 


salivary gland 




8166 


placenta, fetal liver 




8262 


prostate 




8534 


testis 




8666 


placenta 




8744 


prostate 




8968 


brain 




8994 


testis 




9297 


fetal liver 




9332 


testis 




9406 


prostate 




9407 


prostate 




9668 


testis 




9755 


testis 




9868 


liver 




10044 


fetal liver, placenta 




10526 


prostate 




10650 


testis 




10743 


stomach/intestine 




10744 


salivary gland 




10761 


prostate 




10880 


fetal liver 




10942 


brain 




11019 


liver 




11278 


salivary gland 




11562 


testis 




11688 


testis 




11735 


fetal br^in 




11813 


brain 




12043 


brain 




12202 


testis 




12220 


fetal kidney 




12243 


brain, testis, liver 


fetal brain, fetal liver 


12263 


prostate 




12276 


brain 




12604 


fetal kidney 




12657 


fetal kidney, stomach/intestine 




12788 


testis 




12901 


placenta 




12907 


salivary gland 




13013 


fetal liver 




13229 


brain 




13256 


testis 




13267 


salivary gland 




13285 


fetal brain 


fetal kidney, stomach/intestine 


26638 


fetal liver 




26710 


brain 
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26726 


fetal kidney 




26786 


brain 




27084 


testis 




27273 


fetal kidney 




27361 


liver 




27374 


fetal liver 




27877 


salivary gland 




28413 


prostate 




28517 


testis 




28518 


placenta 




29120 


salivary gland 




29673 


fetal liver 




30218 


liver 




30446 


prostate 




30477 


placenta 




30583 


testis 




30719 


fetal brain, salivary gland 




31356 


prostate 




31790 


brain 




33130 


testis 




38749 


prostate 




38890 


placenta 




40163 


testis 




40975 


prostate 




40991 


prostate 




42896 


prostate 




44053 


stomach/intestine, fetal kidney 




45091 


stomach/intestine 




45179 


prostate 




45274 


fetal kidney 




46679 


fetal liver 




48024 


fetal kidney 




48603 


liver, testis 




48671 


testis 




48823 


fetal kidney 




48901 


salivary gland 




49018 


stomach/intestine 




49034 


salivary gland 




49133 


liver 




49261 


testis 




49387 


liver 




49416 


fetal brain 




49426 


stomach/intestine 




49493 


salivary gland 




49640 


brain 




49863 


brain 




49871 


brain 




50015 


testis 




50049 


testis 




50112 


testis 




50185 


salivary gland 




50241 


liver 




50763 


brain 




51130 


testis 




51212 


stomach/intestine 




51400 


liver 
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WHAT IS CLAIMED IS: 

] . An isolated polynucleotide, said polynucleotide comprising a nucleic acid sequence encoding: 

i) A polypeptide comprising the amino acid sequence shown as SEQ 
5 IDNO:305; 

ii) a polypeptide comprising any one of the amino acid sequences 
shown as SEQ ID NOs:170-304, 306-338, 456-560, 785-918; or 

iii) a biologically active fragment of any of said polypeptides. 

1 0 2. The polynucleotide of claim 1 , wherein said polypeptide comprises a signal peptide. 

3. The polynucleotide of claim 1 , wherein said polypeptide is a mature protein. 

4. The polynucleotide of claim 1 , wherein said polynucleotide comprises any one of the nucleic 
15 acid sequences shown as SEQ ID NOs: 1-169, 339-455, or 561-784. 

5. The polynucleotide of claim I, wherein said polynucleotide is operably linked to a promoter. 

6. An expression vector comprising the polynucleotide of claim 5. 

20 

7. A host cell recombinant for the polynucleotide of claim 1 . 

8. A non-human transgenic animal comprising the host cell of claim 7. 

25 9. A pharmaceutical composition comprising the polynucleotide of claim I , and a 
pharmaceutically acceptable carrier. 

10. A method of making a GENSET polypeptide, said method comprising 

a) providing a population of host cells comprising the polynucleotide of 
30 claim 5; and 

b) culturing said population of host cells under conditions conducive to the 
production of said polypeptide within said host cells. 

1 1 . The method of claim 10, further comprising purifying said polypeptide from said population of 
35 host cells. 
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12. An isolated polynucleotide, said polynucleotide comprising any one of the nucleic acid 
sequences shown as SEQ ID NOs:l-169, 339-455, or 561-784. 

13. A biologically active polypeptide encoded by the polynucleotide of claim 12. 

5 

14. An isolated polypeptide or biologically active fragment thereof, said polypeptide comprising 
any one of the amino acid sequences shown as SEQ ID NOs: 1 70-338, 456-560, or 785-9 1 8. 

15. The polypeptide of claim 14, wherein said polypeptide comprises a signal peptide. 

10 

16. The polypeptide of claim 14, wherein said polypeptide is a mature protein. 

17. An antibody that specifically binds to the polypeptide of claim 14. 

15 1 8. A pharmaceutical composition comprising the polypeptide of claim 14, and a phannaceutically 
acceptable carrier. 

19. A method of making a GENSET polypeptide, said method comprising 

a) providing a population of ceils comprising a polynucleotide encoding 
20 the polypeptide of claim 14, operably linked to a promoter; 

b) culturing said population of cells under conditions conducive to the 
production of said polypeptide within said cells; and 

c) purifying said polypeptide from said population of cells. 

25 20. A method of determining whether a GENSET gene is expressed within a mammal, said method 
comprising the steps of: 

a) providing a biological sample from said mammal 

b) contacting said biological sample with either of: 

i) a polynucleotide that hybridizes under stringent conditions to the 
30 polynucleotide of claim 1 ; or 

ii) a polypeptide that specifically binds to the polypeptide of claim 14; and 

c) detecting the presence or absence of hybridization between said polynucleotide 
and an RNA species within said sample, or the presence or absence of binding 
of said polypeptide to a protein within said sample; 

35 wherein a detection of said hybridization or of said binding indicates that said GENSET gene is 
expressed within said mammal. 
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21. The method of claim 21, wherein said polynucleotide is a primer, and wherein said 

hybridization is detected by detecting the presence of an amplification product comprising the 
sequence of said primer. 

S 22. The method of claim 2 1 , wherein said polypeptide is an antibody. 

23. A method of determining whether a mammal has an elevated or reduced level of GENSET gene 
expression, said method comprising the steps of : 

a) providing a biological sample from said mammal; and 
1 0 b) comparing the amount of the polypeptide of claim 1 4, or of an RNA species 

encoding said polypeptide, within said biological sample with a level 
detected in or expected from a control sample; 
wherein an increased amount of said polypeptide or said RNA species within said biological 
sample compared to said level detected in or expected from said control sample indicates that said 
15 mammal has an elevated level of said GENSET gene expression, and wherein a decreased amount 
of said polypeptide or said RNA species within said biological sample compared to said level 
detected in or expected from said control sample indicates that said mammal has a reduced level of 
said GENSET gene expression. 

20 24, A method of identifying a candidate modulator of a GENSET polypeptide, said method 
comprising : 

a) contacting the polypeptide of claim 14 with a test compound; and 

b) determining whether said compound specifically binds to said 
polypeptide; 

25 wherein a detection that said compound specifically binds to said polypeptide indicates that 

said compound is a candidate modulator of said GENSET polypeptide. 

25. The method of claim 24, further comprising testing the biological activity of said GENSET 
polypeptide in the presence of said candidate modulator, wherein an alteration in the biological 

30 activity of said GENSET polypeptide in the presence of said compound in comparison to the 
activity in the absence of said compound indicates that the compound is a modulator of said 
GENSET polypeptide. 

26. A method for the production of a pharmaceutical composition comprising 

35 a) identiiying a modulator of a GENSET polypeptide using the method of 

claim 24; and 

b) combining said modulator with a pharmaceutical ly acceptable carrier. 



270 



wo 02/083898 



PCT/IBOl/00914 





118 

1 



120 



DATA 
RETRIEVING 
DEVICE 



DISPLAY 



Figure 1 



wo 02/083898 



PCT/IBOl/00914 



2/4 

201 



200 



( START 
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X 
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c 
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300 



4/4. 

3or 



( START 



, STORE A FIRST SEQU 


ENCE TO A MEMORY 






OPEN A DATABASE OF SEQUENCE FEATURES 




r 


READ FIRST FEATURE FROM DATABASE 




< — I 


r 


COMPARE FEATURE ATTRIBUTES WITH THE FIRST 
SEQUENCE 



304 



306 
308 



310 



READ NEXT 
FEATURE IN 
DATABASE 
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<110> GENSET SA 

<120> Full-length human cDNAs encoding potentially secreted proteins 

<130> 81. WO 

<150> US 60/197,873 

<151> 2000-04-18 

<150> US 60/224,009 

<151> 2000-08-07 

<150> US 60/260,328 

<151> 2001-01-08 

<150> US 60/224,006 

<151> 2000-08-04 

<160> 918 

<170> Patent. pm 



<210> 1 
<211> 615 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 88. .471 

<220> 

<221> sig^peptide 

<222> 88.. 189 

<223> Von Heijne matrix 

score 13.1278588169685 

seq VLLLLSLLLTAXA/SS 

<220> 

<221> polyA_site 
<222> 600.. 615 

<220> 

<221> polyA_signal 
<222> 577.. 582 

<400> 1 

acttatctgc agacttgtag gcagcaactc accctcactc agaggtcttc tggttctgga 60 
aacaactcta gctcagcctt ctccacc atg age etc aga ctt gat acc acc cct 114 

Met Ser Leu Arg Leu Asp Thr Thr Pro 
-30 

tec tgt aac agt gcg aga cca ctt cat gcc ttg cag gtg ctg ctg ctt 162 

Ser Cys Asn Ser Ala Arg Pro Leu His Ala Leu Gin Val Leu Leu Leu 

-25 -20 -15 -10 

ctg tea ttg ctg ctg act get ytg get tec tec ace aaa gga caa ayt 210 

Leu Ser Leu Leu Leu Thr Ala Leu Ala Ser Ser Thr Lys Gly Gin Xaa 
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2 









-5 










1 








5 








aag aga 


aac 


ttg 


Qcq 


aaa 


QCJC 


aaa 


aac 

-^g v> v 


qaa 


agt 


eta 


gac 


agt 


gae 


ttg 


258 


Lys Arg 


Asn 
10 


Leu 


Ala 


Lys 


Gly 


Lvs 
15 


Asp 


Glu 


Ser 


Leu 


Asp 
20 


Ser 


Asp 


Leu 




tat get 


gaa 


etc 


cgc 


tgc 


atg 


tgt 


ata 


aag 


aca 


ace 


tct 


gga 


att 


cat 


306 


Tyr Ala 


Glu 


Leu 


Arg 


Cys 


Met 


Cys 


He 


Lys 


Thr 


Thr 


Ser 


Gly 


He 


His 




25 










30 










35 












ccc aaa 


aac 


ate 


caa 


agt 


ttg 


gaa 


gtg 


ate 


ggg 


aaa 


gga 


acc 


cat 


tgc 


354 


Pro Lys 


Asn 


He 


Gin 


Ser 


Leu 


Glu 


Val 


He 


Gly 


Lys 


Gly 


Thr 


His 


Cys 




40 








45 










50 










55 




aac caa 


gtc 


gaa 


gtg 


ata 


gee 


aca 


ctg 


aag 


gat 


ggg 


agg 


aaa 


ate 


tgc 


402 


Asn Gin 


Val 


Glu 


Val 

60 


He 


Ala 


Thr 


Leu 


Lys 

65 


Asp 


Gly 


Arg 


Lys 


He 
70 


Cys 




ctg gac 


cca 


gat 


get 


ecc 


aga 


ate 


aag 


aaa 


att 


gta 


cag 


aaa 


aaa 


ttg 


450 


Leu Asp 


Pro 


Asp 
75 


Ala 


Pro 


Arg 


He 


Lys 
80 


Lys 


He 


Val 


Gin 


Lys 
85 


Lys 


Leu 




gca ggt 


gat 


gaa 


tct 


get 


gat 


taatttgttc tgtttetgec aaacttcttt 


501 


Ala Gly 


Asp 


Glu 


Ser 


Ala 


Asp 























90 



aactcccagg aagggtagaa ttttgaaacc ttgattttct agttctcatt tattcaggat 561 
acctattctt actgtattaa aatttggata tgtgtttcaa aaaaaaaaaa aaaa 615 

<210> 2 

<211> 539 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 39. .518 

<220> 

<221> sig_peptide 

<222> 39. .92 

<223> Von Heijne matrix 

score 12.2866240999866 

seq LTLAVLFLTGSQA/RH 

<220> 

<221> polyA_site 
<222> 524. .539 

<220> 

<221> polyA_signal 
<222> 507. .512 

<400> 2 

agagactgeg agaaggaggt eceecaeggc cettcagg atg aaa get geg gtg ctg 56 

Met Lys Ala Ala Val Leu 
-15 



acc 


ttg 


gcc 


gtg 


etc 


ttc 


ctg 


aeg 


ggg 


age 


cag 


get 


egg 


cat 


ttc 


tgg 


104 


Thr 


Leu 


Ala 
-10 


Val 


Leu 


Phe 


Leu 


Thr 
-5 


Gly 


Ser 


Gin 


Ala 


Arg 
1 


His 


Phe 


Trp 




cag 


caa 


gat 


gaa 


ccc 


ccc 


cag 


age 


ecc 


tgg 


gat 


eg a 


gtg 


aag 


gac 


ctg 


152 


Gin 


Gin 


Asp 


Glu 


Pro 


Pro 


Gin 


Ser 


Pro 


Trp 


Asp 


Arg 


Val 


Lys 


Asp 


Leu 




5 










10 










15 










20 




gee 


act 


gtg 


tac 


gtg 


gat 


gtg 


etc 


aaa 


gac 


age 


gge 


aga 


gac 


tat 


gtg 


200 


Ala 


Thr 


Val 


Tyr 


Val 
25 


Asp 


Val 


Leu 


Lys 


Asp 
30 


Ser 


Gly 


Arg 


Asp 


Tyr 
35 


Val 




tec 


cag 


ttt 


gaa 


gge 


tec 


gee 


ttg 


gga 


aaa 


cag 


eta 


aac 


eta 


aag 


etc 


248 


Ser 


Gin 


Phe 


Glu 
40 


Gly 


Ser 


Ala 


Leu 


Gly 
45 


Lys 


Gin 


Leu 


Asn 


Leu 
50 


Lys 


Leu 




ctt 


gae 


aac 


tgg 


gac 


age 


gtg 


ace 


tec 


gee 


ttc 


age 


aat 


ctg 


cgc 


gaa 
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Leu 


Asp 


Asn 
55 


Trp 


Asp 


Ser 


Val 


Thr 
60 


Ser 


3 

Ala 


Phe 


Ser 


Asn 
65 


Leu 


Arg 


Glu 




cag 


etc 


ggc 


cct 


gtg 


acc 


cag 


gag 


tty 


tgg 


gat 


aac 


ctg 


gaa 


aag 


gag 


344 


Gin 


Leu 
70 


Gly 


Pro 


Val 


Thr 


Gin 
75 


Glu 


Phe 


Trp 


Asp 


Asn 
80 


Leu 


Glu 


Lys 


Glu 




aca 


gag 


ggc 


ctg 


agg 


cag 


gag 


atg 


age 


aag 


gat 


ctg 


gag 


gag 


gtg 


aag 


392 


Thr 


Glu 


Gly 


Leu 


Arg 


Gin 


Glu 


Met 


Ser 


Lys 


Asp 


Leu 


Glu 


Glu 


Val 


Lys 




85 










90 










95 










100 




gcc 


aag 


gtg 


cag 


ccc 


tac 


ctg 


gac 


gac 


ttc 


cag 


aag 


aag 


tgg 


cag 


gag 


440 


Ala 


Lys 


Val 


Gin 


Pro 
105 


Tyr 


Leu 


Asp 


Asp 


Phe 
110 


Gin 


Lys 


Lys 


Trp 


Gin 
115 


Glu 




gag atg 


gag 


etc 


tac 


cgc 


cag 


aag 


gca 


get 


ttc 


tta 


act 


ate 


eta 


aca 


488 


Glu 


Met 


Glu 


Leu 


Tyr Arg 


Gin 


Lys 


Ala 


Ala 


Phe 


Leu 


Thr 


lie 


Leu 


Thr 










120 










125 










130 








age 


ett 


gga 


cca 


aat 


gga 


aat 


aaa 


get 


ttt 


tgatgaaaaa aaaaaaaaaa a 


539 


Ser 


Leu 


Gly 


Pro 


Asn 


Gly Asn 


Lys 


Ala 


Phe 




















135 










140 





















<210> 3 

<211> 796 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 134.. 373 

<220> 

<221> sig_peptide 

<222> 134. .211 

<223> Von Heijne matrix 

score 12.0462155590967 

seq LLLALLLPTQIYS/SE 

<400> 3 

atttgtggct ttcctggtat ataaggtcte gccggctcgc cgegetcccc acettgcetg 60 

cgeccgcccg gagccagcgg ttctccaagc acccagcatc ctgctagacg cgccgcgeac 120 

cgacggaggg gac atg ggc aga gca atg gtg gee agg etc ggg ctg ggg 169 

Met Gly Arg Ala Met Val Ala Arg Leu Gly Leu Gly 













■25 










■20 










■15 




ctg 


ctg 


ctg 


ctg 


gca 


ctg 


etc 


eta 


ccc 


acg 


cag 


att 


tat 


tec 


agt 


gaa 


217 


Leu 


Leu 


Leu 


Leu 


Ala 

-10 


Leu 


Leu 


Leu 


Pro 


Thr 

-5 


Gin 


He 


Tyr 


Ser 


Ser 
1 


Glu 




act 


aca 


act 


gga 


act 


tea 


agt 


aac 


tec 


tee 


cag 


agt 


act 


tee 


aac 


tct 


265 


Thr 


Thr 


Thr 
5 


Gly 


Thr 


Ser 


Ser 


Asn 
10 


Ser 


Ser 


Gin 


Ser 


Thr 
15 


Ser 


Asn 


Ser 




ggg 


ttg 


gcc 


cca 


aat 


cca 


act 


aat 


gee 


acc 


acc 


aag 


gtg 


get 


ggt 


ggt 


313 


Gly Leu 


Ala 


Pro 


Asn 


Pro 


Thr 


Asn 


Ala 


Thr 


Thr 


Lys 


Val 


Ala 


Gly 


Gly 






20 










25 










30 












gcc 


ctg 


cag 


tea 


aca 


gee 


agt 


etc 


ttc 


gtg 


gtc 


tea 


etc 


tct 


ett 


ctg 


361 


Ala 


Leu 


Gin 


Ser 


Thr 


Ala 


Ser 


Leu 


Phe 


Val 


Val 


Ser 


Leu 


Ser 


Leu 


Leu 




35 










40 










45 










50 





cay eye tac tct taagagaetc aggceaagaa aegtyttcta aatttcccca 413 
His Xaa Tyr Ser 

tettetaaac ceaatccaaa tggcgtctgg aagtccaatg tggeaaggaa aaacaggtct 473 

tcatcgaate tactaattcc acacctttta ttgacacaga aaatgttgag aatcccaaat 533 

ttgattgatt tgaagaaeat gtgagaggtt tgaetagatg atggatgcca atattaaatc 593 

tgctggagtt tcatgtaeaa gatgaaggag aggeaacatc caaaatagtt aagacatgat 653 

ttccttgaat gtggcttgag aaatatggac acttaataet accttgaaaa taagaataga 713 

aataaaggat gggattgtgg aatggagatt eagttttcat ttggttcatt aattctataa 773 

ggccataaaa caggtaatat aaa 796 



<210> 4 
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<211> 528 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 75.. 377 

<220> 

<221> sig_peptide 
<222> 75. .155 
<223> Von Heijne matrix 

score 11.8486806626293 

LFLGLLLLPLVVA/ FA 

<220> 

<221> polyA_site 
<222> 513. .528 

<220> 

<221> polyA_signal 
<222> 498. .503 

<400> 4 

attggccaca gagacccagc ccgagtttcc catcgcactg agcactgaga tcctgctgga 60 



agctctgccg cage 


atg 


age 


tec 


gca 


gcc 


ggg 


ttc 


tgc 


gcc 


tea 


cge 


ece 


110 










Met 


Ser 


Ser 
-25 


Ala 


Ala 


Gly 


Phe 


Cys 
-20 


Ala 


Ser 


Arg 


Pro 




ggg 


ctg 


ctg 


ttc 


ctg 


ggg 


ttg 


ctg 


etc 


ctg 


cca 


ctt 


gtg 


gtc 


gcc 


ttc 


158 


Gly Leu 


Leu 


Phe 


Leu 


Gly 


Leu 


Leu 


Leu 


Leu 


Pro 


Leu 


Val 


Val 


Ala 


Phe 




-15 










-10 










-5 










1 




gcc 


age 


get 


gaa 


get 


gaa 


gaa 


gat 


ggg 


gac 


ctg 


cag 


tgc 


ctg 


tgt 


gtg 


206 


Ala 


Ser 


Ala 


Glu 
5 


Ala 


Glu 


Glu 


Asp 


Gly 
10 


Asp 


Leu 


Gin 


Cys 


Leu 
15 


Cys 


Val 




aag 


acc 


acc 


tec 


cag 


gtc 


cgt 


ccc 


agg 


cac 


ate 


ace 


age 


ctg 


gag 


gtg 


254 


Lys 


Thr 


Thr 
20 


Ser 


Gin 


Val 


Arg 


Pro 
25 


Arg 


His 


He 


Thr 


Ser 
30 


Leu 


Glu 


Val 




ate 


aag 


gcc 


gga 


ccc 


cac 


tgc 


ccc 


act 


gcc 


caa 


ctg 


ata 


gcc 


acg 


ctg 


302 


He 


Lys 
35 


Ala 


Gly 


Pro 


His 


Cys 
40 


Pro 


Thr 


Ala 


Gin 


Leu 
45 


He 


Ala 


Thr 


Leu 




aag 


aat 


gga 


agg 


aaa 


att 


tgc 


ttg 


gac 


ctg 


caa 


gcc 


ccg 


ctg 


tac 


aag 


350 


Lys 


Asn 


Gly 


Arg 


Lys 


He 


Cys 


Leu 


Asp 


Leu 


Gin 


Ala 


Pro 


Leu 


Tyr 


Lys 




50 










55 










60 










65 




aaa 


ata 


att 


aag 


aaa 


ctt 


ttg 


gag 


agt 


tagctactag ctgcctacgt 




397 


Lys 


lie 


He 


Lys 


Lys 
70 


Leu 


Leu 


Glu 


Ser 



















gtgtgcattt gctatatagc atacttcttt tttccagttt caatctaact gtgaaagaac 4 57 
ttctgatatt tgtgttatcc ttatgatttt aaataaacaa aataaatcaa gttgcaaaaa 517 
aaaaaaaaaa a 528 



<210> 5 

<211> 522 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 76. .378 

<220> 

<221> sig_peptide 

<222> 76. .156 

<223> Von Heijne matrix 



PCT/IBOl/00914 



wo 02/083898 PCT/roOl/00914 

5 

score 11.8486806626293 
LFLGLLLLPLVVA/FA 

<220> 

<221> polyA_site 
<222> 508.. 522 

<220> 

<221> polyA_signal 
<222> 499.. 504 

<400> 5 

aattggccac agagacccag cccgagtttc ccatcgcact gagcactgag atcctgctgg 60 

aagctctgcc gcagc atg age tec gca gcc ggg ttc tgc gee tea ego ccc 111 

Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro 
-25 -20 



ggg 


ctg 


ctg 


ttc 


ctg 


ggg 


ttg 


ctg 


etc 


ctg 


cca 


ett 


gtg 


gtc 


gcc 


ttc 


159 


Gly Leu 


Leu 


Phe 


Leu 


Gly 


Leu 


Leu 


Leu 


Leu 


Pro 


Leu 


Val 


Val 


Ala 


Phe 




-15 










-10 










-5 










1 




gcc 


age 


get 


gaa 


get 


gaa 


gaa 


gat 


ggg 


gae 


ctg 


cag 


tgc 


ctg 


tgt 


gtg 


207 


Ala 


Ser 


Ala 


Glu 
5 


Ala 


Glu 


Glu 


Asp 


Gly 
10 


Asp 


Leu 


Gin 


Cys 


Leu 
15 


Cys 


Val 




aag 


ace 


ace 


tec 


cag 


gtc 


cgt 


ccc 


agg 


cac 


ate 


ace 


age 


ctg 


gag 


gtg 


255 


Lys 


Thr 


Thr 
20 


Ser 


Gin 


Val 


Arg 


Pro 
25 


Arg 


His 


He 


Thr 


Ser 
30 


Leu 


Glu 


Val 




ate 


aag 


gee 


gga 


ccc 


cac 


tgc 


ccc 


act 


gcc 


caa 


ctg 


ata 


gcc 


acg 


ctg 


303 


He 


Lys 
35 


Ala 


Gly 


Pro 


His 


Cys 
40 


Pro 


Thr 


Ala 


Gin 


Leu 
45 


lie 


Ala 


Thr 


Leu 




aag 


aat 


gga 


agg 


aaa 


att 


tgc 


ttg 


gae 


ctg 


caa 


gcc 


ceg 


ctg 


tac 


aag 


351 


Lys Asn 


Gly 


Arg 


Lys 


He 


Cys 


Leu 


Asp 


Leu 


Gin 


Ala 


Pro 


Leu 


Tyr 


Lys 




50 










55 










60 










65 




aaa 


ata 


att 


aag 


aaa 


ctt 


ttg 


gag 


agt 


tagctactag ctgcctacgt 




398 


Lys 


He 


He 


Lys 


Lys 
70 


Leu 


Leu 


Glu 


Ser 



















gtgtgcattt getatatagc atacttcttt tttccagttt caatctaact gtgaaagaac 458 
ttctgatatt tgtgttatcc ttatgatttt aaataaacaa aataaatcta aaaaaaaaaa 518 
aaaa 522 



<210> 6 

<211> 652 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 75. .332 

<220> 

<221> sig_peptide 

<222> 75. .128 

<223> Von Heijne matrix 

score 11.7816699306029 
seq VLLLLLLVEQAAA/LG 

<220> 

<221> polyA_site 
<222> 637.. 652 

<220> 

<221> polyA__signal 
<222> 616. .621 



<400> 6 



wo 02/083898 PCT/IBO 1/009 14 

6 

gactctttgg tcagggaact gctcgctgag cacagctgca cagtgctgtc agaacggccg 60 



atctccagcc caag 


atg 


att 


cca 


gca 


gtg 


gtc 


ttg 


etc 


tta 


etc 


ctt 


ttg 


110 








Met 


He 


Pro 


Ala 
-15 


Val 


Val 


Leu 


Leu 


Leu 
-10 


Leu 


Leu 


Leu 




gtt gaa 


caa 


gca 


gcg 


gcc 


ctg 


gga 


gag 


cct 


cag 


etc 


tgc 


tat 


ate 


ctg 


158 


Val Glu 


Gin 


Ala 


Ala 


Ala 


Leu 


Gly 


Glu 


Pro 


Gin 


Leu 


Cys 


Tyr 


He 


Leu 




-5 










1 








5 










10 




gat gcc 


ate 


etg 


ttt 


ctg 


tat 


gga 


att 


gtc 


etc 


ace 


etc 


etc 


tac 


tgt 


206 


Asp Ala 


He 


Leu 


Phe 
15 


Leu 


Tyr 


Gly 


He 


Val 
20 


Leu 


Thr 


Leu 


Leu 


Tyr 
25 


Cys 




cga ctg 


aag 


ate 


caa 


gtg 


cga 


aag 


gca 


get 


ata 


ace 


age 


tat 


gag 


aaa 


254 


Arg Leu 


Lys 


He 
30 


Gin 


Val 


Arg 


Lys 


Ala 

35 


Ala 


He 


Thr 


Ser 


Tyr 
40 


Glu 


Lys 




tea gat 


ggt 


gtt 


tac 


aeg 


ggc 


ctg 


age 


ace 


agg 


aae 


cag 


gag 


act 


tac 


302 


Ser Asp 


Gly 
45 


Val 


Tyr 


Thr 


Gly 


Leu 
50 


Ser 


Thr 


Arg 


Asn 


Gin 
55 


Glu 


Thr 


Tyr 




gag act 


ctg 


aag 


cat 


gag 


aaa 


cca 


ccm 


cag 


tagccttaga atagatgegg 


352 


Glu Thr 


Leu 


Lys 


His 


Glu 


Lys 


Pro 


Pro 


Gin 
















60 










65 























teatattett ctttggette tggttcttec agcecteatg tttggcatea catatgcetg 412 

eatgeeatta aeaceagetg gecetacecc tataatgatc etgtgteeta aattaatata 472 

caccagtggt tcctcetccc tgttaaagac taatgctcag atgetgttta eggatattta 532 

tattetagtc teaetctett gteeeaecet tcttetettc eccatteeca aetecageta 592 

aaatatggga agggagaace eccaataaaa ctgccatgga ctggaaaaaa aaaaaaaaaa 652 



<210> 7 
<211> 743 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 112. .387 

<220> 

<221> sig_peptide 

<222> 112. .183 

<223> Von Heijne matrix 

score 11.7298925418815 

seq FVLGLGLTPPTLA/QD 

<220> 

<221> polyA_site 
<222> 728. .743 

<220> 

<221> polyA_signal 
<222> 705. .710 

<400> 7 

acaeaactgg aacccatetc eaggaaeaaa cagetggaae eeatctceeg ttgaagggaa 60 
aetgceagat ttttgtaaga ttcttectcc tgggagcctg tgttggaaga g atg gtg 117 

Met Val 

atg ggc etg ggc gtt ttg ttg ttg gtc ttc gtg ctg ggt ctg ggt ctg 165 
Met Gly Leu Gly Val Leu Leu Leu Val Phe Val Leu Gly Leu Gly Leu 

-20 -15 -10 

acc cca ccg ace etg get cag gat aae tec agg tac aca eac ttc ctg 213 
Thr Pro Pro Thr Leu Ala Gin Asp Asn Ser Arg Tyr Thr His Phe Leu 

-5 15 10 

ace eag eac tat gat gee aaa cca cag ggc egg gat gae aga tac tgt 261 
Thr Gin His Tyr Asp Ala Lys Pro Gin Gly Arg Asp Asp Arg Tyr Cys 

15 20 25 

gaa age ate atg agg aga egg ggc ctg ace tea ccc tgc aaa gae ate 309 



wo 02/083898 PCT/IBO 1/009 14 

7 

Glu Ser lie Met Arg Arg Arg Gly Leu Thr Ser Pro Cys Lys Asp lie 

30 35 40 

aac aca ttt att cat ggc aac aag cgc asa tea agg cca tct gtg aaa 357 
Asn Thr Phe lie His Gly Asn Lys Arg Xaa Ser Arg Pro Ser Val Lys 

45 50 55 

aca aga atg gaa acc etc aca gag aaa acc taagaataag caagtcttct 407 
Thr Arg Met Glu Thr Leu Thr Glu Lys Thr 

60 65 
ttccaggtca ccacttgcaa gctacatgga ggttccccct ggcctccatg ccagtaccga 4 67 
gccacagcgg ggttcagaaa cgttgttgtt gcttgtgaaa atggcttacc tgtccacttg 527 
gatcagtcaa ttttccgtcg tccgtaacca gcgggcccct ggtcaagtgc tggctctgct 587 
gtccttgcct tccatttccc ctctgcaccc agaacagtgg tggcaacatt cattgccaag 647 
ggcccaaaga aagagctacc tggacctttt gttttctgtt tgacaacatg tttaataaat 707 
aaaaatgtct tgatatcagc aaaaaaaaaa aaaaaa 743 

<210> 8 

<211> 502 

<212> D.NA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 39. .371 



<220> 

<221> sig_peptide 

<222> 39. .86 

<223> Von Heijne matrix 

score 11.56948973398 
seq ILLSVALLAFSSA/QD 



<220> 

<221> polyA_site 
<222> 487. .502 



<220> 

<221> polyA_signal 
<222> 467. .472 



<400> 8 

agcataaagt tgggagtgac accagagcct tctgcaag atg ctt ctg att ctg ctg 56 

Met Leu Leu He Leu Leu 
-15 



tea 


gtg 


gee 


ctg 


ctg 


gcc 


ttc 


age 


tea 


get 


cag 


gac 


tta 


gat 


gaa 


gat 


104 


Ser 


Val 


Ala 


Leu 


Leu 


Ala 


Phe 


Ser 


Ser 


Ala 


Gin 


Asp 


Leu 


Asp 


Glu 


Asp 




-10 










-5 










1 








5 






gtc 


age 


caa 


gaa 


gac 


gtt 


ccc 


ttg 


gta 


ata 


tea 


gat 


gga 


gga 


gac 


tct 


152 


Val 


Ser 


Gin 


Glu 
10 


Asp 


Val 


Pro 


Leu 


Val 
15 


He 


Ser 


Asp 


Gly 


Gly 
20 


Asp 


Ser 




gag 


cag 


ttc 


ata 


gat 


gag 


gag 


cgt 


cag 


gga 


cca 


cct 


ttg 


gga 


gga 


cag 


200 


Glu 


Gin 


Phe 
25 


He 


Asp 


Glu 


Glu 


Arg 
30 


Gin 


Gly 


Pro 


Pro 


Leu 
35 


Gly 


Gly 


Gin 




caa 


tct 


caa 


ccc 


tct 


get 


ggt 


gat 


ggg 


aac 


cag 


aat 


gat 


ggc 


cct 


cag 


248 


Gin 


Ser 

40 


Gin 


Pro 


Ser 


Ala 


Gly 

45 


Asp 


Gly 


Asn 


Gin 


Asn 

50 


Asp 


Gly 


Pro 


Gin 




cag 


gga 


cca 


ccc 


caa 


caa 


gga 


ggc 


cag 


cag 


caa 


caa 


ggt 


cca 


cca 


cct 


296 


Gin Gly 


Pro 


Pro 


Gin 


Gin 


Gly 


Gly 


Gin 


Gin 


Gin 


Gin 


Gly 


Pro 


Pro 


Pro 




55 










60 










65 










70 




cct 


cag 


gga 


aag 


cca 


caa 


gga 


cca 


cct 


ccc 


caa 


ggg 


ggc 


cgc 


cca 


caa 


344 


Pro 


Gin 


Gly 


Lys 


Pro 
75 


Gin 


Gly 


Pro 


Pro 


Pro 
80 


Gin 


Gly 


Gly 


Arg 


Pro 
85 


Gin 




gga 


cct 


cca 


cag 


ggg 


cag 


tct 


cct 


cag 


taatctagga ttcaatgaca 




391 


Gly 


Pro 


Pro 


Gin 


Gly 


Gin 


Ser 


Pro 


Gin 



















wo 02/083898 



PCT/IBOl/00914 



8 

90 95 
ggaagtgaat aagaagatga cagtgtttca aatgccttga aacataatgt gatcatgctc 451 
taacttcaat ataccaataa aataatcagc ttgctaaaaa aaaaaaaaaa a 502 



<210> 9 

<211> 401 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 36. .353 



<220> 

<221> sig_peptide 

<222> 36. .83 

<223> Von Heijne matrix 

score 11.56948973398 
seq ILLSVALLAFSSA/QD 



<220> 

<221> polyA_site 
<222> 386. ,401 



<400> 9 

ataaagttgg gagtgacacc agagccttct gcaag atg ctt ctg att ctg ctg 53 

Met Leu Leu lie Leu Leu 
-15 

tea gtg gcc ctg ctg gcc ttc age tea get cag gac tta gat gaa gat 101 

Ser Val Ala Leu Leu Ala Phe Ser Ser Ala Gin Asp Leu Asp Glu Asp 

-10 -5 15 

gtc age caa gaa gac gtt ccc ttg gta ata tea gat gga gga gac tct 14 9 

Val Ser Gin Glu Asp Val Pro Leu Val lie Ser Asp Gly Gly Asp Ser 

10 15 20 

gag cag ttc ata gat gag gag cgt cag gga cca cct ttg gga gga cag 197 
Glu Gin Phe He Asp Glu Glu Arg Gin Gly Pro Pro Leu Gly Gly Gin 

25 30 35 

caa tct caa ccc tct get ggt gat ggg aac cag gat gat ggc cct cag 245 
Gin Ser Gin Pro Ser Ala Gly Asp Gly Asn Gin Asp Asp Gly Pro Gin 

40 45 50 

cag gga cca ccc caa caa gga ggc cag cag caa caa ggt cca cca cct 293 
Gin Gly Pro Pro Gin Gin Gly Gly Gin Gin Gin Gin Gly Pro Pro Pro 
55 60 65 70 

cct cag gga aag cca caa gga cca ccc caa cag gga ggc cag tct tgt 341 
Pro Gin Gly Lys Pro Gin Gly Pro Pro Gin Gin Gly Gly Gin Ser Cys 

75 80 85 

tgc tgt gac aag taacagcctt ttttttaaat tgttttttat acaaaaaaaa 393 
Cys Cys Asp Lys 

90 

aaaaaaaa 401 

<210> 10 

<211> 961 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 45. .467 



<220> 

<221> sig_peptide 
<222> 45. .116 



wo 02/083898 



PCT/IBOl/00914 



<223> Von Heijne matrix 

score 10.983328612314 
seq LAGLCCLVPVSLA/ED 



<220> 

<221> polyA__site 
<222> 946. .961 

<220> 

<221> polyA_signal 
<222> 927. .932 



<400> 10 

ctcagcttca ggcaccacca ctgacctggg acagtgaatc gaca atg ccg tct tct 56 

Met Pro Ser Ser 
tgc ctg gtc cct 104 
Cys Leu Val Pro 
-5 



ate 


teg 




yy- 




etc 








ggc 


ctg 


tgc 


Val 


Ser 




Glv 


lie 


Leu 


Leu 


Leu 


Ala 


Gly 


Lg u 




-20 










-15 










-10 




qtc 


tec 


Ctd 


act 


craa 


oat 


ccc 


cag 


aaa 




get 


gcc 


Val 


Ser 


Leu 


Ala 


Glu 
1 


Asp 


Pro 


Gin 


Gly 

5 


Asp 


Ala 


Ala 


aca 


tec 


cac 


cat 


gat 


cag 


gat 


cac 


cea 


acc 


ttc 


aac 


Thr 


Ser 


His 


His 


Asp 


Gin 


Asp 


His 


Pro 


Thr 


Phe 


Asn 






15 










20 










aac 


ctg 


get 


gag 


ttc 


gcc 


ttc 


age 


eta 


tac 


cgc 


cag 


Asn 


Leu 


Ala 


Glu 


Phe 


Ala 


Phe 


Ser 


Leu 


Tyr 


Arg 


Gin 




30 










35 










40 


tec 


aac 


age 


acc 


aat 


ate 


ttc 


ttc 


tee 


cea 


gtg 


age 


Ser 


Asn 


Ser 


Thr 


Asn 


lie 


Phe 


Phe 


Ser 


Pro 


Val 


Ser 


45 










50 










55 




ttt 


gca 


atg 


etc 


tec 


ctg 


ggg 


acc 


aag 


get 


gae 


act 


Phe 


Ala 


Met 


Leu 


Ser 


Leu 


Gly 


Thr 


Lys 


Ala 


Asp 


Thr 










65 










70 






ctg 


gag 


ggc 


ctg 


aat 


ttc 


aac 


etc 


acg 


gag 


att 


ccg 


Leu 


Glu 


Gly 


Leu 


Asn 


Phe 


Asn 


Leu 


Thr 


Glu 


He 


Pro 








80 










85 








cat 


gaa 


ggc 


ttc 


cag 


gaa 


etc 


etc 


cgt 


acc 


etc 


aac 


His 


Glu 


Gly 


Phe 


Gin 


Glu 


Leu 


Leu 


Arg 


Thr 


Leu 


Asn 






95 










100 










cag 


etc 


cag 


ctg 


acc 


acc 


ggc 


aag 


aat 


taatgctttc 1 


Gin 


Leu 


Gin 


Leu 


Thr 


Thr 


Gly 


Lys 


Asn 









cag 


aag 


aca 


gat 


152 


Gin 


Lys 


Thr 


Asp 






10 








aag 


ate 


ace 


ccc 


200 


Lys 


He 


Thr 


Pro 




25 










ctg 


gca 


cac 


cag 


248 


Leu 


Ala 


His 


Gin 




ate 


get 


aca 


gcc 


296 


He 


Ala 


Thr 


Ala 










60 




cac 


gat 


gaa 


ate 


344 


His 


Asp 


Glu 


He 








75 






gag 


get 


cag 


ate 


392 


Glu 


Ala 


Gin 


He 






90 








cag 


cea 


gae 


age 


440 


Gin 


Pro 


Asp 


Ser 




105 











actaagt 487 

Lys Asn 
110 115 

taetttgagt attaaeaagt agcttatact gaggtgtget teaagggage attatagggc 547 

agaaaacaaa ttaaaatgaa atgttgttgt tatagceatg eattgaccat taaatgeatt 607 

ttaatatetc ggcaaaatga gatttgaaac tcattagttt aactcttaaa acatagtgtt 667 

gttaaaeagt attttttggg ggcgggtgga tagcttttts ttctatcttg tacagaaaac 727 

agtaatggaa tcactcatgt tcagaacctt gtaaatatgt ctgtaatatt aatgcttccc 787 

tttattaaca caaatataca ttcaaaaagg gagattttat ttagacacac attttctttt 847 

caecctaatc tatttagatt catacttatt gagacgggaa aaactcgetg taaaataatg 907 

ccaacctaga taatgctata ataaattatt ttgtacacaa aaaaaaaaaa aaaa 961 



<210> 11 

<211> 643 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 91. .462 



<220> 

<221> sig_peptide 
<222> 91. .180 



wo 02/083898 

<223> Von Heijne matrix 

score 9.99993961154359 
seq LLFGFTLVSGTGA/EK 

<220> 

<221> polyA_site 
<222> 628.. 643 

<220> 

<221> polyA_signal 
<222> 607. .612 

<400> 11 

acccctaccc cacgccccct cccgcgcgcg cggttaaatc cccgcacctg agcatcggct 60 
cacacctgca ccccgcccgg gcatagcacc atg cct get tgt cgc eta gge ccg 114 

Met Pro Ala Cys Arg Leu Gly Pro 
-30 -25 



eta 


gcc 


gcc 


gcc 


etc 


etc 


etc 


age 


ctg 


ctg 


ctg 


ttc 


ggc 


ttc 


acc 


eta 


162 


Leu 


Ala 


Ala 

-20 


Ala 


Leu 


Leu 


Leu 


Ser 

-15 


Leu 


Leu 


Leu 


Phe 


Gly 
-10 


Phe 


Thr 


Leu 




gtc 


tea 


ggc 


aca 


gga 


gea 


gag 


aag 


act 


gge 


gtg 


tgc 


ccc 


gag 


etc 


cag 


210 


Val 


Ser 
-5 


Gly 


Thr 


Gly 


Ala 


Glu 
1 


Lys 


Thr 


Gly 


Val 
5 


Cys 


Pro 


Glu 


Leu 


Gin 
10 




get 


gac 


cag 


aac 


tgc 


acg 


caa 


gag 


tgc 


gtc 


teg 


gac 


age 


gaa 


tgc 


gcc 


258 


Ala 


Asp 


Gin 


Asn 


Cys 
15 


Thr 


Gin 


Glu 


Cys 


Val 
20 


Ser 


Asp 


Ser 


Glu 


Cys 
25 


Ala 




gac 


aac 


etc 


aag 


tgc 


tgc 


age 


geg 


ggc 


tgt 


gcc 


acc 


ttc 


tgc 


tct 


ctg 


306 


Asp 


Asn 


Leu 


Lys 
30 


Cys 


Cys 


Ser 


Ala 


Gly 

35 


Cys 


Ala 


Thr 


Phe 


Cys 

40 


Ser 


Leu 




ccc 


aat 


gat 


aag 


gag 


ggt 


tec 


tgc 


ccc 


cag 


gtg 


aac 


att 


aac 


ttt 


ccc 


354 


Pro 


Asn 


Asp 
45 


Lys 


Glu 


Gly 


Ser 


Cys 
50 


Pro 


Gin 


Val 


Asn 


He 
55 


Asn 


Phe 


Pro 




cag 


etc 


ggc 


etc 


tgt 


egg 


gac 


cag 


tgc 


cag 


gtg 


gac 


age 


cag 


tgt 


cct 


402 


Gin 


Leu 
60 


Gly 


Leu 


Cys 


Arg 


Asp 
65 


Gin 


Cys 


Gin 


Val 


Asp 
70 


Ser 


Gin 


Cys 


Pro 




ggc 


cag 


atg 


aaa 


tgc 


tgc 


cgc 


aat 


ggc 


tgt 


ggg 


aag 


gtg 


tec 


tgt 


gtc 


450 


Gly Gin 


Met 


Lys 


Cys 


Cys 


Arg 


Asn 


Gly 


Cys 


Gly 


Lys 


Val 


Ser 


Cys 


Val 




75 










80 










85 










90 




act 


ccc 


aat 


ttc 


tgagctecag ecaccaecag getgagcagt gaggagagaa 




502 


Thr 


Pro 


Asn 


Phe 





























agtttctgcc tggccctgea tctggttcca gcccacctgc cctccccttt ttcgggactc 562 
tgtattccct cttgggctga ccacagctte tccctttcec aaceaataaa gtaaccactt 622 
tcagcaaaaa aaaaaaaaaa a 643 



<210> 12 

<211> 577 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 28. .399 

<220> 

<221> sig_peptide 

<222> 28. .117 

<223> Von Heijne matrix 

score 9.99993961154359 
seq LLFGFTLVSGTGA/EK 



PCT/IBOl/00914 

to 



<220> 

<221> pplyA_site 
<222> 562. .577 



wo 02/083898 



PCT/IBOl/00914 



11 

<220> 

<221> polyA__signal 
<222> 540.. 545 

<400> 12 

acctgcaccc cgcccgggca tagcacc atg cct get tgt cgc eta ggc eeg eta 54 

Met Pro Ala Cys Arg Leu Gly Pro Leu 
-30 -25 



gee 


gee 


gcc 


cte 


cte 


etc 


age 


ctg 


ctg 


ctg 


tte 


ggc 


ttc 


ace 


eta 


gtc 


102 


Ala 


Ala 
-20 


Ala 


Leu 


Leu 


Leu 


Ser 
-15 


Leu 


Leu 


Leu 


Phe 


Gly 
-10 


Phe 


Thr 


Leu 


Val 




tea 


ggc 


aea 


gga 


gea 


gag 


aag 


act 


ggc 


gtg 


tgc 


cce 


gag 


etc 


cag 


get 


150 


Ser 


Gly 


Thr 


Gly 


Ala 


Glu 


Lys 


Thr 


Gly 


Val 


Cys 


Pro 


Glu 


Leu 


Gin 


Ala 




-5 










1 








5 










10 






gac 


cag 


aac 


tgc 


aeg 


caa 


gag 


tgc 


gtc 


teg 


gac 


age 


gaa 


tgc 


gee 


gac 


198 


Asp Gin 


Asn 


Cys 


Thr 


Gin 


Glu 


Cys 


Val 


Ser 


Asp 


Ser 


Glu 


Cys 


Ala 


Asp 










15 










20 










25 








aac 


ate 


aag 


tgc 


tgc 


age 


gcg 


ggc 


tgt 


gee 


ace 


ttc 


tgc 


tet 


ctg 


cce 


246 


Asn 


He 


Lys 
30 


Cys 


Cys 


Ser 


Ala 


Gly 
35 


Cys 


Ala 


Thr 


Phe 


Cys 
40 


Ser 


Leu 


Pro 




aat 


gat 


aag 


gag 


ggt 


tec 


tgc 


cce 


cag 


gtg 


aac 


att 


aac 


ttt 


cec 


cag 


294 


Asn Asp 


Lys 


Glu 


Gly 


Ser 


Cys 


Pro 


Gin 


Val 


Asn 


He 


Asn 


Phe 


Pro 


Gin 






45 










50 










55 












cte 


ggc 


etc 


tgt 


egg 


gac 


cag 


tgc 


cag 


gtg 


gac 


age 


cag 


tgt 


cct 


ggc 


342 


Leu 


Gly 


Leu 


Cys 


Arg 


Asp 


Gin 


Cys 


Gin 


Val 


Asp 


Ser 


Gin 


Cys 


Pro 


Gly 




60 










65 










70 










75 




cag 


atg 


aaa 


tgc 


tgc 


cgc 


aat 


ggc 


tgt 


ggg 


aag 


gtg 


tec 


tgt 


gtc 


act 


390 


Gin 


Met 


Lys 


Cys 


Cys 


Arg 


Asn 


Gly 


Cys 


Gly 


Lys 


Val 


Ser 


Cys 


Val 


Thr 





80 85 90 



cce aat ttc tgagctccag ccacaggctg ageagtgagg agagaaagtt 439 
Pro Asn Phe 

tctgectgge cetgcatetg gttecagcec acetgecctc cccttttteg ggaetctgta 499 

ttccctcttg ggctgaccac agcttetccc ttteccaacc aataaagtaa ceactttcag 559 

etaaaaaaaa aaaaaaaa 577 

<210> 13 
<211> 785 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 10. .216 

<220> 

<221> sig peptide 

<222> 10.760 

<223> Von Heijne matrix 

score 9.5148977432302 

FVFLLFVISLAAA/AH 

<400> 13 

aacccactt atg ttt cct ggc ttt gtt ttt ttg cte ttt gta ata agt ctt 51 
Met Phe Pro Gly Phe Val Phe Leu Leu Phe Val lie Ser Leu 
-15 -10 -5 

get get get get cat ctt tgg gtc ctg gee gcc ttt atg ggc cgt ate 99 
Ala Ala Ala Ala His Leu Trp Val Leu Ala Ala Phe Met Gly Arg He 

15 10 
ace gtg aag gtc tgc age ttc act cct gag gcc age aag ace gtg age 147 
Thr Val Lys Val Cys Ser Phe Thr Pro Glu Ala Ser Lys Thr Val Ser 

15 20 25 

eca cea gaa gga gcg aac aac teg aga cgc act get ttt aag age tgt 195 



wo 02/083898 PCT/IBOl/00914 

12 

Pro Pro Glu Gly Ala Asn Asn Ser Arg Arg Thr Ala Phe Lys Ser Cys 
30 35 40 45 

aac act cac cac aaa ggt ctg tagcatcact cctgaagtca gcgggaccac 24 6 
Asn Thr His His Lys Gly Leu 
50 

gaacctacca gaaagaagaa actccataca gatctgaaaa tctgaaggaa caaactccgg 306 

acacaccatc tttaagaact gtaacactca ctgcgagggt ccacggcttc atttttgaag 366 

tcagccagac caagaaccca ccaattccgc acacacttct gctgtttcgt tccattctca 426 

aggaaaaact gtgatgggtc catttttaat cacatgcaaa ttcatgggca aatctttgct 486 

atggttatgt actttgattt agtagagttg tatcacatgt tcactcatgt gcccacaata 546 

gcagggtata ccaataggct gcttcaaaac aatcatgaaa agtaggaggt ggtgtttcca 606 

aaggtagtgg tcttgccaga aaaaaatgga aatactgtga agataaaaat aaacatctat 666 

taacacaaaa atagaaaaat aatataggta cacatacaca tacttttcta catatttttt 726 

gcagtcatat cagaggaaca atacagagga acaaaagtgt aaatcaaaaa aaaaaaaaa 785 



<210> 14 

<211> 1154 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 88. ,318 



<220> 

<221> sig_peptide 

<222> 88. .186 

<223> Von Heijne matrix 

score 9.33181480765342 
seq MVLLSFALTLCSA/FW 



<220> 

<221> polyA_site 
<222> 1139, .1154 



<400> 14 

ctgtgggtgc ccaggaaggg actacacctc ttcgcagtgt tttatacctt tggtaatatc 60 
gcatcaattg ggagtaccat cttcctc atg gga cca gtg aaa cag ctg aag cga 114 

Met Gly Pro Val Lys Gin Leu Lys Arg 
-30 -25 



atg 


ttt 


gag 


cot 


act 


cgt 


ttg 


att 


gca 


act 


ate 


atg 


gtg 


etg 


ttg 


agt 


162 


Met 


Phe 


Glu 


Pro 


Thr 


Arg 


Leu 


lie 


Ala 


Thr 


lie 


Met 


Val 


Leu 


Leu 


Ser 












-20 










-15 










-10 






ttt 


gca 


ctt 


acc 


ctg 


tgt 


tct 


gcc 


ttt 


tgg 


tgg 


eat 


aac 


awg 


gga 


ctt 


210 


Phe 


Ala 


Leu 


Thr 
-5 


Leu 


Cys 


Ser 


Ala 


Phe 
1 


Trp 


Trp 


His 


Asn 
5 


Xaa 


Gly 


Leu 




gca 


ctt 


ate 


ttc 


tgc 


att 


ttg 


cag 


tct 


ttg 


gca 


ttg 


acg 


tgg 


tac 


age 


258 


Ala 


Leu 


lie 


Phe 


Cys 


lie 


Leu 


Gin 


Ser 


Leu 


Ala 


Leu 


Thr 


Trp 


Tyr 


Ser 






10 










15 










20 












ctt 


tec 


ttc 


ata 


cca 


ttt 


gca 


agg 


gat 


get 


gtg 


aag 


aag 


tgt 


ttt 


gee 


306 


Leu 


Ser 


Phe 


lie 


Pro 


Phe 


Ala 


Arg 


Asp 


Ala 


Val 


Lys 


Lys 


Cys 


Phe 


Ala 





25 30 35 40 



gtg tgt ctt gca taattcatgg ceagttttat gaagctttgg aaggcactat 358 
Val Cys Leu Ala 

ggacagaagc tggtggacag ttttgtaaet atettegaaa ectetgtctt acagacatgt 418 

gccttttate ttgcagcaat gtgttgcttg tgattcgaac atttgagggt tacttttgga 478 

agcaacaata cattetcgaa cctgaatgtc agtagcacag gatgagaagt gggttetgta 538 

tcttgtggag tggaatctte eteatgtaec tgtttcctct etggatgttg tcceactgaa 598 

ttcceatgaa tacaaaccta ttcagcaaca gcacataagc cttgggtgca agtgattcec 658 

aggtggeaaa aggeageece atcagagatc acgggagcaa cagtaaggga cagagttttg 718 

gggtccaett gtecetcagc atggaageca tcaeegtggt cetgcataga gtgagtctae 778 

ttctactctg gcatetgaga acaagtgact ctgetttaga eaageccctg gagageetgg 838 

ccatggagtg aggtagaaaa gaageaettt ttggtggtat atgctgtttc tgaatttgaa 898 



wo 02/083898 PCT/IBO 1/009 14 

13 

tctaagcatt cttggtgttc tgattctgcg cctgactgtg gaatgtattt tctgacatta 958 

ggtgcagact ctttttctct tgggttttat ttcccatccc tccctccctc tccctgcata 1018 

tctttcttta tgacctcctg gcactagatg gggaaagaac agagcagcag cagcagaaaa 1078 

tgctgctgtt ttttttggac gagagcccat aggagttgaa aaatcctgct gctctcgtcc 1138 

aaaaaaaaaa aaaaaa 1154 

<210> 15 
<211> 766 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 84.. 521 



<220> 

<221> sig_peptide 

<222> 84.. 161 

<223> Von Heijne matrix 

score 8.99656184291183 
seq GLLWAFCAPGARA/EE 



<220> 

<221> polyA_site 
<222> 751.. 766 



<220> 

<221> polyA signal 
<222>.728. .733 



<400> 15 

gaggaagagc gttttgggga cgggggctgg tgaggctcac gttggagggc ttcgcgtctg 60 
cttcggagac cgtaaggata ttg atg acc atg aga tec ctg etc aga ace ecc 113 

Met Thr Met Arg Ser Leu Leu Arg Thr Pro 
-25 -20 



ttc 


ctg 


tgt 


ggc 


ctg 


etc 


tgg 


gcc 


ttt 


tgt 


gcc 


cca 


ggc 


gcc 


agg 


get 


161 


Phe 


Leu 


Cys 


Gly 


Leu 


'Leu 


Trp 


Ala 


Phe 


Cys 


Ala 


Pro 


Gly 


Ala 


Arg 


Ala 






-15 










-10 










-5 












gag 


gag 


ect 


gca 


gee 


age 


ttc 


tee 


caa 


ecc 


ggc 


age 


atg 


ggc 


ctg 


gat 


209 


Glu 


Glu 


Pro 


Ala 


Ala 


Ser 


Phe 


Ser 


Gin 


Pro 


Gly 


Ser 


Met 


Gly 


Leu 


Asp 




1 








5 










10 










15 






aag 


. aac 


aca 


gtg 


cac 


gae 


caa 


gag 


cat 


ate 


atg 


gag 


eat 


eta 


gaa 


ggt 


257 


Lys Asn 


Thr 


Val 


His 


Asp 


Gin 


Glu 


His 


He 


Met 


Glu 


His 


Leu 


Glu 


Gly 










20 










25 










30. 








gtc 


ate 


aac 


aaa 


cca 


gag 


gcg 


gag 


atg 


teg 


cca 


caa 


gaa 


ttg 


eag 


etc 


305 


Val 


lie 


Asn 


Lys 


Pro 


Glu 


Ala 


Glu 


Met 


Ser 


Pro 


Gin 


Glu 


Leu 


Gin 


Leu 








35 










40 










45 










cat 


tac 


ttc 


aaa 


atg 


eat 


gat 


tat 


gat 


ggc 


aat 


aat 


ttg 


ctt 


gat 


ggc 


353 


His 


Tyr 


Phe 


Lys 


Met 


His 


Asp 


Tyr 


Asp 


Gly 


Asn 


Asn 


Leu 


Leu 


Asp 


Gly 






50 










55 










60 












tta 


gaa 


etc 


tec 


aca 


gee 


ate 


act 


cat 


gtc 


eat 


aag 


gag 


gaa 


ggg 


agt 


401 


Leu 


Glu 


Leu 


Ser 


Thr 


Ala 


He 


Thr 


His 


Val 


His 


Lys 


Glu 


Glu 


Gly 


Ser 




65 










70 










75 










80 




gaa 


cag 


gca 


cca 


eta 


atg 


agt 


gaa 


gat 


gaa 


ctg 


att 


aac 


ata 


ata 


gat 


449 


Glu 


Gin 


Ala 


Pro 


Leu 


Met 


Ser 


Glu 


Asp Glu 


Leu 


He 


Asn 


He 


He 


Asp 












85 










90 










95 






ggt 


gtt 


ttg 


aga 


gat 


gat 


gae 


aag 


aac 


aat 


gat 


gga 


tac 


att 


gae 


tat 


497 


Gly Val 


Leu 


Arg 


Asp 


Asp 


Asp 


Lys 


Asn 


Asn 


Asp 


Gly 


Tyr 


He 


Asp 


Tyr 










100 










105 










110 








get 


gaa 


ttt 


gca 


aaa 


tea 


ctg 


eag 


tagatgttat ttggecatct ectggttata 


551 


Ala 


Glu 


Phe 


Ala 


Lys 


Ser 


Leu 


Gin 





















115 120 



tacaaatgtg acccgtgata atgtgattga acaetttagt aatgegaaat aactcatttc 611 



wo 02/083898 PCT/IBOl/00914 
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caactactgc tgcagcattt tggtaaaaac ctgtagcgat tcgttacact ggggtgagaa 671 
gagataagag aaatgaaaga gaagagaaat gggacatcta atagtcccta agtgstatta 731 
aataccttat tggacaagca aaaaaaaaaa aaaaa 766 

<210> 16 

<211> 808 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 155.. 340 



<220> 

<221> sig_peptide 

<222> 155.. 292 

<223> Von Heijne matrix 

score 8.64329745298384 
AVLLLILFAIVFG/LL 



<220> 

<221> polyA_site 
<222> 754., 808 



<220> 

<221> polyA_signal 
<222> 728., 733 



<400> 16 

cttttcctct caacagttgc ttctttgagt cagggtgcag ctctggtcac ctggcggcct 60 
cttcagctca gccctccaca aagtgtgagc ctgaaggacc accctgaatt gcccttgtag 120 
gacccagaac agctaccagc agaatcagat tctc atg gac caa ctg gta ttc aaa 175 

Met Asp Gin Leu Val Phe Lys 
-45 -40 



gag 


aca 


ate 


tgg 


aat 


gat 


gcg 


ttc 


tgg 


cag 


aac 


ccc tgg gac cag ggg 


223 


Glu 


Thr 


lie 


Trp 


Asn 


Asp 


Ala 


Phe 


Trp 


Gin 


Asn 


Pro Trp Asp Gin Gly 












-35 










-30 




-25 




ggc 


ctg 


gca 


gtg 


att 


ate 


tta 


ttc 


ate 


ace 


get 


gte ctg ett etc ate 


271 


Gly 


Leu 


Ala 


Val 


He 


He 


Leu 


Phe 


He 


Thr 


Ala 


Val Leu Leu Leu He 










-20 










-15 






-10 




tta 


ttt 


gcc 


ate 


gtg 


ttt 


ggt 


tta 


etc 


act 


tec 


aca gaa aac act cag 


319 


Leu 


Phe 


Ala 

-5 


He 


Val 


Phe 


Gly 


Leu 


Leu 


Thr 


Ser 


Thr Glu Asn Thr Gin 

c 




tgt 


gaa 


gcg 


ggt 


gaa 


gag 


gag 


tgacctgact tgctggggac tgagatggca 


370 


Cys 


Glu 


Ala Gly 


Glu 


Glu 


Glu 














10 










15 

















gcaggggagg cgagctgacc tgcccccatt ecagtggtgg gccccttcgc ggttccctct 430 

ggctcagggg ccaagcectg gtgtcttcct ttcecaccag gaaaaagtct agtaaaatac 490 

tgtatctggc ttagggttgg tcagaetagt aagatgggga ggctggtctg agaccaattc 550 

tggctccttg accctattgt ttttagggtt ceccgaecag aaccctaaaa geacatggag 610 

aggatggcte cactgcetca ggtggaagga getatggeta aeaaggttct etaacaggct 670 

cacaggceca geeageaatt teacaaatec ttgacagaga aagaeacaac caaatgaaat 730 

aaaaattect tttcaaatct getaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaramaaaa 790 

aaaaaaaaaa aaaaaaaa 808 



<210> 17 

<211> 772 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 155. .340 



1 



wo 02/083898 PCT/IBO 1/00914 
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<220> 

<221> sig_peptide 

<222> 155, .292 

<223> Von Heijne matrix 

score 8.64329745298384 

AVLLLILFAIVFG/LL 

<220> 

<221> polyA__site 
<222> 751. .772 

<220> 

<221> polyA^signal 
<222> 728.. 733 

<400> 17 

cttttcctct caacagttgc ttctttgagt cagggtgcag ctctggtcac ctggcggcct 60 
cttcagctca gccctccaca aagtgtgagc ctgaaggacc accctgaatt gcccttgtag 120 
gacccagaac agcaaccagc agaatcagat tctc atg gac caa ttg gta ttc aaa 175 

Met Asp Gin Leu Val Phe Lys 
-45 -40 



gag 


aca 


ate tgg 


aat 


gat 


gcg 


ttc 


tgg 


eag 


aac 


ecc tgg gac cag ggg 


223 


Glu 


Thr 


lie Trp 


Asn 


Asp 


Ala 


Phe 


Trp 


Gin 


Asn 


Pro Trp Asp Gin Gly 










-35 










-30 




-25 




ggc 


ctg 


gca gtg 


att 


ate 


tta 


ttc 


ate 


ace 


get 


gtc ctg ett etc ate 


271 


Gly Leu 


Ala Val 


He 


He 


Leu 


Phe 


He 


Thr 


Ala 


Val Leu Leu Leu He 








-20 










-15 






-10 




tta 


ttt 


gcc ate 


gtg 


ttt 


ggt 


tta 


etc 


act 


tec 


aca gaa aac act cag 


319 


Leu 


Phe 


Ala He 
-5 


Val 


Phe 


Gly 


Leu 


Leu 


Thr 


Ser 


Thr Glu Asn Thr Gin 




tgt 


gaa 


gcg ggt 


gaa 


gag 


gag 


tgacctgact tgctggggac tgagatggca 


370 


Cys 


Glu 


Ala Gly 


Glu 


Glu 


Glu 














10 








15 

















gcaggggagg cgagctgacc tgeccccatt ccagtggtgg gecccttcgc ggttccetet 430 

ggcteagggg ceaagcectg gtgtcttcct ttcecaccag gaaaaagtet agtaaaatae 490 

tgtatctggc ttagggttgg tcagactagt aagatgggga ggctggtctg agaccaattc 550 

tggctccttg accctattgt ttttagggtt ccccgaccag aaccctaaaa gcacatggag 610 

aggatggcte cactgcctca ggtggaagga gctatggcta acaaggttet etaacaggct 670 

cacaggecca gecagcaatt tcacaaatcc ttgacagaga aagaeacaac caaatgaaat 730 

aaaaattcct tttcaaatct aaaaaaaaaa aaaaatgega aa 772 

<210> 18 
<211> 546 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 13.. 498 

<220> 

<221> sig_peptide 

<222> 13. .153 

<223> Von Heijne matrix 

score 4.65474235719409 

seq LALSSLRSLLLFA/GM 

<220> 

<221> polyA_site 
<222> 533.. 546 



<220> 



wo 02/083898 
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16 

<221> polyA_signal 
<222> 514. .519 



<400> 18 

aagcgctgac gc atg cgc ata get aac cgc acc egg ttc age teg cct ttc 51 
Met Arg lie Ala Asn Arg Thr Arg Phe Ser Ser Pro Phe 
-45 -40 -35 

ttg gcc aga ggc gcc ggt tgg act cac ggg egg ggc atg atg gtg gtg 99 
Leu Ala Arg Gly Ala Gly Trp Thr His Gly Arg Gly Met Met Val Val 

-30 -25 -20 

ggt acg ggc aee teg etg gcg etc tee tee etc egg tee etg etg etc 147 
Gly Thr Gly Thr Ser Leu Ala Leu Ser Ser Leu Arg Ser Leu Leu Leu 

-15 -10 -5 

ttt get ggg atg eag atg tac age egt cag etg gee tec acc gag tgg 195 
Phe Ala Gly Met Gin Met Tyr Ser Arg Gin Leu Ala Ser Thr Glu Trp 

15 10 
etc acc ate cag ggc ggc etg ctt ggt teg ggt etc ttc gtg ttc teg 243 
Leu Thr lie Gin Gly Gly Leu Leu Gly Ser Gly Leu Phe Val Phe Ser 
15 20 25 30 

291 



339 



etc 


act 


gee 


ttc 


aat 


aat 


etg 


gag 


aat 


ctt 


gte 


ttt 


ggc 


aaa 


gga 


ttc 


Leu 


Thr 


Ala 


Phe 


Asn 


Asn 


Leu 


Glu 


Asn 


Leu 


Val 


Phe 


Gly 


Lys 


Gly 


Phe 










35 










40 










45 




eaa 


gea 


aag 


ate 


ttc 


cet 


gag 


att 


etc 


etg 


tge 


etc 


etg 


ttg 


get 


etc 


Gin 


Ala 


Lys 


He 


Phe 


Pro 


Glu 


He 


Leu 


Leu 


Cys 


Leu 


Leu 


Leu 


Ala 


Leu 








50 










55 










60 






ttt 


gea 


tet 


ggc 


etc 


ate 


cac 


ega 


gte 


tgt 


gte 


ace 


acy 


tge 


ttc 


ate 


Phe 


Ala 


Ser Gly 


Leu 


He 


His 


Arg 


Val 


Cys 


Val 


Thr 


Thr 


Cys 


Phe 


He 






65 










70 










75 








ttc 


tec 


atg 


gtt 


ggt 


etg 


tac 


tac 


ate 


aac 


aag 


ate 


tec 


tec 


acc 


etg 


Phe 


Ser 


Met 


Val 


Gly 


Leu 


Tyr 


Tyr 


He 


Asn 


Lys 


He 


Ser 


Ser 


Thr 


Leu 




80 










85 










90 










tac 


eag 


gea 


gea 


get 


cea 


gte 


etc 


aea 


cea 


gee 


aag 


gte 


aea 


ggc 


aag 


Tyr 


Gin 


Ala 


Ala 


Ala 


Pro 


Val 


Leu 


Thr 


Pro 


Ala 


Lys 


Val 


Thr 


Gly 


Lys 


95 










100 










105 










110 



387 



435 



483 



age aag aag aga aac tgaccctgaa tgttcaataa agttgattet ttgcaaaaaa 538 
Ser Lys Lys Arg Asn 
115 

aaaaaaaa 546 



<210> 19 

<211> 767 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 132.. 401 



<220> 

<221> sig__peptide 

<222> 132. .236 

<223> Von Heijne matrix 

score 8.35677668307059 
seq LGLGLVTLTASWA/AL 



<220> 

<221> polyA_site 
<222> 752. .767 



<220> 

<221> polyA___signal 
<222> 741. .746 



wo 02/083898 PCT/IBO 1/009 14 

17 

<400> 19 

ccttcagcgc ccggccgaga gcccggtggc tgcgactgag agcctggtta ctgccggcac 60 
ctttggatcc tcggccaatt ctctgcttcg cctacacttg gcgtgcggtg taccagcctc 120 
cccgcgtcac c atg gaa aca gga gcc tct gca tec ate cca gag ctg ate 170 
Met Glu Thr Gly Ala Ser Ala Ser lie Pro Glu Leu He 
-35 -30 -25 

tgt gaa get atg aga aga ate tgg age ctg gga ttg ggg ttg gtg act 218 
Cys Glu Ala Met Arg Arg He Trp Ser Leu Gly Leu Gly Leu Val Thr 

-20 -15 -10 

ctg aeg gcc age tgg gca get ctt ttc cac gat gge ttt gcg gtt ctt 266 
Leu Thr Ala Ser Trp Ala Ala Leu Phe His Asp Gly Phe Ala Val Leu 

-5 15 10 

gga gga aae att gtg age gat cte age aea gta aga ttt gtt gca eaa 314 
Gly Gly Asn He Val Ser Asp Leu Ser Thr Val Arg Phe Val Ala Gin 

15 20 25 

cag cag cac ttc cag cte ctt gac gtg tgg ace agg aat ttc egg aag 362 
Gin Gin His Phe Gin Leu Leu Asp Val Trp Thr Arg Asn Phe Arg Lys 

30 35 40 

cca ctg gge age atg tgc ttt gtt ttc ttg ttg cte cca taatcaatgt 411 
Pro Leu Gly Ser Met Cys Phe Val Phe Leu Leu Leu Pro 

45 50 55 

tgggcatcaa gatctaaccc ttgaaccttc cacgaaccct gttgtcaata cctctgggtt 471 
tccgccggtt acgcttaatt ttgacatatc ggtcagactg gtgccggata aacttcttgg 531 
ttctcttttt gacgatcttg ggcttcccaa gggttetgag gaeggtttat aaaagtgcac 591 
tgaggggagg ctgagacagg agaattgett gaacceggga ggcggaggtt gcagtgagcc 651 
gagategtgc ceatcctagg cgacagagag agactetatg tcaaaaataa ataaataaat 711 
aaataaataa ataaataaat aaataaataa ataaaatgcc aaaaaaaaaa aaaaaa 767 



<210> 20 
<211> 596 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 41. .478 



<220> 

<221> sig_peptide 

<222> 41. .115 

<223> Von Heijne matrix 

score 8.18796433724836 
seq ATLLLVLCLQLGA/NK 



<220> 

<221> polyA_site 
<222> 581.. 596 



<220> 

<221> polyA_signal 
<222> 560.. 565 



<400> 20 

accacttctc tgggacacat tgecttctgt ttteteeagc atg egc ttg etc cag 55 

Met Arg Leu Leu Gin 
-25 

etc ctg ttc agg gcc age cct gcc acc ctg cte ctg gtt etc tgc ctg 103 

Leu Leu Phe Arg Ala Ser Pro Ala Thr Leu Leu Leu Val Leu Cys Leu 

-20 -15 -10 -5 

cag ttg ggg gcc aae aaa get cag gac aae act egg aag ate ata ata 151 

Gin Leu Gly Ala Asn Lys Ala Gin Asp Asn Thr Arg Lys He He He 

15 10 

aag aat ttt gac att ccc aag tea gta cgt cca aat gac gaa gtc act 199 
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Lys Asn 


Phe 


Asp 


He 


Pro 


Lys 


Ser 


Val Arg 


Pro 


Asn Asp Glu Val Thr 






15 










20 








25 




gca gtg 


ctt 


gca 


gtt 


caa 


aca 


gaa 


ttg 


aaa 


gaa 


tgc atg gtg gtt aaa 


247 


Ala Val 


Leu 


Ala 


Val 


Gin 


Thr 


Glu 


Leu 


Lys 


Glu 


Cys Met Val Val Lys 




30 










35 










40 




act tac 


etc 


att 


age 


age 


ate 


cct 


eta 


caa 


ggk 


gem ttt aac tat arg 


295 


Thr Tyr 


Leu 


lie 


Ser 


Ser 


He 


Pro 


Leu 


Gin 


Gly 


Ala Phe Asn Tyr Xaa 




45 








50 










55 


60 




tat act 


gcc 


tgc 


eta 


tgt 


gae 


gae 


aat 


cca 


aaa 


ace ttc tac tgg gae 


343 


Tyr Thr 


Ala 


Cys 


Leu 


Cys Asp Asp Asn 


Pro 


Lys 


Thr Phe Tyr Trp Asp 










65 










70 




75 




ttt tac 


acc 


aac 


aga 


act 


gtg 


caa 


att 


gca 


gcc 


gte gtt gat gtt att 


391 


Phe Tyr 


Thr 


Asn 


Arg 


Thr 


Val 


Gin 


He 


Ala 


Ala 


Val Val Asp Val He 








80 










85 






90 




egg gaa 


tta 


ggc 


ate 


tgc 


cct 


gat 


gat 


get 


get 


gta ate ccc ate aaa 


439 


Arg Glu 


Leu 


Gly 


He 


Cys 


Pro 


Asp 


Asp Ala 


Ala 


Val He Pro He Lys 






95 










100 








105 




aac aac 


egg 


ttt 


tat 


act 


att 


gaa 


ate 


eta 


aag 


gta gaa taatggaagc 


488 


Asn Asn 


Arg 


Phe 


Tyr 


Thr 


He 


Glu 


He 


Leu 


Lys 


Val Glu 




110 










115 










120 




cctgtctgtt 


tgccacaccc aggtgatttc ctctaaagaa 


acttggetgg aatttctgct 


548 


gtggtctata i 


aaataaactt cttaacatgc tcaaaaaaaa 


aaaaaaaa 


596 


<210> 21 






















<211> 672 






















<212> DNA 






















<213> Homo 1 


sapiens 


















<220> 
























<221> CDS 






















<222> 7. 


.366 




















<220> 
























<221> sig_peptide 


















<222> 7. 


.102 




















<223> Von Heijne matrix 

















score 7.64815044221151 



VTLGIGFFALASA/LW 

<220> 

<221> polyA_site 
<222> 641. .672 

<400> 21 

agtcat atg get tea cct aaa ggt ttt ttt aat tac ttg ace tat ttt 48 



Met Ala Ser Pro Lys Gly Phe Phe Asn Tyr Leu Thr Tyr Phe 











-30 










-25 










-20 






ctt 


get 


get 


ggt 


get 


gte 


act 


ttg 


gga 


att 


ggt 


ttc 


ttt 


get 


ttg 


gca 


96 


Leu 


Ala 


Ala 


Gly 
-15 


Ala 


Val 


Thr 


Leu 


Gly 
-10 


He 


Gly 


Phe 


Phe 


Ala 

-5 


Leu 


Ala 




tea 


get 


ttg 


tgg 


ttc 


etg 


att 


tgc 


aaa 


cga 


aga 


gaa 


ata 


ttt 


caa 


aat 


144 


Ser 


Ala 


Leu 
1 


Trp 


Phe 


Leu 


He 
5 


Cys 


Lys 


Arg 


Arg 


Glu 
10 


He 


Phe 


Gin 


Asn 




tec 


aaa 


ttt 


aaa 


gca 


att 


gat 


gag 


aga 


tgc 


agg 


caa 


aga 


cca 


tea 


atg 


192 


Ser Lys 


Phe 


Lys 


Ala 


He 


Asp 


Glu 


Arg 


Cys 


Arg 


Gin 


Arg 


Pro 


Ser 


Met 




15 










20 










25 










30 




gcg 


aaa 


att 


aaa 


tet 


eat 


tet 


cag 


tgt 


gtt 


ttt 


att 


tet 


cga 


aat 


ttt 


240 


Ala 


Lys 


He 


Lys 


Ser 
35 


His 


Ser 


Gin 


Cys 


Val 
40 


Phe 


He 


Ser 


Arg 


Asn 
45 


Phe 




eat 


act 


ggg 


aga 


ttc 


caa 


tta 


cag 


caa 


tta 


aag 


ate 


att 


eta 


aag 


atg 


288 


His 


Thr 


Gly 


Arg 
50 


Phe 


Gin 


Leu 


Gin 


Gin 
55 


Leu 


Lys 


He 


He 


Leu 
60 


Lys 


Met 




aac 


ccc 


aac 


ttg 


caa 


caa 


aaa 


ata 


tea 


ttt 


gtg 


ate 


cct 


cag 


aga 


cca 


336 
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Asn Pro Asn Leu Gin Gin Lys He Ser Phe Val He Pro Gin Arg Pro 

65 70 75 

get cca caa caa ate gca gca gtg tta cat taagcttatc aacattacca 386 
Ala Pro Gin Gin He Ala Ala Val Leu His 

80 85 

tctgattctt attacagcca aagtatagaa gcagctgatg actggttttc tgatgattct 446 

ctagtgaaaa ggaactctcc aatgccttct ctcggggaac ctctaatgga aaaagtattt 506 

tcatacctgt caaccatttc attagaagag ggtactgaaa gtgtactgaa tgacacttta 566 

tgaccatcaa aaagatgact acattaaggg aaaatgttca tgaagaaaca cagaggttga 626 

aatataaaac cttcaacaga aaaaaaaaaa aaaatgcgaa aagctt 672 

<210> 22 

<211> 438 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 84. .317 

<220> 

<221> sig^peptide 

<222> 84. .140 

<223> Von Heijne matrix 

score 7.64030745849671 

seq ALVLALMISMISA/DS 

<220> 

<221> polyA_site 
<222> 421. .438 

<220> 

<221> polyA_signal 
<222> 397.. 402 

<400> 22 

atagaaaagg acatctcttg agacttcact tcagcttcac tgacttcttg actctcctct 60 
tgagtaaaag gactcagcca act atg aag ttt ttt gtc ttt get tta gtc ttg 113 

Met Lys Phe Phe Val Phe Ala Leu Val Leu 
-15 -10 
get etc atg att tec atg att age get gat tea cat gaa aag aga cat 161 
Ala Leu Met He Ser Met He Ser Ala Asp Ser His Glu Lys Arg His 

-5 1 5 

cat ggg tat aga aga aaa ttc cat gaa aag cat cat tea tac cat ate 209 
His Gly Tyr Arg Arg Lys Phe His Glu Lys His His Ser Tyr His He 

10 15 20 

aca eta eta cca ett ttt gaa gaa tea tea aag age aat gca aat gaa 257 
Thr Leu Leu Pro Leu Phe Glu Glu Ser Ser Lys Ser Asn Ala Asn Glu 

25 30 35 

aaa cac tat aat tta ctg tat act ctt tgt ttc agg ata ett gcc ttt 305 
Lys His Tyr Asn Leu Leu Tyr Thr Leu Cys Phe Arg He Leu Ala Phe 
40 45 50 55 

tea att gtc act tgatgatata attgcaattt aaactgttaa gctgtgttca 357 
Ser He Val Thr 

gtactgtttc tgaataatag aaatcacttc tctaaaagea ataaatttca agcacatttt 417 
taaataaaaa aaawawaaaa a 4 38 

<210> 23 

<211> 617 

<212> DNA 

<213> Homo sapiens 



<220> 
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<221> CDS 
<222> 141. .401 



<220> 

<221> sig_peptide 

<222> 141. .200 

<223> Von Heijne matrix 

score 7.42708462116258 
LLVFLAGFPVLDA/ND 



<220> 

<221> polyA_site 
<222> 603.. 617 



<400> 23 

gatttctccc ggaacctctg ctcagcctgg tgaaccacac aggcccgagt ttcacccagt 60 
ccccactcca cggtgcagct gcggcttatc tctcagccca gcgagatgcc agccttcctg 120 
tcccgggcca gcgctctgac atg cag aag gtg acc ctg ggc ctg ctt gtg ttc 173 

Met Gin Lys Val Thr Leu Gly Leu Leu Val Phe 
-20 -15 -10 

ctg gca ggc ttt cct gtc ctg gac gcc aat gac eta gaa gat aaa aac 221 
Leu Ala Gly Phe Pro Val Leu Asp Ala Asn Asp Leu Glu Asp Lys Asn 

-5 15 
agt cct ttc tac tat gac tgg cac age etc cag gtt ggc ggg etc ate 269 
Ser Pro Phe Tyr Tyr Asp Trp His Ser Leu Gin Val Gly Gly Leu lie 

10 15 20 

tgc get ggg gtt ctg tgc gcc atg ggc ate ate ate gtc atg agt gca 317 
Cys Ala Gly Val Leu Cys Ala Met Gly lie He He Val Met Ser Ala 

25 30 35 

aaa tgc aaa tgc aag ttt ggc cag aag tec ggt cac cat eca ggg gag 365 
Lys Cys Lys Cys Lys Phe Gly Gin Lys Ser Gly His His Pro Gly Glu 
40 45 50 55 

act eca cct etc ate acc eca ggc tea gee caa age tgatgaggae 411 
Thr Pro Pro Leu He Thr Pro Gly Ser Ala Gin Ser 

60 65 
agaccagctg aaattgggtg gaggaccgtt ctctgtcccc aggtcctgtc tctgcacaga 471 
aacttgaact ccaggatgga attcttccte ctctgctggg actcctttgc atggcagggc 531 
etcatctcac ctetcgeaag agggtctett tgttcaattt ttttttatct aaaatgattg 591 
tgcctctgcc caaaaaaaaa aaaaaa 6^'? 



<210> 24 

<211> 673 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 69. .383 



<220> 

<221> sig_peptide 

<222> 69. .125 

<223> Von Heijne matrix 

score 7.20796835452081 
VSIMLLLVTVSDC/AV 



<400> 24 

acaaggctga gcgggaggaa gcgagaggea tctaagcagg cagtgttttg cettcacccc 60 
aagtgacc atg aga ggt gcc acg cga gtc tea ate atg etc etc eta gta 110 
Met Arg Gly Ala Thr Arg Val Ser He Met Leu Leu Leu Val 
-15 -10 
act gtg tct gac tgt get gtg ate aca ggg gcc tgt gag cga gat gtc 158 
Thr Val Ser Asp Cys Ala Val He Thr Gly Ala Cys Glu Arg Asp Val 
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-5 








1 








5 










10 






cag tgt 
Gin Cys 


ggg 

Gly 


gca 
Ala 
15 


ggc 
Gly 


ace 
Thr 


tgc 
Cys 


tgt 
Cys 


gcc 
Ala 
20 


ate 
He 


age 
Ser 


ctg 
Leu 


tgg 
Trp 


ctt 
Leu 
25 


ega 
Arg 


ggg 

Gly 


206 


ctg egg 
Leu Arg 


atg 
Met 
30 


tgc 
Cys 


ace 
Thr 


ccg 
Pro 


ctg 
Leu 


ggg 

Gly 
35 


egg 
Arg 


kaa 
Xaa 


ggc gag 
Gly Glu 


gag 
Glu 
40 


tgc 
Cys 


cac 
His 


ccc 
Pro 


254 


ggc age 
Gly Ser 
45 


cac 
His 


aag 
Lys 


ate 
lie 


ccc 
Pro 


ttc 
Phe 
50 


ttc 
Phe 


agg 
Arg 


aaa 
Lys 


cgc 
Arg 


aag 
Lys 
55 


cac 
His 


cac 
His 


acc 
Thr 


tgt 
Cys 


302 


cct tgc 
Pro Cys 
60 


ttg 
Leu 


ccc 
Pro 


aac 
Asn 


ctg 
Leu 

65 


ctg 
Leu 


tgc 
Cys 


tec 
Ser 


agg 
Arg 


ttc 
Phe 
70 


ccg 
Pro 


gac 
Asp 


ggc 
Gly 


agg 
Arg 


tac 
Tyr 

75 


350 


cgc tgc 
Arg Cys 


tec 
Ser 


atg 
Met 


gac 
Asp 
80 


ttg 
Leu 


aag 
Lys 


aac 
Asn 


ate 
He 


aat 
Asn 
85 


ttt 
Phe 


taggcgcttg cctggtctea 


403 



ggatacecac catccttttc ctgagcacag cctggatttt tatttctgcc atgaaaccca 463 

gctcceatga ctetcccagt ccctasactg actaccctga tetctcttgt etagtacgca 523 

catatgeaca caggcagaca tacctcccat catgacatgg tccceaggct ggcctgagga 583 

tgtcacagct tgaggctgtg gtgtgaaagg tggecagcct ggttctcttc ectgctcagg 643 

ctgcactcaa aaaaaaaaaw aaatgcgaaa 673 



<210> 25 
<211> 678 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 69. .383 

<220> 

<221> sig_peptide 

<222> 69.. 125 

<223> Von Heijne matrix 

score 7.20796835452081 

VSIMLLLVTVSDC/AV 

<220> 

<221> polyA_site 
<222> 652.. 678 

<400> 25 

acaaggctga gcgggaggaa gcgagaggca tctaagcagg cagtgttttg ccttcacccc 60 
aagtgacc atg aga ggt gcc aeg ega gtc tea ate atg etc etc eta gta 110 
Met Arg Gly Ala Thr Arg Val Ser He Met Leu Leu Leu Val 
-15 -10 



act gtg 


tct 


gac 


tgt 


get 


gtg 


ate 


aca 


ggg 


gcc 


tgt 


gag 


ega 


gat 


gtc 


158 


Thr Val 


Ser 


Asp 


Cys 


Ala 


Val 


He 


Thr 


Gly 


Ala 


Cys 


Glu 


Arg 


Asp 


Val 




-5 








1 








5 










10 






cag tgt 


ggg 


gca 


ggc 


acc 


tgc 


tgt 


gcc 


ate 


age 


ctg 


tgg 


ctt 


ega 


ggg 


206 


Gin Cys 


Gly 


Ala 


Gly 


Thr 


Cys 


Cys 


Ala 


He 


Ser 


Leu 


Trp 


Leu 


Arg 


Gly 








15 










20 










25 








ctg egg 


atg 


tgc 


acc 


ccg 


ctg 


ggg 


egg 


gaa 


ggc 


gag 


gag 


tgc 


cac 


ccc 


254 


Leu Arg 


Met 


Cys 


Thr 


Pro 


Leu 


Gly 


Arg 


Glu 


Gly 


Glu 


Glu 


Cys 


His 


Pro 






30 










35 










40 










ggc age 


cac 


aag 


ate 


ccc 


ttc 


ttc 


agg 


aaa 


cgc 


aag 


cac 


cac 


acc 


tgt 


302 


Gly Ser 


His 


Lys 


He 


Pro 


Phe 


Phe 


Arg 


Lys 


Arg 


Lys 


His 


His 


Thr 


Cys 




45 










50 










55 












cct tgc 


ttg 


ccc 


aac 


ctg 


ctg 


tgc 


tec 


agg 


ttc 


ccg 


gac 


ggc 


agg 


tac 


350 


Pro Cys 


Leu 


Pro 


Asn 


Leu 


Leu 


Cys 


Ser 


Arg 


Phe 


Pro 


Asp 


Gly 


Arg 


Tyr 




60 








65 










70 










75 




cgc tgc 


tec 


atg 


gac 


ttg 


aag 


aac 


ate 


aat 


ttt 


taggcgcttg cctggtctea 


403 
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22 

Arg Cys Ser Met Asp Leu Lys Asn lie Asn Phe 

80 85 
ggatacccac catccttttc ctgagcacag cctggatttt tatttctgcc atgaaaccca 463 
gctcccatga ctctcccagt ccctacactg actaccctga tctctcttgt ctagtacgca 523 
catatgcaca caggcagaca tacctcccat catgacatgg tccccaggct ggcctgagga 583 
tgtcacagct tgaggctgtg gtgtgaaagg tggccagcct ggttctcttc cctgctcagg 643 
ctgcactcaa aaaaaaaaaa aaatgcgaaa agctt 678 

<210> 26 
<211> 860 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 129. .434 



<220> 

<221> sig_peptide 

<222> 129. .236 

<223> Von Heijne matrix 

score 7.04760006506154 
seq PFFLITLVGVVVA/VV 

<220> 

<221> polyA__site 
<222> 845. .860 



<220> 

<221> polyA_signai 
<222> 826. .831 



<400> 26 

tccggggccc gggagccaac cgagggcgtt cctgtcgggg ctgcagcggc gggagggagc 
ccagtggagg cgccctcccg aagcgccact gcccatgctg accacccagc cctccggctg 
ctgatgtc atg agt aac acc act gtg ccc aat gcc ccc cag gcc aac age 
Met Ser Asn Thr Thr Val Pro Asn Ala Pro Gin Ala Asn Ser 
-35 -30 -25 

gac tec atg gtg ggc tat gtg ttg ggg ccc ttc ttc etc ate acc ctg 
Asp Ser Met Val Gly Tyr Val Leu Gly Pro Phe Phe Leu lie Thr Leu 

-20 -15 -10 

gtc ggg gtg gtg gtg get gtg gta atg tat gta cag aag aaa aag egg 
Val Gly Val Val Val Ala Val Val Met Tyr Val Gin Lys Lys Lys Arg 
-5 15 10 



gtg 


gac 


egg 


ctg 


cgc 


cat 


cac 


ctg 


etc 


ccc 


atk 


tac 


age 


tat 


gac 


cca 


Val 


Asp 


Arg 


Leu 


Arg 


His 


His 


Leu 


Leu 


Pro 


Xaa 


Tyr 


Ser 


Tyr 


Asp 


Pro 










15 










20 










25 




get 


gag 


gaa 


ctg 


cat 


gag 


get 


gag 


car 


cag 


ctg 


etc 


tot 


gac 


atg 


gga 


Ala 


Glu 


Glu 


Leu 


His 


Glu 


Ala 


Glu 


Gin 


Gin 


Leu 


Leu 


Ser 


Asp 


Met 


Gly 



30 



35 



40 



gac 


ccc 


aag 


gtg 


gta 


cat 


ggc 


tgg 


cag akt ggc tac cag cac aag egg 


Asp 


Pro 


Lys 


Val 


Val 


His 


Gly 


Trp 


Gin Xaa Gly Tyr Gin His Lys Arg 






45 










50 


55 


atg 


cca 


ctg 


ctg 


gat 


gtc 


aag 


acg 


tgacctgacc cccttgcccc acccttcaga 


Met 


Pro 


Leu 


Leu 


Asp 


Val 


Lys 


Thr 






60 










65 







gcctggggtc ctggactgec tggggccctg ccatctgctt cecctgetgt cacctggctc 

cecctgctgg gtgctgggtc tccatttcte cctceaceca cectcagcag catctgcttc 

ccatgcectc accatcacct cactgcccec aggccttctg ccctttgtgg gtgttgagct 

caccgeccac ecacaggcac teataggaag aggctttect tctgggatgg cggcggctgg 

tagacacctt tgctttctct agccctcctg ggctgggctt gggcacaaat ccccaggcag 

gctttggagt tgtttccatg gtgatggggc cagatgtata gtattcagta tatattttgt 

aaataaaatg ttttgtggct aaaaaaaaaa aaaaaa 



60 
120 
170 



218 



266 



314 



362 



410 



464 



524 
584 
644 
704 
764 
824 
860 
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<210> 27 
<211> 1443 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 30. .440 



<220> 

<221> sig^peptide 

<222> 30. .83 

<223> Von Heijne matrix 

score 7.06387130837154 
seq VLGLVLLSVTVQG/KV 



<220> 

<221> polyA^site 
<222> 1428. .1443 



<220> 

<221> polyA_signal 
<222> 1409. .1414 



197 



245 



293 



<400> 27 

cagcctagca ctctgaccta gcagtcaac atg aag get etc att gtt ctg ggg 53 

Met Lys Ala Leu He Val Leu Gly 
-15 

ett gtc etc ctt tct gtt acg gtc cag ggc aag gtc ttt gaa agg tgt 101 

Leu Val Leu Leu Ser Val Thr Val Gin Gly Lys Val Phe Glu Arg Cys 

-10 -5 15 

gag ttg gee aga act ctg aaa aga ttg gga atg gat ggc tac agg gga 149 

Glu Leu Ala Arg Thr Leu Lys Arg Leu Gly Met Asp Gly Tyr Arg Gly 

10 15 20 

ate age eta gea aac tgg atg tgt ttg gee aaa tgg gag agt ggt tac 
He Ser Leu Ala Asn Trp Met Cys Leu Ala Lys Trp Glu Ser Gly Tyr 

25 30 35 

aac aca ega get aca aac tac aat get gga gae aga age act gat tat 
Asn Thr Arg Ala Thr Asn Tyr Asn Ala Gly Asp Arg Ser Thr Asp Tyr 

40 45 50 

ggg ata ttt cag ate aat age cgc tac tgg tgt aat gat ggc aaa ace 
Gly He Phe Gin He Asn Ser Arg Tyr Trp Cys Asn Asp Gly Lys Thr 
55 60 65 70 

cea gga gea gtt aat gee tgt eat tta tec tge agt ggg tgg cat gga 341 
Pro Gly Ala Val Asn Ala Cys His Leu Ser Cys Ser Gly Trp His Gly 

75 80 85 

gaa ate gtt gtc aaa aca gag atg tec gtc agt atg ttc aag gtt gtg 389 
Glu He Val Val Lys Thr Glu Met Ser Val Ser Met Phe Lys Val Val 

90 95 100 

gag tgt aac tec aga att ttc ett ctt cag etc att ttg tct etc tea 437 
Glu Cys Asn Ser Arg He Phe Leu Leu Gin Leu He Leu Ser Leu Ser 

105 110 115 

cat taagggagta ggaattaagt gaaaggtcac actaccatta ttttcccctt 4 90 

His 

caaacaaata atatttttac agaageagga geaaaatatg gcctttcttc taagagatat 550 
aatgtteaet aatgtggtta ttttatatta ageetaeaae attttteagt ttgeaaatag 610 
aactaatact ggtgaaaatt tacctaaaac ettggttate aaataeatct eeagtaeatt 670 
cegttetttt tttttttttt tttttttttt ttgagaeagt etcgetetgt egeeeagget 730 
ggagtgeagt ggegcaatet eggctcaetg caaectccac etceegggtt caegceatte 790 
tcctgcetca gectcccgag tagctgggat tacgggcgcc cgccaeeaeg eecggctaat 850 
tttttgtatt tttagtagag acagggtttc accgttttag ccaggatggt etcgatctcc 910 
tgaccttgtg atccacccac etcggcctcc caaagtgctg cgattacagg cgtgagccac 970 



wo 02/083898 
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tgcgcccggc 
cgattatgcc 
caaaatacca 
gtaaaaataa 
gatagaaaca 
ttgctgttcc 
agcttacaag 
aaaaatacaa 



cacattcagt 
caatattaag 
gctgatgaag 
ccactttttg 
gactgcctga 
ttaatttgat 
tattgaaata 
acattctaat 



tyttatcaaa 
taaaaaatat 
gcatctgatg 
ttgggcaata 
attgagaatt 
taatttaatt 
attactaatt 
taaaggcttt 



24 

gaaataaccc 
aagaaaaggt 
ccttcatctg 
tgaaattttt 
ttgatttttt 
catgtattat 
aatcacaaat 
gcaacacaaa 



<210> 28 

<211> 622 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 82. .384 



agacttaatc 
katcttaaat 
ttcagtcatc 
aaaggagtag 
aaagtgtgtt 
gattaaatct 
gtgaagttat 
aaaaaaaaaa 



ttgaatgata 
agatcttagg 
tccaaaaaca 
aataccaaat 
tctttctaaa 
gakgcakatg 
gcatgatgta 
aaa 



1030 
1090 
1150 
1210 
1270 
1330 
1390 
1443 



<220> 

<221> sig_peptide 

<222> 82. .168 

<223> Von Heijne matrix 

score 6.9066614461057 
seq VCMCVFSTQGALG/EM 



<220> 

<221> polyA_site 
<222> 607.. 622 

<220> 

<221> polyA_signal 
<222> 570. .575 



<400> 28 

gctacctcct ggtgtctgtt gcctggtgtg tgggaaaccc 
gtcttctgtg ttgactgttg a atg agg gaa gag aaa 

Met Arg Glu Glu Lys 
-25 

gag aga gag agt gtg tgt gtg tgt atg tgt gtg 
Glu Arg Glu Ser Val Cys Val Cys Met Cys Val 

-15 -10 
get ttg ggg gaa atg get gca cac ttc ata gat 
Ala Leu Gly Glu Met Ala Ala His Phe lie Asp 

1 5 
agt gag ggg aat ggt cac aga ggg aca ctg gat 
Ser Glu Gly Asn Gly His Arg Gly Thr Leu Asp 

15 20 
caa gag tec tac ate ccc tec ace gee gae ccc 
Gin Glu Ser Tyr lie Pro Ser Thr Ala Asp Pro 
30 35 40 

gag ctg ctg cac aag aac ttg ccc gtg ace tec 
Glu Leu Leu His Lys Asn Leu Pro Val Thr Ser 

50 55 
ccc tct gac etc geg ate cca gee get gcc etc 
Pro Ser Asp Leu Ala lie Pro Ala Ala Ala Leu 

65 70 
ctgccctcgc tgagcttgga ccacgtgcct gagacccgtc 
tcggggtggg gggagctcca caccggcggg gaacgtgctt 
ctctagaaca gagcctcagg cacaggaggc acttcacatt 
gaatcagtgt catcctccca acaaaaaaaa aaaaaaaa 



ceaeacgctg gaccacagaa 
aag ccc ttt gag aga 
Lys Pro Phe Glu Arg 
-20 

ttt tec act caa gga 
Phe Ser Thr Gin Gly 
-5 

gaa aag ctg agg ccc 
Leu Arg 



Pro 



Glu Lys 
10 

age ctg agt teg gac 
Ser Leu Ser Ser Asp 
25 

ace cag get ggc cet 
Thr Gin Ala Gly Pro 

45 

ega tec cag ccg ctg 
Arg Ser Gin Pro Leu 
60 

tgacetcecg atcceagceg 



tttcctcatt taaaccctgc 
gtteacagtg tgttcttage 
tattgaataa atagatgaat 



60 
111 



159 



207 



255 



303 



351 



404 



464 
524 
584 
622 



<210> 29 
<211> 658 



wo 02/083898 PCT/IBO 1/009 14 

25 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 114.. 497 

<220> 

<221> sig_pepticle 

<222> 114. .191 

<223> Von Heijne matrix 

score 6.8337397741578 

seq VFLSLFLIQLLIS/FS 

<400> 29 

ctcagcaccc agggcggtgg taggtcacag tctctgggcg ggtctcagtg tccaacactg 60 
tagctggtgc ctgccaggtt cccagtggct ggggtcacca ggtctgaaga gag atg 116 

Met 



tgc 


tgg 


ctg 


egg 


gca 


tgg 


ggc 


cag 


ate 


etc 


ctg 


cca 


gtt 


ttc 


etc 


tec 


164 


Cys 


Trp 


Leu 


Arg 


Ala 


Trp 


Gly 


Gin 


He 


Leu 


Leu 


Pro 


Val 


Phe 


Leu 


Ser 




-25 










-20 










-15 










-10 




etc 


ttt 


etc 


ate 


caa 


ttg 


ctt 


ate 


age 


ttc 


tea 


gag 


aat 


ggt 


ttt 


ate 


212 


Leu 


Phe 


Leu 


He 


Gin 


Leu 


Leu 


He 


Ser 


Phe 
1 


Ser 


Glu 


Asn 


Gly 
5 


Phe 


He 




cac 


age 


ccc 


agg 


-5 
aac 


aat 


cag 


aaa 


cca 


aga 


gat 


ggg 


aat 


gaa 


gag 


gaa 


260 


His 


Ser 


Pro 


Arg 


Asn 


Asn 


Gin 


Lys 


Pro 


Arg 


Asp 


Gly 


Asn 


Glu 


Glu 


Glu 








10 










15 










20 










tgt 


get 


gta 


aag 


aag 


agt 


tgt 


caa 


ttg 


tgc 


aea 


gaa 


gat 


aag 


aaa 


tgt 


308 


Cys 


Ala 


Val 


Lys 


Lys 


Ser 


Cys 


Gin 


Leu 


Cys 


Thr 


Glu 


Asp 


Lys 


Lys 


Cys 






25 










30 










35 












gtt 


tgg 


tgt 


agt 


gaa 


gaa 


aaa 


gca 


tgc 


raa 


aaa 


tac 


tgt 


ttt 


eec 


tat 


356 


Val 


Trp 


Cys 


Ser 


Glu 


Glu 


Lys 


Ala 


Cys 


Xaa 


Lys 


Tyr 


Cys 


Phe 


Pro 


Tyr 




40 






45 










50 










55 




ttc 


ggt 


tgt 


ega 


ttc 


agt 


tct 


ata 


tat 


tgg 


tta 


aac 


tgt 


aaa 


gtt 


gae 


404 


Phe 


Gly 


Cys 


Arg 


Phe 


Ser 


Ser 


He 


Tyr 


Trp 


Leu 


Asn 


Cys 


Lys 


Val 


Asp 










60 










65 










70 






atg 


ttt 


gga 


ate 


atg 


atg 


ctt 


eta 


etc 


att 


gca 


gta 


tta 


att 


aea 


gga 


452 


Met 


Phe 


Gly 


He 


Met 


Met 


Leu 


Leu 


Leu 


He 


Ala 


Val 


Leu 


He 


Thr 


Gly 








75 










80 










85 








ttc 


gtt 


tgg 


tac 


tgc 


tgc 


gcc 


tat 


cac 


ttt 


tac 


ctg 


cag 


gat 


ata 




497 


Phe 


Val 


Trp 


Tyr 


Cys 


Cys 


Ala 


Tyr 


His 


Phe 


Tyr 


Leu 


Gin 


Asp 


He 










90 










95 










100 











tgatgaatag ataattgaaa agagateetc cagaaagagc agaaggaagt ttetteaatg 557 
gcttcettca ggattttaat cateettaca gectetttga gaatgattga aetteeaaat 617 
tccctgaagt taaaatttta aattctatta aacatttttt c 658 

<210> 30 
<211> 619 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 74 . .499 

<220> 

<221> sig_peptide 

<222> 74. .145 

<223> Von Heijne matrix 

score 6.56258453552342 

seq LSICLSAVATATG/AE 



<220> 
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<221> polyA__site 
<222> 556. .619 

<400> 30 

agcgactagg acgcgcctgc gcagaggcgg cagcaccacc ggggttgact ccgggggcgc 60 
ggcgaggaga gac atg agg ctg age tgg ttc egg gtc ctg aca gta ctg 109 
Met Arg Leu Ser Trp Phe Arg Val Leu Thr Val Leu 
-20 -15 



tec 


ate 


tgc 


ctg 


age 


gee 


gtg 


gee 


aeg 


gee 


acg 


ggg 


gee 


gag 


ggc 


aaa 


157 


Ser 


He 


Cys 


Leu 


Ser 


Ala 


Val 


Ala 


Thr 


Ala 


Thr 


Gly 


Ala 


Glu 


Gly 


Lys 








-10 










-5 










1 










agg 


aag 


ctg 


cag 


ate 


ggg 


gtc 


aag 


aag 


egg 


gtg 


gac 


cac 


tgt 


cce 


ate 


205 


Arg 


Lys 


Leu 


Gin 


He 


Gly 


Val 


Lys 


Lys 


Arg 


Val 


Asp 


His 


Cys 


Pro 


He 




5 










10 










15 










20 




aaa 


teg 


cgc 


aaa 


ggg 


gat 


gtc 


ctg 


cac 


atg 


cac 


tac 


acg 


ggt 


aag 


ctg 


253 


Lys 


Ser 


Arg 


Lys 


Gly 


Asp 


Val 


Leu 


His 


Met 


His 


Tyr 


Thr 


Gly 


Lys 


Leu 












25 










30 










35 






gaa 


gat 


ggg 


aca 


gag 


ttt 


gac 


age 


age 


ctg 


ccc 


cag 


aac 


cag 


cce 


ttt 


301 


Glu Asp 


Gly 


Thr 


Glu 


Phe 


Asp 


Ser 


Ser 


Leu 


Pro 


Gin 


Asn 


Gin 


Pro 


Phe 










40 










45 










50 








gtc 


ttc 


tec 


ctt 


ggc 


aca 


ggc 


cag 


gtc 


ate 


aag 


ggc 


tgg 


gac 


cag 


ggg 


349 


Val 


Phe 


Ser 


Leu 


Gly 


Thr 


Gly 


Gin 


Val 


He 


Lys 


Gly 


Trp 


Asp 


Gin 


Gly 








55 










60 










65 










ctg 


ctg 


ggg 


atg 


tgt 


gag 


ggg 


gaa 


aag 


cgc 


aas 


ykg 


gtg 


ate 


cca 


tec 


397 


Leu 


Leu 


Gly 


Met 


Cys 


Glu 


Gly 


Glu 


Lys 


Arg 


Xaa 


Xaa 


Val 


He 


Pro 


Ser 






70 










75 










80 












gag 


eta 


ggg 


tat 


gga 


gas 


egg 


gga 


get 


cce 


cca 


aag 


att 


cca 


ggc 


ggt 


445 


Glu 


Leu 


Gly 


Tyr 


Gly 


Xaa 


Arg 


Gly 


Ala 


Pro 


Pro 


Lys 


He 


Pro 


Gly 


Gly 




85 










90 










95 










100 




gca 


ace 


ctg 


gtg 


ttc 


gar 


gtg 


gag 


ctg 


etc 


aaa 


ata 


gag 


cga 


cga 


act 


493 


Ala 


Thr 


Leu 


Val 


Phe 


Glu 


Val 


Glu 


Leu 


Leu 


Lys 


He 


Glu 


Arg 


Arg 


Thr 





105 110 115 



gag ctg taaccagact ggggaggggc agggggagag geccccatca gggaccagac 54 9 

Glu Leu 

tgttccaaaa aaaaaacaaa aaacaaaaac aaacaaaaaa acaettaaaa gccgaaaaaa 609 
aaaaaaaaaa 

<210> 31 

<211> 573 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 65. .274 

<220> 

<221> sig__peptide 

<222> 65. .136 

<223> Von Heijne matrix 

score 6.54728084640774 

ILLTRPACLGSWA/EI 

<220> 

<221> polyA_site 
<222> 556. .573 

<220> 

<221> polyA signal 
<222> 540.. 545 

<400> 31 

cactgattct ttggccatcg etactacaat tttagaattg ctggcaattt tttctgeaca 60 
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ctgg atg get tct tec aec agt gte tea aca gga caa ate ttg ctg aca 109 
Met Ala Ser Ser Thr Ser Val Ser Thr Gly Gin lie Leu Leu Thr 
-20 -15 -10 

aga cct get tgc ttg ggg tec tgg get gag ate egg tea ceg gtg agg 157 
Arg Pro Ala Cys Leu Gly Ser Trp Ala Glu lie Arg Ser Pro Val Arg 

-5 15 
acc ate tec ate gee age gac ttc eea aca gea egg gtg agt etc tgg 205 
Thr He Ser lie Ala Ser Asp Phe Pro Thr Ala Arg Val Ser Leu Trp 

10 15 20 

gtg ccg ccc gea cct ggg atg gtt cct att aag ate tec ggc tgt gca 253 
Val Pro Pro Ala Pro Gly Met Val Pro He Lys He Ser Gly Cys Ala 

25 30 35 

aae tgg gee ttc tea ccg gca tagatgatat eacacatcat ggcaagctca 304 
Asn Trp Ala Phe Ser Pro Ala 
40 45 

cagcccccgc caaaggeata gceattgaca gcagcgatga ctggattett gaectgggtg 364 
aggtggtccc agtgcttcaa gaacttgctg gagtaaeagt cetggaaact caggttctgc 424 
attteettga tateagctet agtgaaaagg gcggtagttg gtggtggaac eagaaacgga 484 
cgccggtget tggagcggtt cttaaattat atttaaaaaa gtaacttttt gtataaataa 544 
aagaaaatgg gacaaaaaaa aaaaaaaaa 573 

<210> 32 
<211> 497 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 51.. 389 

<220> 

<221> sig_peptide 

<222> 51.. 119 

<223> Von Heijne matrix 

score 6.43297166361141 

seq LGALSGAAALGFA/SY 

<220> 

<221> polyA_site 
<222> 482.. 497 

<220> 

<221> polyA_signal 
<222> 462. .467 



<400> 32 

etettgggtg gcetgggcgg cgttggtccg gtgegtectg ttctacaget atg gcc 56 

Met Ala 



ggg 


cca 


get 


gca 


get 


ttc 


cge 


cge 


ttg 


ggc 


gee 


ttg 


tec 


gga 


get 


gcg 


104 


Gly 


Pro 
-20 


Ala 


Ala 


Ala 


Phe 


Arg 
-15 


Arg 


Leu 


Gly 


Ala 


Leu 
-10 


Ser 


Gly 


Ala 


Ala 




gee 


tta 


ggc 


ttc 


get 


tec 


tac 


ggg 


geg 


cac 


ggc 


gee 


maw 


ttc 


cca 


gat 


152 


Ala 


Leu 


Gly 


Phe 


Ala 


Ser 


Tyr 


Gly 


Ala 


His 


Gly Ala 


Xaa 


Phe 


Pro 


Asp 




-5 










1 








5 










10 






gcc 


tac 


ggg 


aag 


gag 


ctg 


ttt 


gac 


aag 


gcc 


aae 


aaa 


cac 


cac 


ttc 


tta 


200 


Ala 


Tyr 


Gly 


Lys 
15 


Glu 


Leu 


Phe 


Asp 


Lys 
20 


Ala 


Asn 


Lys 


His 


His 

25 


Phe 


Leu 




cac 


age 


ctg 


gcc 


ctg 


tta 


ggg 


gtg 


ccc 


cat 


tgc 


aga 


aag 


cca 


etc 


tgg 


248 


His 


Ser 


Leu 
30 


Ala 


Leu 


Leu 


Gly 


Val 
35 


Pro 


His 


Cys 


Arg 


Lys 
40 


Pro 


Leu 


Trp 




get 


ggg 


tta 


ttg 


eta 


get 


tee 


gga 


acg 


aec 


tta 


ttc 


tgc 


acc 


age 


ttt 


296 


Ala 


Gly 
45 


Leu 


Leu 


Leu 


Ala 


Ser 

50 


Gly 


Thr 


Thr 


Leu 


Phe 

55 


Cys 


Thr 


Ser 


Phe 
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tac tac cag get ctg agt gga gac ccc age ate cag act ttg gcc cet 344 

Tyr Tyr Gin Ala Leu Ser Gly Asp Pro Ser lie Gin Thr Leu Ala Pro 

60 65 70 75 

gcg gga ggg acc ctg eta etc ttg ggc tgg ctt gcc ttg get ctt 389 

Ala Gly Gly Thr Leu Leu Leu Leu Gly Trp Leu Ala Leu Ala Leu 

80 85 90 

tgagctccct tttgcttaat tactgggttt tctgcgcagt tttttttttt aaagagttgg 449 
agtaagaaga ggattaaaaa ggaaaggcaa acaaaaaaaa aaaaaaaa 4 97 

<210> 33 

<211> 484 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 14. .352 

<220> 

<221> sig_peptide 

<222> 14.. 82 

<223> Von Heijne matrix 

score 6,43297166361141 

seq LGALSGAAALGFA/SY 

<220> 

<221> polyA_site 
<222> 469. ,484 

<220> 

<221> polyA_signal 
<222> 443. .448 

<400> 33 

ctgttctaca get atg gcc ggg cea get gca get ttc cgc egc ttg ggc 49 
Met Ala Gly Pro Ala Ala Ala Phe Arg Arg Leu Gly 

















-20 










-15 










gee 


ttg 


tec 


gga 


get 


gcg 


gcc 


tta 


ggc 


ttc 


get 


tec 


tac 


ggg 


gcg 


eac 


97 


Ala 


Leu 
-10 


Ser 


Gly 


Ala 


Ala 


Ala 
-5 


Leu 


Gly 


Phe 


Ala 


Ser 
1 


Tyr 


Gly 


Ala 


His 
5 




ggc 


gcc 


ma a 


ttc 


cea 


gat 


gcc 


tac 


ggg 


aag 


gag 


ctg 


ttt 


gac 


aag 


gcc 


145 


Gly Ala 


Xaa 


Phe 


Pro 


Asp 


Ala 


Tyr 


Gly 


Lys 


Glu 


Leu 


Phe 


Asp 


Lys 


Ala 












10 










15 










20 






aac 


aaa 


eac 


cae 


tty 


tta 


eac 


age 


ctg 


gcc 


ctg 


tta 


ggg 


gtg 


ccc 


cat 


193 


Asn 


Lys 


His 


His 
25 


Phe 


Leu 


His 


Ser 


Leu 
30 


Ala 


Leu 


Leu 


Gly 


Val 
35 


Pro 


His 




tgc 


aga 


aag 


cea 


etc 


tgg 


get 


ggg 


tta 


ttg 


eta 


get 


tec 


gga 


aeg 


acc 


241 


Cys 


Arg 


Lys 
40 


Pro 


Leu 


Trp 


Ala 


Gly 
45 


Leu 


Leu 


Leu 


Ala 


Ser 
50 


Gly 


Thr 


Thr 




tta 


tty 


tgc 


acc 


age 


ttt 


tac 


tac 


cag 


get 


ytg 


agt 


gga 


gac 


ccc 


age 


289 


Leu 


Phe 
55 


Cys 


Thr 


Ser 


Phe 


Tyr 
60 


Tyr 


Gin 


Ala 


Leu 


Ser 
65 


Gly 


Asp 


Pro 


Ser 




ate 


cag 


act 


ttg 


gcc 


cet 


gcg 


gga 


ggg 


acc 


ctg 


eta 


etc 


ttg 


ggc 


tgg 


337 


He 


Gin 


Thr 


Leu 


Ala 


Pro 


Ala 


Gly 


Gly 


Thr 


Leu 


Leu 


Leu 


Leu 


Gly 


Trp 




70 










75 










80 










85 





ctt gcc ttg get ctt tgagctccct tttgattaat tactgggttt tctgggcagt 392 
Leu Ala Leu Ala Leu 
90 

tttttttttt taaagagttg gagtaagaag aggattaaaa aggaaaggca aataaacttt 452 
ggagtctttg tteatcaaaa aaaaaaaaaa aa 484 



<210> 34 
<211> 739 



wo 02/083898 

<212> DNA 

<213> Homo sapiens 



PCT/TBOl/00914 



29 



<220> 
<221> CDS 
<222> 111.. 272 



<220> 

<221> sig_peptide 

<222> 111. .161 

<223> Von Heijne matrix 

score 6.48133858229514 
LTFAQLLFATVLG/IA 



<220> 

<221> polyA_site 
<222> 718. .739 



<220> 

<221> polyA_signal 
<222> 688.. 693 



<400> 34 

aattgtagac gtcagtgttg gtgttaacat ttcagatttg aagagatttc ctgggtgtgg 60 

agtgtgactt tccaaaacca gcttttcctt gagctgtatt tgttgcagca atg ttt 116 

Met Phe 

agg aga ttg act ttt gca caa ctg ctt ttt gcc act gtc ctt gga att 164 

Arg Arg Leu Thr Phe Ala Gin Leu Leu Phe Ala Thr Val Leu Gly lie 

-15 -10 -5 1 

get gga gga gta tat att ttt caa cea gta ttt gaa cag tat gee aaa 212 

Ala Gly Gly Val Tyr lie Phe Gin Pro Val Phe Glu Gin Tyr Ala Lys 

5 10 15 

gat cag aag gaa tta aaa gaa aag atg cag ttg gta caa gaa tea gaa 260 

Asp Gin Lys Glu Leu Lys Glu Lys Met Gin Leu Val Gin Glu Ser Glu 

20 25 30 

gag aag aaa agt taatactaca tggagttagg cctggcgcag tggctcacgc 312 
Glu Lys Lys Ser 
35 

ctgtaatccc agcactttgg gaggccgagg cgggtggatc aggtggtcag gagttcaaga 372 

ccagcctgac caacatggtg aaaccctgtc tctactaaaa atacaaaaat cagccaggct 432 

tggtggcatg cgcctgtaat cccagctact tgggaggctg aggcaggagc atcacttgaa 4 92 

cctgggaggc agaggtggca gtgagtcaag atcacgctgc tgcactccag cctgggtgac 552 

agagcgagac tccatctaaa aaaacaaaaa gcaaaaaaac ccacaaatac ctcatggaga 612 

tgaactgtaa taattgctta aagttccatt taattatgtt aactctaatc tagcaaaaac 672 

atagatgtac ttaaaaataa atcatggata atgatttttt aacccaaaaa aaaaaaaaaa 732 
tgtgaaa 739 



<210> 35 
<211> 432 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 47» .295 



<220> 

<221> sig_peptide 

<222> 47. .91 

<223> Von Heijne matrix 

score 6.32210467320472 
seq LFLSLPVLVVVLS/IV 



wo 02/083898 



PCT/IBOl/00914 



30 

<220> 

<221> polyA_site 
<222> 420. .432 

<220> 

<221> polyA_signal 
<222> 395. ,400 

<400> 35 

aaccaagccc tccagcaagg attcagagtg cccctccggp ctcgcc atg agg etc 55 

Met Arg Leu 
-15 

ttc ctg teg ctt ccg gtc ctg gtg gtg gtt ctg teg ate gtc ttg gaa 103 
Phe Leu Ser Leu Pro Val Leu Val Val Val Leu Ser He Val Leu Glu 

-10 -5 1 

ggc cca gee cca gee cag ggg acc cea gac gtc tec agt gee ttg gat 151 
Gly Pro Ala Pro Ala Gin Gly Thr Pro Asp Val Ser Ser Ala Leu Asp 
5 10 15 20 

aag ctg aag gag ttt gga aac aca ctg gag gac aag get egg gaa etc 199 
Lys Leu Lys Glu Phe Gly Asn Thr Leu Glu Asp Lys Ala Arg Glu Leu 

25 30 35 

ate age cge ate aaa cag agt gaa ctt tct gcc aag atg egg gag tgg 247 
He Ser Arg lie Lys Gin Ser Glu Leu Ser Ala Lys Met Arg Glu Trp 

40 45 50 

ttt tea gag aca ttt cag aaa gtg aag gat aaa etc aag att gac tea 295 
Phe Ser Glu Thr Phe Gin Lys Val Lys Asp Lys Leu Lys He Asp Ser 

55 60 65 

tgaggacctg aagggtgaea tcccaggagg ggeetetgaa atttcccaca ccccagegce 355 
tgtgctgagg actccctcca tgtggcccca ggtgccacca ataaaaatcc tacagaaaac 415 
taawaaaaaa aaaaaaa 432 

<210> 36 

<211> 839 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 66. .359 

<220> 

<221> sig_peptide 
<222> 66. .194 
<223> Von Heijne matrix 

score 6.07207487250823 

seq FLIYV7VLLRVSEC/LP 

<220> 

<221> polyA_site 
<222> 824.. 839 

<220> 

<221> polyA signal 
<222> 805. .810 

<400> 36 

gggcgegagg cggceaccgt ggagagcaga gegcggeggc tggaagctgc taagtcagag 
eegeg atg ttc egg att gag ggc etc geg ccg aag ctg gac ccg gag gag 
Met Phe Arg He Glu Gly Leu Ala Pro Lys Leu Asp Pro Glu Glu 
-40 -35 -30 

atg aaa egg aag atg cgc gag gat gtg ate tec tee ata egg aac ttt 
Met Lys Arg Lys Met Arg Glu Asp Val He Ser Ser He Arg Asn Phe 
-25 -20 -15 



60 
110 

158 



wo 02/083898 PCT/IBO 1/009 14 

31 



etc ate 


tac 


gtg 


gcc 


etc ctg 


cga 


gtc 


agt gag 


tgt 


etc 


ccg 


ggc 


tgt 


206 


Leu lie 


Tyr 


Val 


Ala 


Leu Leu 


Arg 


Val 


Ser Glu 


Cys 


Leu 


Pro 


Gly 


Cys 






-10 








-5 






1 










gac tgt 


gat 


acc 


age 


ggg gag 


etc 


acc 


gac ggg 


cac 


ccc 


tta 


act 


eta 


254 


Asp Cys 


Asp 


Thr 


Ser 


Gly Glu 


Leu 


Thr 


Asp Gly 


His 


Pro 


Leu 


Thr 


Leu 




5 








10 






15 










20 




agg ggt 


cat 


egg 


ggc 


ett cga 


act 


gag 


ctg aae 


ggt 


age 


ggg 


gag 


caa 


302 


Arg Gly 


His 


Arg 


Gly 


Leu Arg 


Thr 


Glu 


Leu Asn 


Gly 


Ser 


Gly 


Glu 


Gin 










25 








30 








35 






gga ggc 


tec 


att 


tat 


ett aaa 


gaa 


att 


gga cag 


cat 


atg 


aag 


aca 


gga 


350 


Gly Gly 


Ser 


He 


Tyr 


Leu Lys 


Glu 


He 


Gly Gin 


His 


Met 


Lys 


Thr 


Gly 








40 








45 








50 









cat cac ata tgaatgcacg atatgaagag cctggttaca gtttcgactc 399 
His His He 



55 

ctctctgeaa gtgaatagge ecagaaaggt gtaagagact ctttgaatgg acataaaatt 459 

ctgcttgtta agaaeaagtt tggctctggt aaetgacett caaagctaaa atataaaact 519 

atttgggaag tatgaaacga tgtctcgtga tctggtgtac ecttatecct gtgacgtttg 579 

gectctgaca atactggtat aattgtaaat aatgtcaaac tccgttttct agcaagtatt 639 

aagggagctg tgtctgaaat ggeactgtct tgtcagtcat ttctgtttac ctttttcttc 699 

tgcecagagt gtatttgtga aragtctctt atattatgtt ttgtggaaat cagcacacaa 759 

ccacaatgac atttaagcae aggatcatta ttagtctatg tttttaataa acatatcaat 819 

tatgaaaaaa aaaaaaaaaa 839 



<210> 37 

<211> 1114 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 117.. 362 



<220> 

<221> sig_peptide 

<222> 117. .245 

<223> Von Heijne matrix 

score 5.65876793443964 
VVSFALIATLVYA/LF 



<220> 

<221> polyA_site 
<222> 1083.. 1114 



<400> 37 

ataaggggae gtctagtggg ttgcecggga ggggtggcgg gagcggtcct ggaaataate 60 
tgtcctctgt cgccgggaac tggcgaggta gttccttcgc ggtggagaga cctgga atg 119 

Met 

gcc aaa tat caa ggt gaa gtt caa agt ttg aaa ctg gat gat gat tea 167 
Ala Lys Tyr Gin Gly Glu Val Gin Ser Leu Lys Leu Asp Asp Asp Ser 

-40 -35 -30 

gtt ata gaa gga gta age gac caa gta ett gtg gca gtt gtg gtc agt 215 
Val He Glu Gly Val Ser Asp Gin Val Leu Val Ala Val Val Val Ser 

-25 -20 -15 

ttc get ttg att get acc ctg gta tat gca ett ttc 
Phe Ala Leu He Ala Thr Leu Val Tyr Ala Leu Phe Arg Asn Val 
-10 -5 1 

caa aae att cac eca gaa aae cag gag eta gta agg 
Gin Asn He His Pro Glu Asn Gin Glu Leu Val Arg 

10 15 
cag ett caa aca gaa cag gat gca cct get gac teg 
Gin Leu Gin Thr Glu Gin Asp Ala Pro Ala Asp Ser 

25 30 



aga 


aat 


gta 


eat 


263 


Arg 


Asn 


Val 

5 


His 




gta 


ett 


cga 


gaa 


311 


Val 


Leu 


Arg 


Glu 






20 








aca 


gca 


gtt 


eta 


359 


Thr 


Ala 


Val 


Leu 




35 
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32 

cac tgacatgtac tgtcccatct gcctgcacca agcctccttc ccggtggaga 412 

His 

ccaactgtgg acatcttttt tgtggtgcct gcattattgc ttactggcga tatggttcat 472 

ggcttggggc aatcagttgt ccaatctgta gacaaacgag acatggccac attgcattgt 532 

ccagaacagc ttagaccatg acagttagca tcgaagccac ctgaggaggg aggcagtaac 592 

cttactccta acagtatttg gtgaagatga tcagtctcag gatgttctga gattgcatca 652 

ggatattaat gattataacc ggagattctc agggcaaccc agatctgtaa gtaatgctaa 712 

agcatgttca aagttagagg aagacacatt tcttctcttt tgtaaagtga ggtttaccaa 772 

caagtattct ttgactatga gaaatcttgg ccaggcacag tagctaacgc ctataatcct 832 

agcactttgg gaggccaagg caggtggatc acttgagccc aggagtttga gaccagcctt 892 

ggaaacatga tgaaacccca tctctagaaa aaacaccaaa aaattggaca agagtgttgg 952 

cacatgcctg tagtccctgc ttcttgggag gctgaaatgg gaggatcacc tgagcccagg 1012 

aggttgaggc tatagtgagc catgatcgca ctattgcact cccacctggg tggcagtgag 1072 

acccttcctc aaaaaacaag aaaagaaaaa aaaaaaaaaa aa 1114 

<210> 38 

<211> 1112 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 117. .362 



<220> 

<221> sig_peptide 

<222> 117.. 245 

<223> Von Heijne matrix 

score 5.65876793443964 
seq VVSFALIATLVYA/LF 



<220> 

<221> polyA_site 
<222> 1097. .1112 



<220> 

<221> polyA_signal 
<222> 1089. .1094 



<400> 38 

ataaggggac gtctagtggg ttgcccggga ggggtggcgg gagcggtcct ggaaataatc 60 
tgtcctctgt cgccgggaac tggcgaggta gttccttcgc ggtggagaga cctgga atg 119 



Met 



167 



215 



263 



311 



359 



cac tgacatgtac tgtcccatct gcctgcacca agcctccttc ccggtggaga 412 
His 

ccaactgtgg acatcttttt tgtggtgcct gcattattgc ttactggcga tatggttcat 472 

ggcttggggc aatcagttgt ccaatctgta gacaaacgag acatggccac attgcattgt 532 

ccagaacagc ttagaccatg acagttagca tcgaagccac ctgaggaggg aggcagtaac 592 



gcc 


aaa 


tat 


caa 


ggt 


gaa 


gtt 


caa 


agt 


ttg 


aaa 


ctg 


gat 


gat 


gat 


tea 


Ala 


Lys 


Tyr 


Gin 


Gly 


Glu 


Val 


Gin 


Ser 


Leu 


Lys 


Leu 


Asp 


Asp 


Asp 


Ser 






-40 










-35 










-30 








gtt 


ata 


gaa 


gga 


gta 


age 


gac 


caa 


gta 


ctt 


gtg 


gca 


gtt 


gtg 


gtc 


agt 


Val 


lie 


Glu 


Gly 


Val 


Ser 


Asp 


Gin 


Val 


Leu 


Val 


Ala 


Val 


Val 


Val 


Ser 




-25 










-20 










-15 










ttc 


get 


ttg 


att 


get 


acc 


ctg 


gta 


tat 


gca 


ctt 


ttc 


aga 


aat 


gta 


cat 


Phe 


Ala 


Leu 


lie 


Ala 


Thr 


Leu 


Val 


Tyr 


Ala 


Leu 


Phe 


Arg 


Asn 


Val 


His 


-10 










-5 










1 








5 




caa 


aac 


att 


cac 


cca 


gaa 


aac 


cag 


gag 


eta 


gta 


agg 


gta 


ctt 


ega 


gaa 


Gin 


Asn 


lie 


His 


Pro 


Glu 


Asn 


Gin 


Glu 


Leu 


Val 


Arg 


Val 


Leu 


Arg 


Glu 








10 










15 










20 






cag 


ctt 


caa 


aca 


gaa 


cag 


gat 


gca 


cct 


get 


gac 


teg 


aea 


gca 


gtt 


eta 


Gin 


Leu 


Gin 


Thr 


Glu 


Gin 


Asp 


Ala 


Pro 


Ala 


Asp 


Ser 


Thr 


Ala 


Val 


Leu 






25 










30 










35 









wo 02/083898 

33 

cttactccta acagtatttg gtgaagatga tcagtctcag gatgttctga 

ggatattaat gattataacc ggagattctc agggcaaccc agatctgtaa 

agcatgttca aagttagagg aagacacatt tcttctcttt tgtaaagtga 

caagtattct ttgactatga gaaatcttgg ccaggcacag tagtaacgcc 

gcactttggg aggccaaggc aggtggatca cttgagccca ggagtttgag 

gaaacatgat gaaaccccat ctctagaaaa aacaccaaaa aattggacaa 

acatgcctgt agtccctgct tcttgggagg ctgaaatggg aggatcacct 

ggttgaggct atagtgagcc atgatcgcac tattgcactc ccacctgggt 

cccttcctca aaaaacaaga aaagaaaaaa aaaaaaaaaa 



PCT/IBOl/00914 



gattgcatca 
gtaatgctaa 
ggtttaccaa 
tataatccta 
accagccttg 
gagtgttggc 
gagcccagga 
ggcagtgaga 



652 
712 
772 
832 
892 
952 
1012 
1072 
1112 



<210> 39 

<211> 1112 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 117. .362 



<220> 

<221> sig_peptide 

<222> 117. .245 

<223> Von Heijne matrix 

score 5.65876793443964 
seq VVSFALIATLVYA/LF 



<220> 

<221> polyA_site 
<222> 1097.. 1112 



<220> 

<221> polyA_signal 
<222> 1089. .1094 



<400> 39 

ataaggggac gtctagtggg ttgcccggga ggggtggcgg gagcggtcct ggaaataatc 60 
tgtcctctgt cgccgggaac tggcgaggta gttccttcgc ggtggagaga cctgga atg 119 

Met 



gcc 


aaa 


tat 


caa 


ggt 


gaa 


gtt 


caa 


agt 


ttg 


aaa 


ctg 


gat 


gat 


gat 


tea 


167 


Ala 


Lys 


Tyr 
-40 


Gin 


Gly 


Glu 


Val 


Gin 
-35 


Ser 


Leu 


Lys 


Leu 


Asp 

-30 


Asp 


Asp 


Ser 




gtt 


ata 


gaa 


gga 


gta 


age 


gac 


caa 


gta 


ctt 


gtg 


gca 


gtt 


gtg 


gtc 


agt 


215 


Val 


He 
-25 


Glu 


Gly 


Val 


Ser 


Asp 
-20 


Gin 


Val 


Leu 


Val 


Ala 
-15 


Val 


Val 


Val 


Ser 




ttc 


get 


ttg 


att 


get 


acc 


ctg 


gta 


tat 


gca 


ctt 


ttc. 


aga 


aat 


gta 


cat 


263 


Phe 


Ala 


Leu 


He 


Ala 


Thr 


Leu 


Val 


Tyr 


Ala 


Leu 


Phe 


Arg 


Asn 


Val 


His 




-10 










-5 










1 








5 






caa 


aac 


att 


cac 


cca 


gaa 


aac 


cag 


gag 


eta 


gta 


agg 


gta 


ctt 


cga 


gaa 


311 


Gin 


Asn 


He 


His 
10 


Pro 


Glu 


Asn 


Gin 


Glu 

15 


Leu 


Val 


Arg 


Val 


Leu 

20 


Arg 


Glu 




cag 


ctt 


caa 


aca 


gaa 


cag 


gat 


gca 


cct 


gst 


gac 


teg 


aca 


gca 


gtt 


eta 


359 


Gin 


Leu 


Gin 
25 


Thr 


Glu 


Gin 


Asp 


Ala 
30 


Pro 


Xaa 


Asp 


Ser 


Thr 
35 


Ala 


Val 


Leu 




cac 


tgacatgtac tgtcccatct gcctgcacca agcctccttc 


ccggtggaga 




412 



His 

ccaactgtgg acatcttttt tgtggtgcct gcattattgc ttactggcga tatggttcat 472 

ggcttggggc aatcagttgt ccaatctgta gacaaacgag acatggccac attgcattgt 532 

ccagaacagc ttagaccatg acagttagca tcgaagecac ctgaggaggg aggcagtaac 592 

cttactccta acagtatttg gtgaagatga tcagtctcag gatgttctga gattgcatca 652 

ggatattaat gattataacc ggagattctc agggcaaccc agatctgtaa gtaatgctaa 712 

agcatgttca aagttagagg aagacacatt tcttctcttt tgtaaagtga ggtttaccaa 772 

caagtattct ttgactatga gaaatcttgg ccaggcacag tagtaacgcc tataatccta 832 

gcactttggg aggccaaggc aggtggatca cttgagccca ggagtttgag accagccttg 892 



wo 02/083898 



PCT/IBOl/00914 



aattggacaa gagtgttggc 952 
aggatcacct gagcccagga 1012 
ccacctgggt ggcagtgaga 1072 

1112 

<210> 40 

<211> 1112 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 117.. 362 



gaaacatgat 
acatgcctgt 
ggttgaggct 
cccttcctca 



gaaaccccat 
agtccctgct 
atagtgagcc 
aaaaacaaga 



ctctagaaaa 
tcttgggagg 
atgatcgcac 
aaagaaaaaa 



34 

aacaccaaaa 
ctgaaatggg 
tattgcactc 
aaaaaaaaaa 



<220> 

<221> sig_peptide 

<222> 117. .245 

<223> Von Heijne matrix 

score 5.65876793443964 
seq VVSFALIATLVYA/LF 



<220> 

<221> polyA_site 
<222> 1097.. 1112 



<220> 

<221> polyA_signal 
<222> 1089.. 1094 



<400> 40 

ataaggggac gtctagtggg ttgcccggga ggggtggcgg gagcggtcct ggaaataatc 60 
tgtcctctgt cgccgggaac tggcgaggta gttccttcgc ggtggagaga cctgga atg 119 

Met 



gcc 


aaa 


tat 


caa 


ggt 


gaa 


gtt 


caa 


agt 


ttg 


aaa 


ctg 


gat 


gat 


gat 


tea 


167 


Ala 


Lys 


Tyr 

-40 


Gin 


Gly 


Glu 


Val 


Gin 
-35 


Ser 


Leu 


Lys 


Leu 


Asp 
-30 


Asp 


Asp 


Ser 




gtt 


ata 


gaa 


gga 


gta 


age 


gac 


caa 


gta 


ctt 


gtg 


gca 


gtt 


gtg 


gtc 


agt 


215 


Val 


He 
-25 


Glu 


Gly 


Val 


Ser 


Asp 
-20 


Gin 


Val 


Leu 


Val 


Ala 
-15 


Val 


Val 


Val 


Ser 




ttc get 


ttg 


att 


get 


ace 


ctg 


gta 


tat 


gca 


ctt 


ttc 


aga 


aat 


gta 


cat 


263 


Phe 


Ala 


Leu 


He 


Ala 


Thr 


Leu 


Val 


Tyr 


Ala 


Leu 


Phe 


Arg 


Asn 


Val 


His 




-10 










-5 










1 








5 






caa 


aac 


att 


cac 


cca 


gaa 


aac 


cag 


gag 


eta 


gta 


agg 


gta 


ctt 


cga 


gaa 


311 


Gin 


Asn 


He 


His 
10 


Pro 


Glu 


Asn 


Gin 


Glu 
15 


Leu 


Val 


Arg 


Val 


Leu 
20 


Arg 


Glu 




cag 


ctt 


caa 


aca 


gaa 


cag 


gat 


gca 


cct 


get 


gac 


teg 


aca 


gca 


gtt 


eta 


359 


Gin 


Leu 


Gin 
25 


Thr 


Glu 


Gin 


Asp 


Ala 

30 


Pro 


Ala 


Asp 


Ser 


Thr 
35 


Ala 


Val 


Leu 





cac tgacatgtac tgtcccatct gcctgcacca agcctccttc ccggtggaga 412 
His 

ccaactgtgg acatcttttt tgtggtgcct gcattattgc ttactggcga tatggttcat 472 

ggcttggggc aatcagttgt ccaatctgta gacaaacgag acatggccac attgcattgt 532 

ccagaaeage ttagaccatg acagttagca tcgaagccac ctgaggaggg aggcagtaac 592 

cttactccta acagtatttg gtgaagatga tcagtctcag gatgttctga gattgcatca 652 

ggatattaat gattataacc ggagattetc agggcaaccc agatctgtaa gtaatgctaa 712 

agcatgttca aagttagagg aagacacatt tettctcttt tgtaaagtga ggtttaccaa 772 

caagtattct ttgactatga gaaatcttgg ecaggcacag tagtaacgcc tataatccta 832 

gcactttggg aggccaaggc aggtggatca ettgagccca ggagtttgag accagccttg 892 

gaaacatgat gaaaccccat ctctagaaaa aacaccaaaa aattggacaa gagtgttggc 952 

acatgcctgt agtccctgct tcttgggagg ctgaaatggg aggatcacct gagcccagga 1012 

ggttgaggct atagtgagcc atgatcgcac tattgcactc ccacctgggt ggcagtgaga 1072 

cccttcctca aaaaacaaga aaagaaaaaa aaaaaaaaaa 1112 



wo 02/083898 

<210> 41 
<211> 508 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 17.. 313 

<220> 

<221> sig peptide 
<222> 17.Tl00 
<223> Von Heijne matrix 

score 5.67441405983879 

seq LGYLVIiSEGAVLA/SX 

<220> 

<221> polyA_site 
<222> 493. .508 

<220> 

<221> polyA_signal 
<222> 468. .473 

<400> 41 

agcaccgaag actgcg atg act tct gca ctg acc cag ggg ctg gag cga ate 52 
Met Thr Ser Ala Leu Thr Gin Gly Leu Glu Arg lie 
-25 -20 



cca 


gac 


cag 


etc 


ggc 


tac 


ctg 


gta 


ctg 


agt 


gaa 


ggt 


gca 


gtg 


ctg 


gcg 


100 


Pro 


Asp 
-15 


Gin 


Leu 


Gly 


Tyr 


Leu 
-10 


Val 


Leu 


Ser 


Glu 


Gly 
-5 


Ala 


Val 


Leu 


Ala 




tea 


tyt 


ggg 


gac 


ctg 


gag 


aat 


gat 


gag 


cag 


gca 


gcc 


agt 


gcc 


ate 


tyt 


148 


Ser 


Xaa 


Gly 


Asp 


Leu 


Glu 


Asn 


Asp 


Glu 


Gin 


Ala 


Ala 


Ser 


Ala 


He 


Xaa 




1 








5 










10 










1-5 






gag 


ctg 


gtc 


age 


aca 


gee 


tgc 


ggt 


ttc 


egg 


ctg 


cac 


cgc 


ggc 


atg 


aat 


196 


Glu 


Leu 


Val 


Ser 
20 


Thr 


Ala 


Cys 


Gly 


Phe 
25 


Arg 


Leu 


His 


Arg 


Gly 
30 


Met 


Asn 




gtg 


ccc 


ttc 


aag 


cgc 


ctg 


tct 


gtg 


gtc 


ttt 


gga 


gaa 


cac 


aca 


ctg 


ctg 


244 


Val 


Pro 


Phe 
35 


Lys 


Arg 


Leu 


Ser 


Val 
40 


Val 


Phe 


Gly 


Glu 


His. 
45 


Thr 


Leu 


Leu 




gtg 


a eg 


gtg 


tea 


gga 


cag 


agg 


gtg 


ttt 


gtg 


gtg 


aag 


agg 


cag 


aae 


cga 


292 


Val 


Thr 

50 


Val 


Ser 


Gly 


Gin 


Arg 

55 


Val 


Phe 


Val 


Val 


Lys 

60 


Arg 


Gin 


Asn 


Arg 




ggt 


egg 


gag 


ccc 


att 


gat 


gtc 


tgagcetgee ggagggegag ggtcggagaa 


343 


Gly 


Arg 


Glu 


Pro 


He 


Asp 


Val 






















65 










70 

























gcggattggg ecttetgctc gcccacctcc cacecctacc tggacgggcc caggcttggg 403 
gactctgagc tgtgttaagg agaacaaggg caaggagacc tecctttgtg ctccctcact 463 
ccctaataaa eatgagtctg atgttctcca aaaaaaaaaa aaaaa 508 

<210> 42 

<211> 623 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 69. .365 

<220> 

<221> sig_peptide 

<222> 69.. 152 

<223> Von Heijne matrix 



PCT/IBOl/00914 

35 
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36 

score 5.54338579743238 
seq LGYLVLSEGAVLA/SS 



<220> 

<221> polyA_site 
<222> 608.. 623 

<220> 

<221> polyA_signal 
<222> 582.-587 



<400> 42 

acgtgaccgg ggcctgaagc cggaagctac ctatctggta gggagctccc ccagcaccga 60 

agactgcg atg act tct gca ctg acc cag ggg ctg gag cga ate cca gac 110 
Met Thr Ser Ala Leu Thr Gin Gly Leu Glu Arg lie Pro Asp 

-25 -20 -15 

cag etc ggc tac ctg gta ctg agt gaa ggt gca gtg ctg gcg tea tct 158 

Gin Leu Gly Tyr Leu Val Leu Ser Glu Gly Ala Val Leu Ala Ser Ser 

-10 -5 1 

ggg gac etg gag aat gat gag eag gca gee agt gee ate tyt gag ctg 206 

Gly Asp Leu Glu Asn Asp Glu Gin Ala Ala Ser Ala lie Xaa Glu Leu 

5 10 15 

gtc age aca gee tgc ggt ttc egg etg eae egc ggc atg aat gtg cee 254 

Val Ser Thr Ala Cys Gly Phe Arg Leu His Arg Gly Met Asn Val Pro 

20 25 30 

ttc aag egc ctg tct gtg gtc ttt gga gaa cac aca ctg ctg gtg acg 302 

Phe Lys Arg Leu Ser Val Val Phe Gly Glu His Thr Leu Leu Val Thr 

35 40 45 50 

gtg tea gga cag agg gtg ttt gtg gtg aag agg cag aac cga ggt egg 350 

Val Ser Gly Gin Arg Val Phe Val Val Lys Arg Gin Asn Arg Gly Arg 

55 60 65 

gag cee att gat gtc tgagectgce ggagggcgag ggtcggagaa gcggattggg 405 
Glu Pro lie Asp Val 
70 

teetgggcct ctgtgatgag gcaggeacac etgteggtct tggettgetg etagaaetag 465 

ggccttytgs tcgcccacct eccaccccta cctggacggg cecaggcttg gggactytga 525 

getgtgttaa ggagaacaag ggcaaggaga cctccctttg tgctcectea etccctaata 585 

aacatgagtc tgatgttetc cgaaaaaaaa aaaaaaaa 623 



<210> 43 
<211> 1166 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 2. .559 



<220> 

<221> sig_peptide 
<222> 2. .46 

<223> Von Heijne matrix 

score 5.34456047740956 
seq VMAASMARGGVSA/RV 



<220> 

<221> polyA_site 
<222> 1151. .1166 



<400> 43 

g atg eat gtc atg gee gee tec atg gee egg gga ggc gtg agt gee agg 

Met His Val Met Ala Ala Ser Met Ala Arg Gly Gly Val Ser Ala Arg 
-15 -10 -5 1 



49 
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gtt 


eta 


ctg 


cag 


get 


gcc 


agg 


ggc 


acc 


tgk 


tgg 


aac 


aga 


cct 


ggg 


ggc 


97 


Val 


Leu 


Leu 


Gin 


Ala 


Ala 


Arg 


Gly 


Thr 


Xaa 


Trp 


Asn 


Arg 


Pro 


Gly 


Gly 










5 










10 










15 








act 


tec 


ggg 


teg 


ggg 


gag 


ggg 


gtg 


gcg 


ctg 


ggg 


aca 


acc 


aga 


aag 


ttt 


145 


Thr 


Ser 


Gly 


Ser 


Gly 


Glu 


Gly 


Val 


Ala 


Leu 


Gly 


Thr 


Thr 


Arg 


Lys 


Phe 








20 










25 










30 










caa 


gcg 


aca 


kgc 


teg 


cgc 


ccg 


get 


gga 


gag 


gag 


gac 


gcg 


ggc 


ggc 


ccg 


193 


Gin 


Ala 


Thr 


Xaa 


Ser 


Arg 


Pro 


Ala 


Gly 


Glu 


Glu 


Asp 


Ala 


Gly 


Gly 


Pro 






35 










40 










45 












gag 


egg 


ccc 


ggg 


gac 


gtg 


gtg 


aac 


gtg 


gtg 


ttc 


gta 


gac 


cgc 


tea 


ggc 


241 


Glu 


Arg 


Pro 


Gly 


Asp 


Val 


Val 


Asn 


Val 


Val 


Phe 


Val 


Asp 


Arg 


Ser 


Gly 




50 










55 










60 










65 




cag 


egg 


ate 


cca 


gtg 


agt 


ggc 


aga 


gtc 


ggg 


gac 


aat 


gtt 


ctt 


cac 


ctg 


289 


Gin Arg 


lie 


Pro 


Val 


Ser 


Gly 


Arg 


Val 


Gly 


Asp 


Asn 


Val 


Leu 


His 


Leu 












70 










75 










80 






gcc 


cag 


cgc 


cac 


ggg 


gtg 


gac 


ctg 


gaa 


ggg 


gcc 


tgt 


gaa 


gcc 


tec 


ctg 


337 


Ala 


Gin 


Arg 


His 


Gly 


Val 


Asp 


Leu 


Glu 


Gly 


Ala 


Cys 


Glu 


Ala 


Ser 


Leu 










85 










90 










95 








gcc tgc 


tec 


acc 


tgc 


cat 


gtg 


tat 


gtg 


agt 


gaa 


gac 


cac 


ctg 


gat 


etc 


385 


Ala Cys 


Ser 


Thr 


Cys 


His 


Val 


Tyr 


Val 


Ser 


Glu 


Asp 


His 


Leu 


Asp 


Leu 








100 










105 










110 










ctg 


cat 


cct 


ccc 


gag 


gag 


agg 


gaa 


gac 


gac 


atg 


eta 


gac 


atg 


gcc 


ccc 


433 


Leu 


Pro 


Pro 


Pro 


Glu 


Glu 


Arg 


Glu 


Asp 


Asp 


Met 


Leu 


Asp 


Met 


Ala 


Pro 






115 










120 










125 












etc 


etc 


cag 


gag 


aac 


teg 


egg 


ctg 


ggc 


tgc 


cag 


att 


gtg 


ctg 


aca 


ccg 


481 


Leu 


Leu 


Gin 


Glu 


Asn 


Ser 


Arg 


Leu 


Gly 


Cys 


Gin 


lie 


Val 


Leu 


Thr 


Pro 




130 










135 










140 










145 




gag 


ctg 


gaa 


rga 


gcg 


gaa 


ttc 


acc 


ctg 


ccc 


aag 


ate 


acc 


agg 


aac 


ttc 


529 


Glu 


Leu 


Glu 


Xaa 


Ala 


Glu 


Phe 


Thr 


Leu 


Pro 


Lys 


lie 


Thr 


Arg 


Asn 


Phe 












150 










155 










160 






tac 


gtg 


gat 


ggc 


cat 


gte 


ccc 


aag 


ccc 


cac 


tgacatgaac acctggacca 


579 


Tyr Val 


Asp 


Gly 


His 


Val 


Pro 


Lys 


Pro 


His 

















165 170 

ttccacattg ccatggcccc agggeccaga ttgagggaat agccaggtgc cagecctgec 639 

cagagtgcgg acaggcccgg gagagaegtg gaageccctg tgaaggacaa cacccctget 699 

tgggagagag tcccatgtcc aggctctggt ggggacaggg cccctagtgg ggtggccttc 759 

cccaggccce tgagaatcag ggtttgagta ggagtggact catattggag ctgcaataaa 819 

tcgataacac aggceccagc atgttgagtg tecttggggg acagatctgg ggtctacagg 879 

tggeteacac ctgtaatcct agcactttgg gaagecaaga tgggaggate acttgaggee 939 

aggagcttaa gaccatcctg ggcaacatgg cgagacctcg tetctataaa aacagtaaaa 999 

attagcgtgg agtggtggta agtgcctgta gtcccagcta ctcgggagac tgaggtggga 1059 

ggatcactta gcccagggga tttaggctgc agtggagcgg tgatcaggcc attccactcc 1119 

agcctgggtg acagagtgag gccctgtetc taaaaaaaaa aaaaaaa 1166 



<210> 44 

<211> 649 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 18.. 386 



<220> 

<221> sig_peptide 

<222> 18. .161 

<223> Von Heijne matrix 

score 5.26064226788865 
seq LLMLLLFLSELQY/YL 



<220> 

<221> polyA site 
<222> 634.. 649 



wo 02/083898 



PCT/reOl/00914 



38 

<220> 

<221> polyA_signal 
<222> 611. .616 

<400> 44 

ctttccggcc ggtcccc atg gag gcg ctg ggg aag ctg aag cag ttc gat 50 

Met Glu Ala Leu Gly Lys Leu Lys Gin Phe Asp 
-45 -40 



gcc 


tac 


ccc 


aag 


act 


ttg 


gag 


gac 


ttc 


egg 


gtc 


aag 


acc 


tgc 


ggg 


ggc 


98 


Ala 


Tyr 


Pro 
-35 


Lys 


Thr 


Leu 


Glu 


Asp 
-30 


Phe 


Arg 


Val 


Lys 


Thr 
-25 


Cys 


Gly 


Gly 




gcc 


acc 


gtg 


acc 


att 


gtc 


agt 


ggc 


ctt 


etc 


atg 


ctg 


eta 


ctg 


ttc 


ctg 


146 


Ala 


Thr 
-20 


Val 


Thr 


He 


Val 


Ser 
-15 


Gly 


Leu 


Leu 


Met 


Leu 
-10 


Leu 


Leu 


Phe 


Leu 




tec 


gag 


ctg 


cag 


tat 


tac 


etc 


acc 


aeg 


gag 


gtg 


cat 


cct 


gag 


etc 


tac 


194 


Ser 


Glu 


Leu 


Gin 


Tyr 


Tyr 


Leu 


Thr 


Thr 


Glu 


Val 


His 


Pro 


Glu 


Leu 


Tyr 




-5 










1 








5 










10 






gtg 


gac 


aag 


teg 


egg 


gga 


gat 


aaa 


ctg 


aag 


ate 


aac 


ate 


gat 


gta 


ctt 


242 


Val 


Asp 


Lys 


Ser 
15 


Arg 


Gly 


Asp 


Lys 


Leu 
20 


Lys 


He 


Asn 


He 


Asp 
25 


Val 


Leu 




ttt 


ccg 


cac 


atg 


cct 


tgt 


gcc 


tat 


ctg 


agt 


att 


gat 


gcc 


atg 


gat 


gtg 


290 


Phe 


Pro 


His 
30 


Met 


Pro 


Cys 


Ala 


Tyr 
35 


Leu 


Ser 


He 


Asp 


Ala 
40 


Met 


Asp 


Val 




gcc 


gga 


gaa 


cag 


car 


ctg 


gat 


gtg 


gaa 


cac 


aac 


ctg 


ttc 


aag 


caa 


cga 


338 


Ala 


Gly 
45 


Glu 


Gin 


Gin 


Leu 


Asp 
50 


Val 


Glu 


His 


Asn 


Leu 
55 


Phe 


Lys 


Gin 


Arg 




eta 


gat 


aaa 


gat 


ggc 


ate 


ccc 


gtg 


age 


tea 


gag 


get 


gag 


egg 


cat 


gmt 


386 


Leu Asp 


Lys 


Asp 


Gly 


He 


Pro 


Val 


Ser 


Ser 


Glu 


Ala 


Glu 


Arg 


His 


Xaa 




60 










65 










70 










75 





taatgcagga caccagtgga accaacataa tttgctatag gactggatgt ggagtggagg 44 6 

agagcaaagt catggttgtt gattccaacg ctttcagcct gagcaaatga atgacttgag 506 

atggagatga etggaggaga aggttttttt ggtgggaggg aggagtggct attggatatc 566 

emctgetgtg taacaaagta etccaaaagt tagtagtttg atacaataaa aatgtgtttt 626 

ttcagtcaaa aaaaaaaaaa aaa 649 



<210> 45 

<211> 836 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 87.. 317 

<220> 

<221> sig_peptide 

<222> 87.. 173 

<223> Von Heijne matrix 

score 5.19818357425197 
seq LLLKLQRLPQAEP/VE 

<220> 

<221> polyA__site 
<222> 821. .836 

<220> 

<221> polyA_signal 
<222> 805. .810 



<400> 45 

gagtgtgggg ccagccgtgg aggctccagg tgttctctct gccccagcag agcccggcag 
gagecccaac aggaagceag cgcggc atg get gcc acc gac ttc gtg cag gag 



60 
113 
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Met Ala Ala Thr Asp Phe Val Gin Glu 
-25 



atg 


cgc 


gcc 


gtg 


ggc gag agg ctg 


ctg 


etc 


aag 


ctg cag 


aga 


ctg 


ccc 


161 


Met 


Arg 


Ala 


Val 


Gly Glu Arg Leu 


Leu 


Leu 


Lys 


Leu Gin 


Arg 


Leu 


Pro 




-20 








-15 






-10 








-5 




cag 


get 


gag 


cce 


gtg gag ate gtg 


gcc 


ttc 


tea 


gtc ate 


ate 


ctt 


ttc 


209 


Gin 


Ala 


Glu 


Pro 


Val Glu He Val 
1 


Ala 

5 


Phe 


Ser 


Val He 


He 
10 


Leu 


Phe 




aca 


get 


act 


gtt 


ctg ctg ttg ctg 


ctg 


ata 


gcc 


tgc age 


tgc 


tgc 


tgc 


257 


Thr 


Ala 


Thr 


Val 


Leu Leu Leu Leu 


Leu 


He 


Ala 


Cys Ser 


Cys 


Cys 


Cys 








15 




20 








25 










act 


cac 


tgc 


tgc 


tgc cet gag egg 


aga 


ggc 


agg 


aag gtc 


eag 


gtg 


eag 


305 


Thr 


His 


Cys 


Cys 


Cys Pro Glu Arg 


Arg 


Gly 


Arg 


Lys Val 


Gin 


Val 


Gin 






30 






35 








40 










ccg 


aca 


cca 


cca 


tgaeggacgg gcgatggctg aggagaaget ggagaggaga 




357 


Pro 


Thr 


Pro 


Pro 





















45 



tggccaatgc catgacacag 
ggctcccetg gececagctc 
taccaaggaa acaagggctg 
gctggggagt cagctgtttc 
actttttaac aaaacaagga 
tgtgggcaat tggcccttgg 
aggggctttg tgcaatagct 
ttctagtcca gtatttgeca 

<210> 46 
<211> 482 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 17.. 337 

<220> 

<221> sig_peptide 
<222> 17. .97 
<223> Von Heijne matrix 

score 4.95181310301952 

seq VLSTLSLVQFSPS/GR 

<220> 

<221> polyA__site 
<222> 467.. 482 

<220> 

<221> polyA_signal 
<222> 442. .447 

<400> 46 

actttagctg etgeag atg ctg ttt gaa gaa get ctg ccc etc age tgc tct 52 
Met Leu Phe Glu Glu Ala Leu Pro Leu Ser Cys Ser 
-25 -20 

gat cet gtg ctt age act ctt age ctg gtg cag ttc age ccc agt gga 100 

Asp Pro Val Leu Ser Thr Leu Ser Leu Val Gin Phe Ser Pro Ser Gly 

-15 -10 -5 1 

agg ace cag gae ctg etc tct cca ggg gtg gag aac ctg teg gtg ctg 148 

Arg Thr Gin Asp Leu Leu Ser Pro Gly Val Glu Asn Leu Ser Val Leu 

5 10 15 

gac gtg tec cet ctg ggc ttg gcc tgc tgt ctg etc act etc ace atg 196 

Asp Val Ser Pro Leu Gly Leu Ala Cys Cys Leu Leu Thr Leu Thr Met 
20 25 30 



geeateagee tggcectgea geccttacce cteaagacca 417 

tggcccagcc eaggtaeetg gacactgaca acttgagcee 477 

gtataggtge aaacetctea tetgecagtg gacactgggt 537 

aaagactggg tcaactgcet gggcttcttc gectacctgc 597 

agtaggggtc cecataeett gatggagaac agtceccace 657 

ggetetgetg atacatgeca aagaggagca aggeaatcag 717 

tctgcatccg ageteccgee agagegtgag catgtcagta 777 

gtttceaagt aaaagctttt gtgaaaaaaa aaaaaaaaa 836 



wo 02/083898 PCT/IBO 1/009 14 

40 

tec tgc cca ggg cct gac cct cct gag ggg ccc ggg acc cag cgt gtg 244 
Ser Cys Pro Gly Pro Asp Pro Pro Glu Gly Pro Gly Thr Gin Arg Val 

35 40 45 

tgg caa ggg get eta egg ate eta cag etc cca gga gee cca gat ggg 292 
Trp Gin Gly Ala Leu Arg He Leu Gin Leu Pro Gly Ala Pro Asp Gly 
50 55 60 65 

gtt tea cca tat cag cca gtc tgg tct cga act cct gac etc aag 337 
Val Ser Pro Tyr Gin Pro Val Trp Ser Arg Thr Pro Asp Leu Lys 

70 75 80 

tgattcgeca gectcgtcct cccaaagtgc tgggattaea gatgtgagee accatgecca 397 
tggagaatgt attacttttg taatttaaaa aataatagta ataaaataaa gatctttaca 457 
atgtttcaga aaaaaaaaaa aaaaa 482 



<210> 47 
<211> 806 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 136.. 471 



<220> 

<221> sig_peptide 

<222> 136.. 252 

<223> Von Heijne matrix 

score 4.89253405263876 
seq IILFSAIVGFIYG/YV 



<220> 

<221> polyA_site 
<222> 792. .806 



<220> 

<221> polyA_signal 
<222> 775.. 780 



<400> 47 

ttcgctetcc cgggcttaga aggcccggct actgacgegc agtgccagac ettaccccte 60 
acggtcetta agtctcggtc gccctcgect cgcagcctgc cacccgcgct cagctgeccg 120 
cctcctcagc cagcc atg ctg gag cat ctg age teg etg ccc aeg cag atg 171 
Met Leu Glu His Leu Ser Ser Leu Pro Thr Gin Met 
-35 -30 



gat 


tac 


aag 


ggc 


cag 


aag 


eta 


get 


gaa 


cag 


atg 


ttt 


cag 


gga 


att 


att 


219 


Asp Tyr 


Lys 


Gly 


Gin 


Lys 


Leu 


Ala 


Glu 


Gin 


Met 


Phe 


Gin 


Gly 


He 


He 








-25 










-20 










-15 










ett 


ttt 


tct 


gea 


ata 


gtt 


gga 


ttt 


ate 


tac 


ggg 


tac 


gtg 


get 


gaa 


cag 


267 


Leu 


Phe 
-10 


Ser 


Ala 


lie 


Val 


Gly 
-5 


Phe 


He 


Tyr 


Gly 


Tyr 
1 


Val 


Ala 


Glu 


Gin 
5 




ttc 


ggg 


tgg 


act 


gtc 


tat 


ata 


gtt 


atg 


gcc 


gga 


ttt 


get 


ttt 


tea 


tgt 


315 


Phe 


Gly 


Trp 


Thr 


Val 
10 


Tyr 


lie 


Val 


Met 


Ala 
15 


Gly 


Phe 


Ala 


Phe 


Ser 
20 


Cys 




ttg 


ctg 


aca 


ett 


cct 


cca 


tgg 


ccc 


ate 


tat 


cge 


egg 


eat 


cct 


etc 


aag 


363 


Leu 


Leu 


Thr 


Leu 
25 


Pro 


Pro 


Trp 


Pro 


He 
30 


Tyr 


Arg 


Arg 


His 


Pro 
35 


Leu 


Lys 




tgg 


tta 


cct 


gtt 


caa 


gca 


cag 


acg 


aca 


aga 


aac 


cag 


ggg 


aaa 


gaa 


aaa 


411 


Trp 


Leu 


Pro 
40 


Val 


Gin 


Ala 


Gin 


Thr 
45 


Thr 


Arg 


Asn 


Gin 


Gly 
50 


Lys 


Glu 


Lys 




tta 


aga 


ggc 


atg 


eta 


aaa 


ata 


att 


gag 


gtt 


ttc 


atg 


att 


cag 


cac 


etg 


459 


Leu 


Arg 
55 


Gly 


Met 


Leu 


Lys 


He 
60 


He 


Glu 


Val 


Phe 


Met 
65 


He 


Gin 


His 


Leu 





ett ttg ttt ctg tgagatgagc taaattgctt tcatacccca gataagagct 511 
Leu Leu Phe Leu 
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70 

aaaaccacct aatgctctta tggcacagct gtgtatagat ttagttctct ttatacttca 571 

tttctagccc agttgggttt tgatttatat aagtagttta gaccttctct tcataatctt 631 

gctctgagat ggggaacaga acacacaagt atgaagtttc tttcaggtgt aaataatgaa 691 

aaataaatgc ctcataaatg atagtacaat gtaactatca aagttttata attcattatg 751 

agttaaccat tttaatgttt ccaattaaac ctcatagtgc aaaaaaaaaa aaaaa 806 

<210> 48 

<211> 582 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 148.. 360 



<220> 

<221> sigjpeptide 

<222> 148.. 216 

<223> Von Heijne matrix 

score 5.918755252201 
SALMLLPCRPVLT/SV 



<220> 

<221> polyA_site 
<222> 566.. 582 

<220> 

<221> polyA_signal 
<222> 544. .549 

<400> 48 

aacgtcatct aggagcaccg agcagcttgg ctaaaagtaa gggtgtcgtg ctgatggccc 60 
tgtgcgcact gacccgckct sygctctctg aacctggcgc ccccgaccgt cgccgcccct 120 
gccccgagtc tgttccccgc cgcccag atg atg aac aat ggc etc etc eaa eag 174 

Met Met Asn Asn Gly Leu Leu Gin Gin 
-20 -15 



ccc 


tct 


gee 


ttg 


atg 


ttg 


etc 


ccc 


tgc 


cgc 


cca 


gtt 


ctt 


act 


tct 


gtg 


222 


Pro 


Ser 


Ala 


Leu 


Met 


Leu 


Leu 


Pro 


Cys 


Arg 


Pro 


Val 


Leu 


Thr 


Ser 


Val 












-10 










-5 










1 






gee 


ctt 


aat 


gee 


aac 


ttt 


gtg 


tec 


tgg 


aag 


agt 


egt 


aec 


aag 


tac 


aec 


270 


Ala 


Leu 


Asn 


Ala 


Asn 


Phe 


Val 


Ser 


Trp 


Lys 


Ser 


Arg 


Thr 


Lys 


Tyr 


Thr 








5 










10 










15 










att 


aca 


oca 


gtg 


aag 


atg 


agg 


aag 


tct 


ggg 


ggc 


cga 


gac 


eac 


aca 


ggt 


318 


He 


Thr 


Pro 


Val 


Lys 


Met 


Arg 


Lys 


Ser 


Gly 


Gly 


Arg 


Asp 


His 


Thr 


Gly 






20 










25 










30 












get 


gga 


aac 


gtg 


cgt 


age 


aae 


agt 


agg 


ceg 


agt 


ate 


caa 


egt 






360 


Ala Gly Asn Val 


Arg 


Ser 


Asn 


Ser 


Arg 


Pro 


Ser 


He 


Gin 


Arg 









35 40 45 



tgatcataac aaacgggtca ttggcaagge aggtegcaac egctggctgg gcaagaggcc 420 

taacagtggg cggtggcacc gcaagggggg ctgggctggc cgaaagattc ggccaetacc 480 

ccccatgaag agttacgtga agctgccttc tgcttctgcc caaagctgat atecetgtac 540 

tetaataaaa tgcccccecc ccctcaaaaa aaaaaaaaaa aa 582 



<210> 49 

<211> 583 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 149. .361 



wo 02/083898 
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<220> 

<221> sig_pepticle 

<222> 149. .217 

<223> Von Heijne matrix 

score 5.918755252201 

SALMLLPCRPVLT/SV 



<220> 

<221> polyA_site 
<222> 567. .583 



<220> 

<221> polyA_signal 
<222> 545. .550 



<400> 49 

aacgtcatct aggagcaccg agcagcttgg ctaaaagtaa gggtgtcgtg ctgatggccc 60 
tgtgcgcact gacccgcgct ctgcstctct gaacctggcg cccccgaccg tcgccgcccc 120 
tgccccgagt ctgttccccg ccgcccag atg atg aac aat ggc etc etc caa 172 

Met Met Asn Asn Gly Leu Leu Gin 
-20 

cag ccc tct gcc ttg atg ttg etc ccc tgc cgc cca gtt ctt act tct 220 

Gin Pro Ser Ala Leu Met Leu Leu Pro Cys Arg Pro Val Leu Thr Ser 

-15 -10 -5 1 

gtg gcc ctt aat gcc aac ttt gtg tec tgg aag agt cgt ace aag tac 268 

Val Ala Leu Asn Ala Asn Phe Val Ser Trp Lys Ser Arg Thr Lys Tyr 

5 10 15 

ace att aca cca gtg aag atg agg aag tct ggg ggc cga gac cac aca 316 
Thr He Thr Pro Val Lys Met Arg Lys Ser Gly Gly Arg Asp His Thr 

20 25 30 

ggt get gga aac gtg cgt age aac agt agg ccg agt ate caa cgt 361 
Gly Ala Gly Asn Val Arg Ser Asn Ser Arg Pro Ser He Gin Arg 

35 40 45 

tgatcataac aaacgggtea ttggcaagge aggtegcaac cgctggctgg gcaagaggcc 421 
taacagtggg cggtggcacc gcaagggggg ctgggctgge cgaaagattc ggccactacc 481 
ccccatgaag agttacgtga agctgccttc tgcttctgcc caaagctgat atccctgtac 541 
tctaataaaa tgecccccec cectcaaaaa aaaaaaaaaa aa 583 



<210> 50 

<211> 746 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 83.. 358 



<220> 

<221> sig_peptide 

<222> 83. . 127 

<223> Von Heijne matrix 

score 4.68058603039206 
VLAGSLLGPTSRS/AA 



<400> 50 

etttttttec tttgttccag ggctggagcg gctctgggct ccggaategc ecgcagccgg 60 
tactgeggga cccactgcgg at atg get gtc ttg get gga tec ctg ttg ggc 112 

Met Ala Val Leu Ala Gly Ser Leu Leu Gly 

-15 -10 
ccc acg agt agg teg gea gcg ttg ctg ggt ggc agg tgg etc cag ccc 160 
Pro Thr Ser Arg Ser Ala Ala Leu Leu Gly Gly Arg Trp Leu Gin Pro 
-5 15 10 

egg gcc tgg ctg ggg ttc cca gac gcc tgg ggc etc ccc ace ccg cag 208 



wo 02/083898 PCT/IBOl/00914 



Arg Ala 


Trp 


Leu 
15 


Gly 


Phe 


Pro 


Asp Ala 
20 


43 

Trp Gly 


Leu Pro 


Thr 

25 


Pro 


Gin 




cag gcc 
Gin Ala 


egg 
Arg 
30 


ggc 
Gly 


aag 
Lys 


get 
Ala 


cgc 
Arg 


ggg 

Gly 
35 


aat 
Asn 


gag tat 
Glu Tyr 


cag ccg 
Gin Pro 
40 


age 
Ser 


aac 
Asn 


ate 
He 


256 


aaa cgc 
Lys Arg 
45 


aag 
Lys 


aac 
Asn 


aag 
Lys 


cac 
His 


ggc 
Gly 

50 


tgg 
Trp 


gtc 
Val 


egg cgc 
Arg Arg 


ctg age 
Leu Ser 
55 


acg 
Thr 


cca 
Pro 


gcc 
Ala 


304 


ggc gts 
Gly Val 
60 


cag 
Gin 


gtc 
Val 


ate 
He 


ett 
Leu 
65 


cgc 
Arg 


ega 
Arg 


atg 
Met 


etc aag 
Leu Lys 
70 


ggc cgc 
Gly Arg 


aag 
Lys 


teg 
Ser 


ctg 
Leu 
75 


352 


age cat 


tgaggatcgc gacgcagtcg gcgggaccct catggaagca 


tcgccctcgc 


408 



Ser His 

ctcggaeett gcetggcgct atttttgeag ggagctgggg agcaggaacg cctcggacct 4 68 

gagtgctctc catattgtgg ggttgaagtc tggatgggag cttgccaagt ccetttttag 528 

gctttttaat taggaageat ttcgaaeetg cgcaacagae caaagaaeag tacaaagaac 588 

atccgtgtae ceagtacect gaetaeegae tacctaeaac ccgtceetgc eccatcctga 648 

gttcttttga agetgatetc aggcategga ttatttctte tgtaaatatt tcagaatgta 708 

tctetecaag atgagagctc attaaaagac aattacaa 74 6 

<210> 51 

<211> 731 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 73. .348 

<220> 

<221> sig_peptide 

<222> 73. .117 

<223> Von Heijne matrix 

score 4.68058603039206 

seq VLAGSLLGPTSRS/AA 

<400> 51 

gttgttccag ggctggagcg gctctgggct ccggaatcgc cegcagccgg taetgcggga 60 

eccactgeag at atg get gtc ttg get gga tec ctg ttg ggc cec aeg agt 111 

Met Ala Val Leu Ala Gly Ser Leu Leu Gly Pro Thr Ser 
-15 -10 -5 



agg 


teg 


gca 


gcg 


ttg 


ctg 


ggt 


ggc 


agg 


tgg 


etc 


car 


cec 


egg 


gee 


tgg 


159 


Arg 


Ser 


Ala 


Ala 


Leu 


Leu 


Gly 


Gly 


Arg 


Trp 


Leu 


Gin 


Pro 


Arg 


Ala 


Trp 








1 








5 










10 












ctg 


ggg 


ttc 


cca 


gac 


gcc 


tgg 


ggc 


etc 


ccc 


ace 


ccg 


cag 


cag 


gcc 


egg 


207 


Leu Gly 


Phe 


Pro 


Asp 


Ala 


Trp 


Gly 


Leu 


Pro 


Thr 


Pro 


Gin 


Gin 


Ala 


Arg 




15 










20 










25 










30 




ggc 


aag 


get 


cgc 


ggg 


aat 


gag 


tat 


cag 


ccg 


age 


aat 


ate 


aaa 


cgc 


aag 


255 


Gly 


Lys 


Ala 


Arg 


Gly 


Asn 


Glu 


Tyr 


Gin 


Pro 


Ser 


Asn 


He 


Lys 


Arg 


Lys 












35 










40 










45 






aac 


aag 


cac 


ggc 


tgg 


gtc 


egg 


cgc 


ctg 


arc 


acg 


ccg 


gcc 


ggc 


gtg 


cag 


303 


Asn 


Lys 


His 


Gly 


Trp 


Val 


Arg 


Arg 


Leu 


Xaa 


Thr 


Pro 


Ala 


Gly 


Val 


Gin 










50 










55 










60 








gtc 


ate 


ett 


cgc 


ega 


atg 


etc 


aag 


ggc 


cgc 


aag 


teg 


ctg 


age 


cat 




348 


Val 


He 


Leu 


Arg 


Arg 


Met 


Leu 


Lys 


Gly 


Arg 


Lys 


Ser 


Leu 


Ser 


His 










65 










70 










75 











tgaggatcgc gacgcagtcg gcgggaccct catggaagca tcgccctcgc ctcggaeett 408 

gcetggcgct atttttgeag ggagctgggg agcaggaacg cctcggacct gagtgctctc 4 68 

catattgtgg ggttgaagtc tggatgggag cttgccaagt ccetttttag gctttttaat 528 

taggaageat ttcgaaeetg cgcaacagae caaagaaeag tacaaagaac atccgtgtae 588 

ceagtacect gaetaeegae tacctaeaac ccgtceetgc eccatcctga gttcttttga 648 

agetgatetc aggcategga ttatttctte tgtaaatatt tcagaatgta tctetecaag 708 

atgagagctc attaaaagat aat 731 
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<210> 52 
<211> 773 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 22.. 297 

<220> 

<221> sig_peptide 

<222> 22.. 66 

<223> Von Heijne matrix 

score 4.68058603039206 

seq VLAGSLLGPTSRS/AA 

<220> 

<221> polyA__site 
<222> 758. .773 

<220> 

<221> polyA_signal 
<222> 740.. 745 

<400> 52 

actgcgggac ccactgcgga t atg get gtc ttg get gga tec etg ttg ggc 51 

Met Ala Val Leu Ala Gly Ser Leu Leu Gly 

-15 -10 

ccc acg agt agg teg gca gcg ttg ctg ggt ggc agg tgg etc eag ccc 99 

Pro Thr Ser Arg Ser Ala Ala Leu Leu Gly Gly Arg Trp Leu Gin Pro 
-5 15 10 

egg gee tgg ctg ggg tte eea gac gee tgg ggc etc ecc aee ccg eag 147 

Arg Ala Trp Leu Gly Phe Pro Asp Ala Trp Gly Leu Pro Thr Pro Gin 

15 20 25 

eag gcc egg ggc aag get cgc ggg aat gar tat cag ccg age aat ate 195 

Gin Ala Arg Gly Lys Ala Arg Gly Asn Glu Tyr Gin Pro Ser Asn lie 

30 35 40 

aaa cge aag aae aag eae ggc tgg gte egg cgc etg are aeg eeg gee 243 

Lys Arg Lys Asn Lys His Gly Trp Val Arg Arg Leu Xaa Thr Pro Ala 

45 50 55 

ggc gtg mrg gte ate ett ege ega atg etc aag ggc cgc aag teg etg 291 

Gly Val Xaa Val lie Leu Arg Arg Met Leu Lys Gly Arg Lys Ser Leu 
60 65 70 75 
age eat tgaggatcgc gacgcagtcg gcgggaccct catggaagca tegccctege 347 
Ser His 

etcggaectt geetggegct atttttgeag ggagctgggg agcaggaacg ectcggacet 407 

gagtgctcte catattgtgg gtttgaagte tggatgggag cttgecaagt ecctttttag 4 67 

getttttaat taggaagcat ttegaaeetg cgeaacagae caaagaaeag tacaaagaac 527 

atccgtgtae ceagtaceet gaetaccgae tacctaeaac ecgtcectgc cecatcetga 587 

gttcttttga agetgatctc aggcategga ttatttcttc tgtaaatatt teagaatgta 647 

tetctecaag atgagagetc attaaaagat aattacaaag cttatcacat ceaaaagaat 707 

tateaataat tttgaaatat tattaaaegt gtaataaatg tteaaagttc aaaaaaaaaa 767 
aaaaaa 773 

<210> 53 
<211> 740 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 80. .355 



wo 02/083898 
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<220> 

<221> sig_peptide 

<222> 80. .124 

<223> Von Heijne matrix 

score 6.4806443024348 

seq VLAGSLLXPTSRS/AA 

<400> 53 

ataatccttt gttccagggc tggagcggct ctgggctccg gaatcgcccg cagccggtac 60 
tgcgggaccc actgcggat atg get gtc ttg get gga tec etg ttg gre cee 112 

Met Ala Val Leu Ala Gly Ser Leu Leu Xaa Pro 













-15 








-10 








-5 




acg 


agt 


agg 


teg 


gea 


gcg 


ttg 


ctg 


ggt 


rgc 


agg 


tgg 


etc 


cag 


ecc 


egg 


160 


Thr 


Ser 


Arg 


Ser 


Ala 
1 


Ala 


Leu 


Leu 


Gly 
5 


Xaa 


Arg 


Trp 


Leu 


Gin 
10 


Pro 


Arg 




gee 


tgg 


ctg 


ggg 


ttc 


cea 


gae 


gcc 


tgg 


ggc 


etc 


cee 


ace 


ccg 


cag 


cag 


208 


Ala 


Trp 


Leu 


Gly 


Phe 


Pro 


Asp 


Ala 


Trp 


Gly 


Leu 


Pro 


Thr 


Pro 


Gin 


Gin 








15 










20 










25 










gee 


egg 


ggc 


aag 


get 


cgc 


ggg 


aat 


gag 


tat 


cag 


ccg 


age 


aac 


ate 


aaa 


256 


Ala 


Arg 


Gly 


Lys 


Ala 


Arg 


Gly 


Asn 


Glu 


Tyr 


Gin 


Pro 


Ser 


Asn 


He 


Lys 






30 










35 










40 












cgc 


aag 


aac 


aag 


cac 


ggc 


tgg 


gtc 


egg 


cgc 


ctg 


arc 


acg 


ccg 


gcc 


ggc 


304 


Arg 


Lys 


Asn 


Lys 


His 


Gly 


Trp 


Val 


Arg 


Arg 


Leu 


Xaa 


Thr 


Pro 


Ala 


Gly 




45 










50 










55 










60 




kwr 


smg 


ktc 


ate 


ctt 


cgc 


cga 


atg 


etc 


aag 


ggc 


cgc 


aag 


teg 


ctg 


age 


352 


Xaa 


Xaa 


Xaa 


lie 


Leu 


Arg 


Arg 


Met 


Leu 


Lys 


Gly 


Arg 


Lys 


Ser 


Leu 


Ser 












65 










70 










75 






cat 


tgaggatcgc gacgcagtsg scggggaacc ctcatggaag 


catcgecctc 




405 



His 

gcctcggacc ttgcctggcg 

ctgagtgcte tecatattgt 

aggettttta attaggaagc 

acatccgtgt acccagtace 

gagttetttt gaagctgate 

tatcteteca agatgagagc 

<210> 54 
<211> 638 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 137. .397 

<220> 

<221> sig_peptide 
<222> 137. .253 
<223> Von Heijne matrix 

score 4.61343345943784 

seq LQGARVLLTGANA/GV 

<220> 

<221> polyA_site 
<222> 623. .638 

<220> 

<221> polyA_signal 
<222> 616. .621 

<400> 54 

ggggtgcggt eggggacccg geaggaggcg gcegagaaga gaggaccgtg ggggcgttcg 60 



ctatttttgc agggagctgg ggagcaggaa cgcctcggac 465 

ggggttgaag tctggatggg agettgecaa gtcccttttt 525 

atttcgaacc tgcgcaacag aecaaagaac agtacaaaga 585 

ctgactaccg actacctaca acccgtccct gccccatcet 645 

teaggcateg gattatttct tctgtaaata tttcagaatg 705 

tcattaaaag acaat 740 
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cgtggctccc agcccgggac cccacccccg ctggacagtg ggggaaactg aggcctgagc 120 
gggcccacac aggacc atg aag gtg ctt etc etc aca ggg ctg ggg gee ctg 172 
Met Lys Val Leu Leu Leu Thr Gly Leu Gly Ala Leu 
-35 -30 



ttc 


ttc 


gcc 


tat 


tat 


tgg 


gat 


gac 


aac 


ttc 


gac 


cea 


gcc 


age 


etc 


cag 


220 


Phe 


Phe 


Ala 


Tyr 


Tyr 


Trp 


Asp 


Asp 


Asn 


Phe 


Asp 


Pro 


Ala 


Ser 


Leu 


Gin 








-25 










-20 










-15 










gga 


gcg 


cga 


gtg 


ctg 


ctg 


aca 


ggg 


gee 


aac 


get 


ggt 


gtt 


ggt 


gag 


gag 


268 


Gly 


Ala 


Arg 


Val 


Leu 


Leu 


Thr 


Gly 


Ala 


Asn 


Ala 


Gly 


Val 


Gly 


Glu 


Glu 






-10 










-5 










1 








5 




ctg 


gcc 


tat 


cac 


tac 


gcg 


cgt 


ctg 


ggc 


tec 


cac 


ctg 


gtg 


etc 


act 


gcc 


316 


Leu 


Ala 


Tyr 


His 


Tyr 


Ala 


Arg 


Leu 


Gly 


Ser 


His 


Leu 


Val 


Leu 


Thr 


Ala 












10 










15 










20 






cac 


act 


gag 


get 


etc 


ctg 


cag 


aag 


gca 


egg 


tgg 


etc 


aeg 


ctt 


gta 


gtc 


364 


His 


Thr 


Glu 


Ala 


Leu 


Leu 


Gin 


Lys 


Ala 


Arg 


Trp 


Leu 


Thr 


Leu 


Val 


Val 










25 










30 










35 








ago 


act 


ttg 


gga 


ggc 


cga 


gga 


aag 


tgg 


ate 


ace 


tgaggtcagg agtteaagac 


417 


Ser 


Thr 


Leu 


Gly 


Gly 


Arg 


Gly 


Lys 


Trp 


lie 


Thr 















40 45 

cagcctggcc aaeatggcga aacctggtct etactaagaa gaeaaaaatt aggccaggca 477 

tggtggctca tgcctgtaat ectagcaett tgggaggcca aggeggatgg atcacttgag 537 

gtcaggagtt caaaaceagc ctggccaaca tggtgaaace ctgtctctac taaaaataca 597 

aaaaataaaa ataaaataaa taaaeaaaaa aaaaaaaaaa a 638 



<210> 55 

<211> 577 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 18.. 392 



<220> 

<221> sig_peptide 

<222> 18. .62 

<223> Von Heijne matrix 

score 4.60228634566274 
seq LLTHNLLSSHVRG/VG 



<220> 

<221> polyA_site 
<222> 562.. 577 



<220> 

<221> polyA signal 
<222> 545. .550 



<400> 55 

tttttttgtg cggcgac atg aaa ctg ctt acc cac aat ctg ctg age teg 50 
Met Lys Leu Leu Thr His Asn Leu Leu Ser Ser 
-15 -10 -5 

cat gtg egg ggg gtg ggg tee cgt ggc ttc cec ctg egc etc cag gcc 98 
His Val Arg Gly Val Gly Ser Arg Gly Phe Pro Leu Arg Leu Gin Ala 

15 10 
ace gag gtc cgt ate tgc cet gtg gaa ttc aac cec aac ttc gtg gcg 14 6 

Thr Glu Val Arg lie Cys Pro Val Glu Phe Asn Pro Asn Phe Val Ala 

15 20 25 

cgt atr ata cct aaa gtg gag tgg teg gcg ttc ctg gag gcg gcc gat 194 
Arg Xaa lie Pro Lys Val Glu Trp Ser Ala Phe Leu Glu Ala Ala Asp 

30 35 40 

aac ttg cgt ctg ate cag gtg ccg aaa ggg ecg gtt gag gga tat gag 242 
Asn Leu Arg Leu lie Gin Val Pro Lys Gly Pro Val Glu Gly Tyr Glu 
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45 50 55 60 

gag aat gag gag ttt ctg agg acc atg cac cac ctg ctg ctg gag gtg 290 

Glu Asn Glu Glu Phe Leu Arg Thr Met His His Leu Leu Leu Glu Val 

65 70 75 

gaa gtg ata gag ggc acc ctg cag tgc ccg gaa tct gga cgt atg ttc 338 
Glu Val He Glu Gly Thr Leu Gin Cys Pro Glu Ser Gly Arg Met Phe 

80 85 90 

ccc ate age cgc ggg ate ccm amc atg ctg ctg agt gaa gag gaa act 386 
Pro He Ser Arg Gly He Pro Xaa Met Leu Leu Ser Glu Glu Glu Thr 

95 100 105 

gag agt tgattgtgsc aggcgccagt ttttcttgtt atgactgttt atttttgttg 442 
Glu Ser 
110 

atctataccc tgtttccgaa ttctgccgtg tgtatcccca acccttgacc caatgacacc 502 
aaacacagtg tttttgagct cggtattata tatttttttc tcattaaagg tttaaaacca 562 
aaaaaaaaaa aaaaa 577 



<210> 56 

<211> 521 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 82. .360 



<220> 

<221> sig_peptide 

<222> 82. .147 

<223> Von Heijne matrix 

score 4.53239781724629 
seq FFLLITIALGTKT/ES 



<220> 

<221> polyA_site 
<222> 506.. 521 



<220> 

<221> polyA_signal 
<222> 486. .491 



<400> 56 

attttctcaa caattcctca ccgcaggagc ctctgaagct cccaccaggc cagctctcct 60 



cccacaacag cttcccacag c 


atg 


aag 


ate 


tec gtg 


get 


gee 


att 


ccc 


ttc 


111 














Met 


Lys 


He 
-20 


Ser Val 


Ala 


Ala 


He 
-15 


Pro 


Phe 




ttc 


etc 


etc 


ate 


acc 


ate 


gee 


eta 


ggg 


acc aag 


act 


gaa 


tec 


tec 


tea 


159 


Phe 


Leu 


Leu 
-10 


He 


Thr 


He 


Ala 


Leu 
-5 


Gly 


Thr Lys 


Thr 


Glu 
1 


Ser 


Ser 


Ser 




egg 


gga 


cct 


tac 


cac 


ccc 


tea 


gag 


tgc 


tgc ttc 


ace 


tac 


act 


acc 


tac 


207 


Arg 


Gly 


Pro 


Tyr 


His 


Pro 


Ser 


Glu 


Cys 


Cys Phe 


Thr 


Tyr 


Thr 


Thr 


Tyr 




5 










10 








15 










20 




aag 


ate 


ccg 


cgt 


cag 


egg 


att 


atg 


gat 


tac tat 


gag 


acc 


aac 


age 


cag 


255 


Lys 


He 


Pro 


Arg 


Gin 
25 


Arg 


He 


Met 


Asp 


Tyr Tyr 
30 


Glu 


Thr 


Asn 


Ser 
35 


Gin 




tgc 


tec 


aag 


ccc 


gga 


att 


gtc 


ttc 


ate 


acc aaa 


agg 


ggc 


cat 


tec 


gty 


303 


Cys 


Ser 


Lys 


Pro 
40 


Gly 


He 


Val 


Phe 


He 
45 


Thr Lys 


Arg 


Gly 


His 
50 


Ser 


Val 




tgt 


acc 


aac 


ccc 


agt 


gae 


aag 


tgg 


gtc 


cag gac 


tat 


ate 


aag 


gae 


atg 


351 


Cys 


Thr 


Asn 
55 


Pro 


Ser 


Asp 


Lys 


Trp 
60 


Val 


Gin Asp 


Tyr 


He 
65 


Lys 


Asp 


Met 





aag gag aac tgagtgaccc agaaggggtg gcgaaggcac agctcagaga 400 
Lys Glu Asn 
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70 

cataaagaga agatgccaag gccccctcct ccacccaccg ctaactctca gccccagtca 4 60 
ccctcttgga gcttccctgc tttgaattaa agaccactca tgcccaaaaa aaaaaaaaaa 520 
a 521 



<210> 57 

<211> 588 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 93. .308 



<220> 

<221> sig_peptide 

<222> 93. .218 

<223> Von Heijne matrix 

score 4.46006149913644 
seq LRLEVAAAPGTPA/QP 



<220> 

<221> polyA site 
<222> 573. .588 



<220> 

<221> polyA^signal 
<222> 558.. 563 



<400> 57 

aaaagaaatt cccgggcccg gccagcctgt gccgctgtta tgaggagaag caaagcccct 60 
tgcagaagca cttgcgggtc tgcagcctac gc atg aat agg ttg gca ggt gtg 113 

Met Asn Arg Leu Ala Gly Val 
-40 



ggc 


tgg 


egg 


gtg 


gac 


tac 


acc 


ctg 


age 


tec 


age 


ctg 


ctg 


caa 


tec 


gtg 


161 


Gly Trp 


Arg 


Val 


Asp 


Tyr 


Thr 


Leu 


Ser 


Ser 


Ser 


Leu 


Leu 


Gin 


Ser 


Val 




-35 










-30 










-25 










-20 




gaa 


gag 


ccc 


atg 


gtg 


cac 


ctg 


egg 


ctg 


gag 


gtg 


gca 


get 


gee 


cca 


ggg 


209 


Glu 


Glu 


Pro 


Met 


Val 


His 


Leu 


Arg 


Leu 


Glu 


Val 


Ala 


Ala 


Ala 


Pro Gly 












-15 










-10 










-5 






acc 


cca 


gcc 


cag 


cct 


gtt 


gcc 


atg 


tec 


etc 


tea 


gca 


gac 


aag 


ttc 


cag 


257 


Thr 


Pro 


Ala 


Gin 
1 


Pro 


Val 


Ala 


Met 

5 


Ser 


Leu 


Ser 


Ala 


Asp 
10 


Lys 


Phe 


Gin 




gtc 


etc 


ctg 


gca 


gaa 


ctg 


aag 


cag 


gee 


cag 


acc 


ctg 


atg 


age 


tec 


ctg 


305 


Val 


Leu 
15 


Leu 


Ala 


Glu 


Leu 


Lys 
20 


Gin 


Ala 


Gin 


Thr 


Leu 
25 


Met 


Ser 


Ser 


Leu 





ggc tgaggagaag ggtgttccag gcctgtgtgg agccgccctg cccgtatgga 358 

Gly 

30 



gtcaegccct ctgaaetgct ettegggagg cagcectggt tetaggatgc tgaggecctg 418 

gcccggacte tggeetccca gatccccagc tgcetcactt ctetettgag aacttggcte 478 

agggetectg aggaectttc ccagcattae ettcccttcc cttgaaagge aattgttggc 538 

tgttttcata ageaggaaaa ataaacagaa gtacaaaaaa aaaaaaaaaa 588 



<210> 58 
<211> 521 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 113.. 364 



wo 02/083898 



PCT/IBOl/00914 



49 

<220> 

<221> sig_peptide 

<222> 113. . 172 

<223> Von Heijne matrix 

score 4.37180298395146 

SLLLSLPPHQGLT/FS 

<220> 

<221> polyA_site 
<222> 500.. 521 

<400> 58 

ttttttacat ggtgttccca cagctgggag gacacccaca tggtcggcgt gcaggatatt 60 
tcgctggacc ctagaaaagc caccacgacc tgtgggccat gatgctaccc ca atg get 118 

Met Ala 
-20 



get 


get 


get 


gtt 


cet 


tct 


ett 


ett 


ett 


tct 


ett 


cct 


cct 


cac 


cag 


ggg 


166 


Ala 


Ala 


Ala 


Val 


Pro 


Ser 


Leu 


Leu 


Leu 


Ser 


Leu 


Pro 


Pro 


His 


Gin 


Gly 










-15 










-10 










-5 








etc 


act 


ttc 


tec 


aac 


aaa 


ata 


caa 


cct 


ttt 


gga 


get 


caa 


gga 


gtc 


ttg 


214 


Leu 


Thr 


Phe 
1 


Ser 


Asn 


Lys 


He 
5 


Gin 


Pro 


Phe 


Gly 


Ala 
10 


Gin 


Gly 


Val 


Leu 




cat 


ccg 


gaa 


cca 


gga 


ctg 


cga 


gae 


tgg 


ctg 


ctg 


cca 


acg 


tgc 


tec 


aga 


262 


His 


Pro 


Glu 


Pro 


Gly 


Leu 


Arg 


Asp 


Trp 


Leu 


Leu 


Pro 


Thr 


Cys 


Ser 


Arg 




15 










20 










25 










30 




caa 


ttg 


cga 


gtc 


gea 


ctg 


ccg 


gag 


aag 


ggg 


tec 


gag 


ggc 


agt 


ctg 


tgt 


310 


Gin 


Leu 


Arg 


Val 


Ala 


Leu 


Pro 


Glu 


Lys 


Gly 


Ser 


Glu 


Gly 


Ser 


Leu 


Cys 












35 










40 










45 






caa 


acg 


cag 


ctg 


cca 


get 


act 


cca 


tgc 


ttc 


ctg 


cct 


teg 


aat 


acg 


gtg 


358 


Gin 


Thr 


Gin 


Leu 


Pro 


Ala 


Thr 


Pro 


Cys 


Phe 


Leu 


Pro 


Ser 


Asn 


Thr 


Val 










50 










55 










60 








aga 


acg 


tgaagtcatg agctgctgct aaggcatgtg gcaaccttga 


agagaaggtc 


414 



Arg Thr 

aagagctace agceaeeaaa agaatgccag caettcctgt gtetttgctt tggattcatg 474 
agaaatatae gttcetattt gctteaaaaa aaaaaaaaaa tgegaaa 521 

<210> 59 

<211> 516 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 83. .244 

<220> 

<221> sig peptide 

<222> 83.7139 

<223> Von Heijne matrix 

score 4.24406450728052 

seq LLILSKVLDDTFQ/SV 

<220> 

<221> polyA_site 
<222> 501.. 516 

<400> 59 

actgetatag egatataact cagatgttct caggacagca agtttacatg tgaatagaga 60 
gggagggacg tgttaggaag ac atg aaa ttc aat ttt gta ett ttg ata ett 112 

Met Lys Phe Asn Phe Val Leu Leu lie Leu 
-15 -10 
tec aaa gtt eta gat gae act ttc caa agt gtc aag aaa tgg tta aat 160 
Ser Lys Val Leu Asp Asp Thr Phe Gin Ser Val Lys Lys Trp Leu Asn 
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-5 1 5 

tat ttt cag ttt act tta aga aat ggt tta atg tgg cca ggt gcg gtg 208 
Tyr Phe Gin Phe Thr Leu Arg Asn Gly Leu Met Trp Pro Gly Ala Val 

10 15 20 

get cat gcc tgt aat ccc age act ggc tea cge ctg taatcccagc 254 
Ala His Ala Cys Asn Pro Ser Thr Gly Ser Arg Leu 

25 30 35 

actttgggag tccgaggtgg gcagatcatg aggtcaggag atcgagacca tcctggctaa 314 
catggtgaaa ccccgtctct actaaaaata caaaaaaatt agccaggcat ggtggcgggc 374 
gcctgtagtc ccagctactc gggaggctga ggcaggagaa tggcgtgaac ctgggaggcg 434 
gagcttacag tgageegaga tcgcgceact gcactceage etgggcgaca gagcgagact 4 94 
ctgtteaaaa aaaaaaaaaa aa 516 

<210> 60 

<211> 517 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 83.. 244 



<220> 

<221> sig_peptide 

<222> 83. .139 

<223> Von Heijne matrix 

score 4.24406450728052 
LLILSKVLDDTFQ/SV 



<220> 

<221> polyA_site 
<222> 502. .517 



<400> 60 

actgetatag cgatataact cagatgttct caggacagca agtttacatg tgaatagaga 60 
gggagggacg tgttaggaag ac atg aaa ttc aat ttt gta ctt ttg ata ctt 112 

Met Lys Phe Asn Phe Val Leu Leu lie Leu 
-15 -10 
tec aaa gtt eta gat gac act ttc caa agt gtc aag aaa tgg tta aat 160 
Ser Lys Val Leu Asp Asp Thr Phe Gin Ser Val Lys Lys Trp Leu Asn 

-5 15 
tat ttt cag ttt act tta aga aat ggt tta atg tgg cca ggt gcg gtg 208 
Tyr Phe Gin Phe Thr Leu Arg Asn Gly Leu Met Trp Pro Gly Ala Val 

10 15 20 

get cat gcc tgt aat ccc age act ggc tea cgc ctg taatcccagc 254 
Ala His Ala Cys Asn Pro Ser Thr Gly Ser Arg Leu 

25 30 35 

actttgggag tccgaggtgg gcagatcatg aggtcaggag atcgagacca tcctggctaa 314 
catggtgaaa ccccgtytct actaaaaata caaaaaaatt agccaggcat ggtggcgggc 374 
gcctgtagtc ccagctactc gggaggctga ggcaggagaa tggcgtgaac ctgggaggcg 4 34 
gagcttacag tgageegaga tcgcgceact gcactceage etgggcgaca gagcgagact 4 94 
ctgtttcaaa aaaaaaaaaa aaa 517 



<210> 61 

<211> 518 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 84.. 245 



<220> 
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<22l> sig_peptide 

<222> 84 . .140 

<223> Von Heijne matrix 

score 4.24406450728052 

LLILSKVLDDTFQ/SV 



<220> 

<221> polyA_site 
<222> 503. .518 



<400> 61 

aactgctata gcgatataac tcagatgttc tcaggacagc aagtttacat gtgaatagag 
agggagggac gtgttaggaa gac atg aaa ttc aat ttt gta ctt ttg ata ctt 

Met Lys Phe Asn Phe Val Leu Leu lie Leu 



tec aaa gtt eta gat gac act ttc caa agt 
Ser Lys Val Leu Asp Asp Thr Phe Gin Ser Val 
-5 1 
ttt act tta aga aat ggt tta atg 
Phe Thr Leu Arg Asn Gly Leu Met 
15 

tgt aat ccc age act ggc tea cgc 
Cys Asn Pro Ser Thr Gly Ser Arg 
30 

gcagatcatg aggtcaggag 
actaaaaata caaaaaaatt 



-15 
gtc aag 



tat 
Tyr 

get 
Ala 



ttt cag 
Phe Gin 
10 

cat gee 
His Ala 
25 

actttgggag tccgaggtgg 
catggtgaaa ccccgtctct 



gcctgtagtc ccagctactc gggaggctga ggcaggagaa 
gagcttacag tgagccgaga tcgcgceact gcactccage 
ctgtttcaaa aaaaaaaaaa aaa 



Lys 

tgg 
Trp 



aaa 
Lys 

cca 



tgg 
Trp 
5 

ggt 



-10 
tta aat 
Leu Asn 

gcg gtg 



Pro Gly Ala Val 
20 

taatcccagc 



ctg 
Leu 
35 

atcgagacca 
agccaggcat 

tggcgtgaac 
ctgggcgaca 



tcctggctaa 
ggtggcgggc 
ctgggaggcg 
gagegagact 



60 
113 



161 



209 



255 



315 
375 
435 
4 95 
518 



<210> 62 

<211> 548 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 57. .329 



<220> 

<221> sig_peptide 

<222> 57.. 113 

<223> Von Heijne matrix 

score 4.2427958051416 
seq FVLLLSEIVSISA/LS 



<220> 

<221> polyA_site 
<222> 533. .548 



<220> 

<221> polyA_signal 
<222> 505. .510 



<400> 62 

agttgtcttt ggtagttttt ttgcactaac ttcaggaacc agctcatgat ctcagg atg 59 

Met 

tat gga aaa ata ate ttt gta tta eta ttg tea gaa att gtg age ata 107 
Tyr Gly Lys He He Phe Val Leu Leu Leu Ser Glu He Val Ser He 

-15 -10 -5 

tea gca tta agt acc act gag gtg gca atg cac act tea ace tet tct 155 
Ser Ala Leu Ser Thr Thr Glu Val Ala Met His Thr Ser Thr Ser Ser 
15 10 



1 
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tea 


gtc 


aca 


aag 


agt 


tac 


ate 


tea 


tea 


cag 


aca aat gga gaa 


atg 


gga 


203 


Ser 


Val 


Thr 


Lys 


Ser 


Tyr 


lie 


Ser 


Ser 


Gin 


Thr Asn Gly Glu 


Met 


Gly 




15 










20 










25 




30 




caa 


ctt 


gtc 


cat 


cgt 


ttc 


act 


gta 


cca 


get 


cct gta gtg ata 


ata 


etc 


251 


Gin 


Leu 


Val 


His 


Arg 

35 


Phe 


Thr 


Val 


Pro 


Ala 
40 


Pro Val Val He 


He 
45 


Leu 




att 


att 


ttg 


tgt 


gtg 


atg 


get 


ggt 


att 


att 


gga acg ate etc 


tta 


att 


299 


lie 


lie 


Leu 


Cys 


Val 


Met 


Ala 


Gly He He Gly Thr He Leu 


Leu 


He 










50 










55 




60 








tct 


tac 


agt 


att 


cgc 


cga 


etg 


ata 


aag 


gca 


tgaggatgtg gcctgcatgc 


349 


Ser 


Tyr 


Ser 


lie Arg Arg Leu 


He 


Lys 


Ala 














65 










70 















tgcctgatct tgcctagaac cggctgcacc tgctgttctc ttgtttatgc aaactggctg 409 

caectgctat tcctttgctt atgcccctac ccctggctat cctaattccc tgttctcctg 469 

cctcaetatt actgtattet ctacttctaa ataaaaataa aacaaaatac aaaccgttaa 529 

aataaaaaaa aaaaaaaaa 548 



<210> 63 

<211> 722 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 331. .672 

<220> 

<221> sig_peptide 

<222> 331. .426 

<223> Von Heijne matrix 

score 4.21873806574589 

seq LLGSGLFVFSLTA/FN 

<220> 

<221> poiyA_site 
<222> 707. .722 

<220> 

<221> polyA_signal 
<222> 688. .693 

<400> 63 

tttttceteg teetcteccg acagagetga cgtgtcctgg gttccaccgg gagcgggcat 60 

ttecaccgga cgggagggtt cggggtgtcc ggggetgggg aatacgtagg ggttgcegcg 120 

cggtgtgggg agttggggcg tgtggctgca gtccegggag ttcttggagg gggteggeec 180 

acegagctte cggaccggct gatctgcccg tagcttgccg gagggagggc ggagcctgac 240 

tctccgtccc ttctcccatc ccctccagtg gtgggtacgg gcacctcgct ggcgctctcc 300 

tccctcctgt ccctgctgct ctttgctggg atg cag atg tac age cgt cag etg 354 

Met Gin Met Tyr Ser Arg Gin Leu 
-30 -25 



gee 


tec 


aec 


gag 


tgg 


etc 


acc 


ate 


cag 


gge 


gge 


etg 


ctt 


ggt 


teg 


ggt 


402 


Ala 


Ser 


Thr 


Glu 


Trp 


Leu 


Thr 


He 


Gin 


Gly 


Gly 


Leu 


Leu 


Gly 


Ser 


Gly 












-20 










-15 










-10 






etc 


ttc 


gtg 


ttc 


teg 


etc 


act 


gee 


ttc 


aat 


aat 


etg 


gag 


aat 


ctt 


gtc 


450 


Leu 


Phe 


Val 


Phe 
-5 


Ser 


Leu 


Thr 


Ala 


Phe 
1 


Asn 


Asn 


Leu 


Glu 
5 


Asn 


Leu 


Val 




ttt 


gge 


aaa 


gga 


ttc 


caa 


gca 


aag 


ate 


ttc 


cct 


gag 


att 


etc 


etg 


tgc 


498 


Phe Gly 


Lys 


Gly 


Phe 


Gin 


Ala 


Lys 


He 


Phe 


Pro 


Glu 


He 


Leu 


Leu 


Cys 






10 










15 










20 












etc 


etg 


ttg 


get 


etc 


ttt 


gca 


tct 


gge 


etc 


ate 


eac 


cga 


gtc 


tgt 


gtc 


546 


Leu 


Leu 


Leu 


Ala 


Leu 


Phe 


Ala 


Ser 


Gly 


Leu 


He 


His 


Arg 


Val 


Cys 


Val 




25 










30 










35 










40 




ace 


acc 


tgc 


ttc 


ate 


ttc 


tec 


atg 


gtt 


ggt 


etg 


tac 


tac 


ate 


aac 


aag 


594 
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Thr Thr Cys Phe He Phe Ser Met Val Gly Leu Tyr Tyr He Asn Lys 

45 50 55 

aty tec tec acc ctg tac cag gca gca get cca gtc etc aca cea gee 642 
He Ser Ser Thr Leu Tyr Gin Ala Ala Ala Pro Val Leu Thr Pro Ala 

60 65 70 

aag gtc aca gge aag age aag aag aga aac tgaccctgaa tgttcaataa 692 
Lys Val Thr Gly Lys Ser Lys Lys Arg Asn 

75 80 
agttgattct ttgeaaaaaa aaaaaaaaaa 722 

<210> 64 
<211> 479 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 96. .320 

<220> 

<221> sig_peptide 

<222> 96. .200 

<223> Von Heijne matrix 

score 4.14350049367799 

seq MAVAFVLSLGVAA/LY 

<220> 

<221> polyA__site 
<222> 464. .479 

<220> 

<221> polyA_signal 
<222> 437.. 442 

<400> 64 

attetgegcc tgcgcgegte tacageacgg ttegttttte ctttagteag gaaggacgtt 60 
ggtgttgage ataegtatca aggacagtaa ytacc atg get ecc gaa gtt ttg 113 

Met Ala Pro Glu Val Leu 





















-35 










•30 




eea aaa 


cet 


egg 


atg 


egt 


ggc 


ett 


ctg 


gee 


agg 


egt 


ctg 


cga 


aat 


eat 


161 


Pro Lys 


Pro 


Arg 


Met 
-25 


Arg 


Gly 


Leu 


Leu 


Ala 
-20 


Arg 


Arg 


Leu 


Arg 


Asn 
-15 


His 




atg get 


gta 


gca 


tte 


gtg 


eta 


tec 


ctg 


ggg 


gtt 


gca 


get 


ttg 


tat 


aag 


209 


Met Ala 


Val 


Ala 
-10 


Phe 


Val 


Leu 


Ser 


Leu 
-5 


Gly 


Val 


Ala 


Ala 


Leu 
1 


Tyr 


Lys 




ttt egt 


gtg 


get 


gat 


eaa 


aga 


aag 


aac 


gca 


tac 


gca 


gat 


ttc 


tac 


aga 


257 


Phe Arg 


Val 


Ala 


Asp 


Gin 


Arg 


Lys 


Asn 


Ala 


Tyr 


Ala 


Asp 


Phe 


Tyr 


Arg 




5 










10 










15 












aac tac 


gat 


gtc 


atg 


aaa 


gat 


ttt 


gag 


gag 


atg 


agg 


aag 


get 


ggt 


ate 


305 


Asn Tyr 


Asp 


Val 


Met 


Lys 


Asp 


Phe 


Glu 


Glu 


Met 


Arg 


Lys 


Ala 


Gly 


He 




20 








25 










30 










35 





ttt cag agt gta aag taatcttgga atataaagaa tttcttcagg ttgaattacc 360 
Phe Gin Ser Val Lys 
40 



tagaagtttg teactgactt gtgttcctga actatgacac atgaatatgt gggctaagaa 420 
atagttcete ttgataaata aacaattaac aaatactttg gacaaaaaaa aaaaaaaaa 479 

<210> 65 
<211> 1627 
<212> DNA 

<213> Homo sapiens 



<220> 
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<221> CDS 
<222> 90. .413 



<220> 

<221> sigjpeptide 

<222> 90.. 209 

<223> Von Heijne matrix 

score 4.06476262296284 
seq VTAPSMVAPVTFA/SI 

<220> 

<221> polyA_site 
<222> 1612. .1627 



<220> 

<221> polyA_signal 
<222> 1577. ,1582 



<400> 65 

aatctctgcg aaagcttcca ggttcgctca tcgtaggtca ctggaaaaaa ttgtagcatt 



60 



gctaaagagt atttcagaga aaattttgc 


atg 


caa 


aaa 


ttt 


agg 


aag 


atg 


agt 


113 


















Met 


Gin 


Lys 


Phe 


Arg 


Lys 


Met 


Ser 




















-40 










-35 








gaa 


act 


cat 


cat 


tct 


gta 


ate 


tct 


gtg 


aaa 


get 


tec 


agt 


cec 


tgg 


eta 


161 


Glu 


Thr 


His 
-30 


His 


Ser 


Val 


He 


Ser 
-25 


Val 


Lys 


Ala 


Ser 


Ser 
-20 


Pro 


Trp 


Leu 




tct 


tct 


tea 


gtg 


act 


get 


cca 


tec 


atg 


gta 


gee 


cca 


gtc 


act 


ttt 


gca 


209 


Ser 


Ser 
-15 


Ser 


Val 


Thr 


Ala 


Pro 
-10 


Ser 


Met 


Val 


Ala 


Pro 
-5 


Val 


Thr 


Phe 


Ala 




tct 


att 


gta 


gaa 


gaa 


gaa 


eta 


caa 


caa 


gaa 


gca 


get 


ett 


att 


aga 


agt 


257 


Ser 


He 


Val 


Glu 


Glu 


Glu 


Leu 


Gin 


Gin 


Glu 


Ala 


Ala 


Leu 


lie 


Arg 


Ser 




1 








5 










10 










15 






cga 




aaa 


ccg 


ttg 


get 


etg 


att 


cag 


att 


gag 


gag 


cat 


gcc 


ata 


caa 


305 


Arg Glu 


Lys 


Pro 


Leu 


Ala 


Leu 


He 


Gin 


He 


Glu 


Glu 


His 


Ala 


He 


Gin 










20 










25 










30 








gat 


tta 


ttg 


gtt 


ttc 


tat 


gag 


gca 


ttt 


ggc 


aac 


cct 


gaa 


gag 


ttt 


gtc 


353 


Asp 


Leu 


Leu 
35 


Val 


Phe 


Tyr 


Glu 


Ala 
40 


Phe 


Gly 


Asn 


Pro 


Glu 
45 


Glu 


Phe 


Val 




att 


gtt 


gaa 


agg 


aca 


ccg 


cag 


gga 


cca 


ctg 


gca 


gta 


cct 


atg 


tgg 


aat 


401 


He 


Val 

50 


Glu 


Arg 


Thr 


Pro 


Gin 
55 


Gly 


Pro 


Leu 


Ala 


Val 
60 


Pro 


Met 


Trp 


Asn 





aag cat gga tgc tagttcactg tggagttgag atgcatttta cataattatg 

Lys His Gly Cys 

65 

agtttgttca 
taaatagatt 
gattaaaaat 
aaaaacttat 
aagtgattga 
taacttgaga 
gcattgatga 
ttttgtgttt 
tcatgtggte 
cacatattct 
attattctga 
gagtcattgt 
aatcaattgc 
ttaaatagtt 
ttctctgaac 



tataaagaaa 
aagaaaagat 
aaatttctea 
taaggttaaa 
atcataagat 
atttagaaga 
atttaaagta 
ggtgtaatag 
catgtctett 
ccattttgca 
acagtttagt 
gggttatttt 
tgaaagtgtt 
tggaagaggg 
ctcaattgta 
tggtttgttt ctgagtgttt 
ctagataaat tggaaaggtt 
ageaattcac tgctataatt 
gtttaactet gcccaaaage 



agctgtggaa 
ataatttett 
gaattgtgat 
tagaattgtt 
ataagggggg 
ttttgccaag 
ggagcatcag 
ttgtgtcacc 
caaagcttga 
ttgagcagta 
cecaagagaa 
tggtgggatg 
aaaatttctg 
ttggattatc 
tatctagaca 
aaattttgea 



aagagtctta 
tactgeagtt 
tttagtaact 
tettstgaat 
taaagtatta 
tatgtgttgt 
ttattacttc 
attcttgttg 
cetgeacaaa 



gagattttgt 
aaateatata 
ttatataaaa 
ratctaacaa 
aaaataactt 
tgcttgactt 
tgattcatta 
ctttttaaaa 
tgecatattt 



gagtaeagtg gaaagggaat 
tagcgtttta aaaaagaaaa 



gaggatctta 
aaaataaatg 
attaccttta 
taaaaattgt 
aaaacaaatg 



aaaatgecta 
cttaattaea 
caatactgta 
tttctgtata 
ttaaatttgt 



aatatcattc 
atgtttgtat 
tgtgtgagac 
aggaraatat 
ttttgtttga 
cttaaatatg 
atggccagaa 
atcaggctaa 
cyatttggac 
aagaatactg 
acaagatttg 
attgtgagag 
tatacaggaa 
taatcagaag 
ggatgttgtt 
gcttcagtae 



aatgttctag tttctggaag gtaagcctgg gagaeaeata 
tagttgatgt aaaatgaegg aaactggctc aatatgtcag 
ageagacatg taageagatg tgcaataaaa aatgatcttg 



453 



513 
573 
633 
693 
753 
813 
873 
933 
993 
1053 
1113 
1173 
1233 
1293 
1353 
1413 
1473 
1533 
1593 



wo 02/083898 

55 

atccatttca aaaaaaagaa aaaaaaaaaa aaaa 

<210> 66 
<211> 555 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 9. .389 

<220> 

<221> sig_peptide 
<222> 9. .149 
<223> Von Heijne matrix 

score 3.92635967755835 

seq LSSDLQQVGGASA/RI 

<220> 

<221> polyA_site 
<222> 540.. 555 

<220> 

<221> polyA_signal 
<222> 512. .517 

<400> 66 

tgggagtg atg ttc acg cct ctg aca gtg aaa tac gcg tac tac gac act 50 

Met Phe Thr Pro Leu Thr Val Lys Tyr Ala Tyr Tyr Asp Thr 

-45 -40 -35 



gaa 


cgc 


ate 


gga 


gtt 


gac 


ctg 


ate 


atg 


aag 


ace 


tgc 


ttt 


age 


ccc 


aac 


98 


Glu Arg 


lie 


Gly 


Val 


Asp 


Leu 


He 


Met 


Lys 


Thr 


Cys 


Phe 


Ser 


Pro 


Asn 










-30 










-25 










-20 








aga 


gtg 


att 


gga 


etc 


tea 


agt 


gac 


ttg 


eag 


caa 


gta 


gga 


ggg 


gea 


tea 


146 


Arg Val 


He 


Gly 


Leu 


Ser 


Ser 


Asp 


Leu 


Gin 


Gin 


Val 


Gly 


Gly 


Ala 


Ser 








-15 










-10 










-5 










get 


cgc 


ate 


eag 


gat 


gcc 


ctg 


agt 


aca 


gtg 


ttg 


caa 


tat 


gea 


gag 


gat 


194 


Ala Arg 


He 


Gin 


Asp 


Ala 


Leu 


Ser 


Thr 


Val 


Leu 


Gin 


Tyr 


Ala 


Glu 


Asp 






1 








5 










10 










15 




gta 


ctg 


tet 


gga 


aag 


gtg 


tea 


get 


gac 


aat 


act 


gtg 


ggc 


cgc 


ttc 


ctg 


242 


Val 


Leu 


Ser 


Gly 


Lys 
20 


Val 


Ser 


Ala 


Asp 


Asn 
25 


Thr 


Val 


Gly 


Arg 


Phe 
30 


Leu 




atg 


age 


ctg 


gtt 


aac 


caa 


gta 


ceg 


aaa 


ata 


gtt 


ccc 


gat 


gac 


ttt 


gag 


290 


Met 


Ser 


Leu 


Val 

35 


Asn 


Gin 


Val 


Pro 


Lys 
40 


He 


Val 


Pro 


Asp 


Asp 
45 


Phe 


Glu 




acc 


atg 


etc 


aac 


age 


aac 


ate 


aat 


gac 


ctt 


ttg 


atg 


gtg 


acc 


tac 


ctg 


338 


Thr 


Met 


Leu 


Asn 


Ser 


Asn 


He 


Asn 


Asp 


Leu 


Leu 


Met 


Val 


Thr 


Tyr 


Leu 








50 










55 








60 










gee 


aac 


etc 


aca 


cag 


tea 


cag 


att 


gea 


etc 


aat 


gaa 


aaa 


ctt 


gta 


aac 


386 


Ala 


Asn 


Leu 


Thr 


Gin 


Ser 


Gin 


He 


Ala 


Leu 


Asn 


Glu 


Lys 


Leu 


Val 


Asn 





65 70 75 

ctg tgaatggaec ccaagcagta cacttgetgg tctaggtatt aaceeeagga 439 
Leu 
80 

eteagaagtg aaggagaaat gggttttttg tggtcttgag tcaeactgag atagteagtt 499 
gtgtgtgact etaataaaeg gagectacet tttgtaaatt aaaaaaaaaa aaaaaa 555 

<210> 67 
<211> 422 
<212> DNA 

<213> Homo sapiens 



PCT/IBOl/00914 
1627 



<220> 



wo 02/083898 PCT/IBO 1/009 14 

56 

<221> CDS 
<222> 24. .278 

<220> 

<221> sig_peptide 

<222> 24. .119 

<223> Von Heijne matrix 

score 3.85301646018147 

seq LAHIQALISGIEA/QL 

<220> 

<221> polyA_site 
<222> 407. .422 

<220> 

<221> polyA_signal 
<222> 384. .389 

<400> 67 

ttgagctgca gtcacagctg age atg aaa get gcc ttg gaa gac aca ctg gca 53 

Met Lys Ala Ala Leu Glu Asp Thr Leu Ala 
-30 -25 



gaa 


acg 


gag 


gcg 


cgc 


ttt 


gga 


gcc 


cag 


ctg 


gcg 


cat 


ate 


cag 


gcg 


ctg 


101 


Glu 


Thr 


Glu 


Ala 


Arg 


Phe 


Gly 


Ala 


Gin 


Leu 


Ala 


His 


He 


Gin 


Ala 


Leu 








-20 










-15 










-10 










ate 


age 


ggt 


att 


gaa 


gcc 


eag 


ctg 


ggc 


gat 


gtg 


eg a 


get 


gat 


agt 


gag 


149 


He 


Ser 


Gly 


He 


Glu 


Ala 


Gin 


Leu 


Gly 


Asp 


Val Arg 


Ala 


Asp 


Ser 


Glu 






-5 










1 








5 










10 




egg 


cag 


aat 


cag 


gag 


tac 


cag 


egg 


etc 


atg 


gac 


ate 


aag 


teg 


egg 


ctg 


197 


Arg 


Gin 


Asn 


Gin 


Glu 


Tyr 


Gin 


Arg 


Leu 


Met 


Asp 


He 


Lys 


Ser 


Arg 


Leu 












15 










20 










25 






gag 


cag 


gag 


att 


gcc 


ace 


tac 


cgc 


age 


ctg 


etc 


gag 


gga 


cag 


gaa 


gat 


245 


Glu 


Gin 


Glu 


He 


Ala 


Thr 


Tyr 


Arg 


Ser 


Leu 


Leu 


Glu 


Gly 


Gin 


Glu 


Asp 










30 










35 










40 








cac 


tac 


aac 


aat 


ttg 


tct 


gcc 


tec 


aag 


gtc 


etc 


tgaggcagea < 


ggetctgggg 


298 


His 


Tyr 


Asn 


Asn 


Leu 


Ser 


Ala 


Ser 


Lys 


Val 


Leu 















45 50 



ettctgctgt cctttggagg gtgtcttctg ggtagaggga tgggaaggaa gggaecetta 358 
eceeeggcte ttcteetgac etgceaataa aaatttatgg tccaaggeaa aaaaaaaaaa 418 



<210> 68 
<211> 630 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 232. .534 

<220> 

<221> sig_peptide 

<222> 232.. 306 

<223> Von Heijne matrix 

score 3.78142478107177 
seq AKTCLVLCSRVLS/VI 

<220> 

<221> polyA__site 
<222> 615. .630 



<220> 

<221> polyA_signal 



wo 02/083898 
<222> 595. .600 



57 



PCT/IBOl/00914 



<400> 68 

tatcactgtt acgaaccaag gatttacaga tcactggcaa aaattctgag aactttcaca 60 
ccagtatact gtccaagccc attaagtggc atcacacctc tcttttatgt agctcagaca 120 
agacagtcta atatcttcaa aatactactg caatatggaa tcttagaaag agaaaaaaac 180 
cctatcaaca ttgtcttaac aatagtactc tacccttcga gagtaagagt a atg gtt 237 

Met Val 
-25 



gat 


cgt 


gaa 


ttg 


get 


gac 


ate 


eat 


gaa 


gat 


gee 


aaa 


aca 


tgt 


ttg 


gta 


285 


Asp Arg 


Glu 


Leu 


Ala 


Asp 


He 


His 


Glu 


Asp 


Ala 


Lys 


Thr 


Cys 


Leu 


Val 










-20 










-15 










-10 








eta 


tgt 


tec 


aga 


gtg 


ctt 


tet 


gtc 


att 


tea 


gtc 


aag 


gaa 


ata 


aag 


aca 


333 


Leu 


Cys 


Ser 


Arg 


Val 


Leu 


Ser 


Val 

1 


He 


Ser 


Val 


Lys 

5 


Glu 


He 


Lys 


Thr 




cag 


ctg 


-5 
agt 


tta 


gga 


aga 


cat 


eca 


att 


att 


tea 


aat 


tgg 


ttt 


gat 


tac 


381 


Gin 


Leu 


Ser 


Leu 


Gly 


Arg 


His 


Pro 


He 


He 


Ser 


Asn 


Trp 


Phe 


Asp 


Tyr 




10 










15 










20 










25 




att 


cct 


tea 


aca 


aga 


tac 


aaa 


gat 


eca 


tgt 


gaa 


eta 


tta 


cat 


ctt 


tgc 


429 


He 


Pro 


Ser 


Thr 


Arg 


Tyr 


Lys 


Asp 


Pro 


Cys 


Glu 


Leu 


Leu 


His 


Leu 


Cys 












30 










35 










40 






aga 


eta 


acc 


ate 


agg 


aat 


eaa 


eta 


tta 


ace 


aac 


aat 


atg 


etc 


cea 


gat 


477 


Arg 


Leu 


Thr 


He 


Arg 


Asn 


Gin 


Leu 


Leu 


Thr 


Asn 


Asn 


Met 


Leu 


Pro 


Asp 










45 










50 










55 








gga 


ata 


ttt 


tea 


ctt 


eta 


att 


cct 


get 


cgt 


eta 


eaa 


aac 


tat 


ctg 


aat 


525 


Gly 


He 


Phe 


Ser 


Leu 


Leu 


He 


Pro 


Ala 


Arg 


Leu 


Gin 


Asn 


Tyr 


Leu 


Asn 





60 65 70 



tta gaa ate taacataegt eagtgtceta agttccttaa caatgcttac 574 
Leu Glu He 

75 

caatgtatgg cttagaagtt aataaaaatt cacttcatgc aaaaaaaaaa aaaaat 630 

<210> 69 

<211> 730 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 80. .283 

<220> 

<221> sig_peptide 

<222> 80. .220 

<223> Von Heijne matrix 

score 3.72491712201476 

seq PLLSLHSRGGSSS/ES 

<220> 

<221> polyA_site 
<222> 715. .730 

<220> 

<221> polyA_signal 
<222> 697. .702 

<400> 69 

attttgegaa eggcgagcag cggeggcggc gcggagagac gcagcggagg ttttectggt 60 
ttcggacccc agcggccgg atg gtg aaa tec tec ctg cag egg ate etc aat 112 

Met Val Lys Ser Ser Leu Gin Arg He Leu Asn 
-45 -40 
age cac tgc ttc gcc aga gag aag gaa ggg gat aaa ccc age gcc acc 160 
Ser His Cys Phe Ala Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr 



wo 02/083898 

-35 -30 

ate cac gcc age cgc acc atg ccg 

He His Ala Ser Arg Thr Met Pro 

-20 -15 

ggc age age agt gag agt tee agg 

Gly Ser Ser Ser Glu Ser Ser Arg 
1 

ceg ggt ccg ggg cct egg tgg tgc 

Pro Gly Pro Gly Pro Arg Trp Cys 



PCT/IBOl/00914 



58 



-25 



etc eta age ctg eac age cgc ggc 
Leu Leu Ser Leu His Ser Arg Gly 

-10 -5 
gtc tec etc eac tge tgt agt aac 
Val Ser Leu His Cys Cys Ser Asn 
5 10 
tec tgatgcecet eaeceaeecc 
Ser 



15 



20 



tgaagatcce aggtgggcga ggaggaggag tagggcegcc tcggggetgg gcateeggcc 
eetggggcea ececttgtca gccgggtggg taggaaccgt agactegete atctcgcetg 
ggtttgtceg catgttgtaa tcgtgcaaat aaacgctcae teegaattag cggtgtattt 
cttgaagttt aatattgtgt ttgtgataet gaagtatttg ctttaattct aaataaaaat 
ttatatttta cttttttatt gctggtttaa gatgattcag attatccttg tactttgagg 
agaagtttct tatttggagt cttttggaaa cagtcttagt ettttaaett ggaaagatga 
ggtattaate ccctccattg ctctccaaaa gccaataaag tgattaeace eaaaaaaaaa 
aaaaaaa 



208 
256 
303 



363 
423 
483 
543 
603 
663 
723 
730 



<210> 70 

<211> 730 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 80. .283 



<220> 

<221> sig_peptide 

<222> 80. .220 

<223> Von Heijne matrix 

score 3.72491712201476 
seq PLLSLHSRGGSSS/ES 



<220> 

<221> polyA_site 
<222> 715.. 730 



<220> 

<221> polyA signal 
<222> 697.. 702 



<400> 70 

attttgcgaa cggcgagcag cggcggcggc gcggagagac gcagcggagg ttttcctggt 60 
ttcggaccee ageggcegg atg gtg aaa tec tee ctg cag egg ate etc aat 112 

Met Val Lys Ser Ser Leu Gin Arg He Leu Asn 
-45 -40 



age 


eac 


tgc 


ttc 


gee 


aga 


gag 


aag 


gaa 


ggg gat aaa 


cec age 


gcc 


ace 


160 


Ser 


His 


Cys 


Phe 


Ala 


Arg 


Glu 


Lys 


Glu 


Gly Asp Lys 


Pro Ser 


Ala 


Thr 






-35 










-30 






-25 










ate 


cac 


gcc 


age 


cgc 


ace 


atg 


ccg 


etc 


eta age ctg 


cac age 


cgc 


ggc 


208 


He 


His 


Ala 


Ser Arg 


Thr 


Met 


Pro 


Leu 


Leu Ser Leu 


His Ser 


Arg 


Gly 




-20 










-15 








-10 






-5 




ggc 


age 


age 


agt 


gag 


agt 


tee 


agg 


gtc 


tec etc cac 


tge tgt 


agt 


aac 


256 


Gly 


Ser 


Ser 


Ser 


Glu 

1 


Ser 


Ser 


Arg 


Val 

5 


Ser Leu His 


Cys Cys 
1 n 


Ser 


Asn 




ceg 


ggt 


ceg 


ggg 


cct 


egg 


tgg 


tgc 


tec 


tgatgcecet cacceaccce 




303 


Pro Gly 


Pro 


Gly 


Pro 


Arg 


Trp 


Cys 


Ser 













15 20 



tgaagatcce aggtgggcga ggaggaggag tagggcegcc tcggggetgg gcateeggcc 363 
eetggggcea ececttgtca geegggtggg taggaaccgt agactegete atctcgcetg 423 
ggtttgtceg catgttgtaa tcgtgcaaat aaacgctcae teegaattag cggtgtattt 483 



wo 02/083898 PCT/IBO 1/009 14 

59 

cttgaagttt aatattgtgt ttgtgatact gaagtatttg ctttaattct aaataaaaat 543 

ttatatttta cttttttatt gctggtttaa gatgattcag attatccttg tactttgagg 603 

agaagtttct tatttggagt cttttggaaa cagtcttagt cttttaactt ggaaagatga 663 

ggtattaatc ccctccattg ctctccaaaa gccaataaag tgattacacc caaaaaaaaa 723 

aaaaaaa 730 



<210> 71 
<211> 863 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 80. .283 



<220> 

<221> sig_peptide 

<222> 80. .220 

<223> Von Heijne matrix 

score 3.72491712201476 
PLLSLHSRGGSSS/ES 



<220> 

<221> polyA_site 
<222> 844. .863 



<220> 

<221> polyA_signal 
<222> 827. .832 



<400> 71 

attttgcgaa cggcgagcag cggcggcggc gcggagagac gcagcggagg ttttcctggt 60 
ttcggacccc agcggccgg atg gtg aaa tec tec ctg cag egg ate etc aat 112 

Met Val Lys Ser Ser Leu Gin Arg lie Leu Asn 
-45 -40 
age cac tgc ttc gcc aga gag aag gaa ggg gat aaa ceo age gcc aec 160 
Ser His Cys Phe Ala Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr 

-35 -30 -25 

ate cac gcc age cgc acc atg ccg etc eta age ctg cac age cgc ggc 208 
lie His Ala Ser Arg Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly 
-20 -15 -10 -5 

ggc age age agt gag agt tec agg gtc tec etc cac tgc tgt agt aac 256 
Gly Ser Ser Ser Glu Ser Ser Arg Val Ser Leu His Cys Cys Ser Asn 

15 10 
ccg ggt ccg ggg cct egg tgg tgc tec tgatgcccct cacccacccc 303 
Pro Gly Pro Gly Pro Arg Trp Cys Ser 

15 20 
tgaagatecc aggtgggcga gggaatagtc agagggatca caatctttca gctaacttat 363 
tetactccga tgatcggctg aatgtaacag aggaactaac gtccaacgac aagacgagga 423 
tteteaacgt cteagcetct acatcgagat ccegggegge gegctgeccg aggggagcaa 483 
ggacagcttt gcagttctcc tggagttcgc tgaggagcag ctgcgagccg accatgtctt 543 
eatttgettc eacaagaacc gcgaggacag agccgccttg ctecgaacet teagcttttt 603 
gggctttgag attgtgagae eggggeatee cettgtcccc aagagacccg acgettgctt 663 
catggcetac acgttegaga gagagtctte gggagaggag gaggagtagg gccgcctcgg 723 
ggctgggcat ccggcccctg gggccacccc ttgtcagccg ggtgggtagg aaccgtagac 783 
tcgctcatct cgcctgggtt tgtccgcatg ttgtaatcgt gcaaataaac gctcactccg 843 
aattaaaaaa aaaaaaaaaa 863 



<210> 72 
<211> 730 
<212> DNA 

<213> Homo sapiens 



wo 02/083898 PCT/IBO 1/009 14 

60 

<220> 
<221> CDS 
<222> 80. .283 



<220> 

<221> sig_peptide 

<222> 80. .220 

<223> Von Heijne matrix 

score 3.72491712201476 
PLLSLHSRGGSSS/ES 



<220> 

<221> polyA_site 
<222> 715.. 730 



<220> 

<221> polyA_signal 
<222> 697. .702 



<400> 72 

attttgcgaa cggcgagcag cggcggcggc gcggagagac gcagcggagg ttttcctggt 60 
ttcggacccc agcggccgg atg gtg aaa tec tec ctg cag egg ate etc aat 112 

Met Val Lys Ser Ser Leu Gin Arg He Leu Asn 
-45 -40 
age cac tgc ttc gee aga gag aag gaa ggg gat aaa ccc age gee ace 160 
Ser His Cys Phe Ala Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr 

-35 -30 -25 

ate cac gcc age egc acc atg eeg etc eta age ctg cac age cge gge 208 
He His Ala Ser Arg Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly 
-20 -15 -10 -5 

gge age age agt gag agt tee agg gtc tee etc cay tgy tgt agt aas 256 
Gly Ser Ser Ser Glu Ser Ser Arg Val Ser Leu His Cys Cys Ser Xaa 

15 10 
ccg ggt eeg ggg cet egg tgg tgc tec tgatgccect caeccacccc 303 
Pro Gly Pro Gly Pro Arg Trp Cys Ser 

15 20 
tgaagatcce aggtgggega ggaggaggag tagggeegce tcggggetgg geatccggee 363 
eetggggcca eeeettgtea gccgggtggg taggaaeegt agaetegetc atctegeetg 423 
ggtttgtceg eatgttgtaa tcgtgeaaat aaaegctcac tecgaattag eggtgtattt 483 
cttgaagttt aatattgtgt ttgtgatact gaagtatttg etttaattct aaataaaaat 543 
ttatatttta cttttttatt gctggtttaa gatgattcag attatccttg taetttgagg 603 
agaagtttct tatttggagt cttttggaaa cagtcttagt cttttaactt ggaaagatga 663 
ggtattaatc ccetccattg ctctccaaaa gccaataaag tgattacacc caaaaaaaaa 723 
aaaaaaa 730 



<210> 73 
<211> 879 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 9. .395 



<220> 

<221> sig_peptide 
<222> 9. .56 

<223> Von Heijne matrix 

score 3.61791418904325 
LLLLLE<ALRRGPG/PG 



<220> 

<221> polyA_site 



wo 02/083898 
<222> 864. .879 



61 



PCT/IBO 1/009 14 



<400> 73 

aggccaac atg gcc gtg ctg ctg ctg ctg etc cgt gcc etc cgc egg ggt 50 
Met Ala Val Leu Leu Leu Leu Leu Arg Ala Leu Arg Arg Gly 

98 



146 









-15 








-10 








-5 






cca 


ggc 


ccg 


ggt 


cct 


egg 


ccg 


ctg 


tgg 


ggc 


cca .ggc 


ccg 


gee 


tgg 


agt 


Pro Gly 


Pro 


Gly 


Pro 


Arg 


Pro 


Leu 


Trp 


Gly 


Pro 


Gly 


Pro 


Ala 


Trp 


Ser 






1 








5 










10 










cca 


ggg 


ttc 


ccc 


gcc 


agg 


ccc 


ggg 


agg 


ggg 


egg 


ccg 


tac 


atg 


gee 


age 


Pro Gly 


Phe 


Pro 


Ala 


Arg 


Pro 


Gly 


Arg 


Gly 


Arg 


Pro 


Tyr 


Met 


Ala 


Ser 


15 










20 










25 










30 


agg 


cct 


ccg 


ggg 


gac 


etc 


gcc 


gag 


get 


gga 


acre 


ccra 


get 


ctg 


cag 


age 


Arg 


Pro 


Pro 


Gly 


Asp 


Leu 


Ala 


Glu 


Ala 


Gly 


Gly 


Arg 


Ala 


Leu 


Gin 


Ser 










35 










40 










45 




tta 


caa 


ttg 


aga 


ctg 


eta 


ace 


cct 


acc 


ttt 


gaa 


ggg 


ate 


aac 


gga 


ttg 


Leu 


Gin 


Leu 


Arg 


Leu 


Leu 


Thr 


Pro 


Thr 


Phe 


Glu 


Gly 


He 


Asn 


Gly 


Leu 








50 










55 










60 






ttg 


ttg 


aaa 


caa 


cat 


tta 


gtt 


cag 


aat 


eea 


gtc 


aga 


etc 


tgg 


caa 


ett 


Leu 


Leu 


Lys 


Gin 


His 


Leu 


Val 


Gin 


Asn 


Pro 


Val 


Arg 


Leu 


Trp 


Gin 


Leu 






65 










70 










75 








tta 


ggt 


ggt 


act 


ttc 


twt 


ttt 


aac 


acc 


tea 


agg 


ttg 


aag 


cak 


aag 


aat 


Leu Gly 


Gly 


Thr 


Phe 


xaa 


Phe 


Asn 


Thr 


Ser 


Arg 


Leu 


Lys 


Xaa 


Lys 


Asn 




80 










85 










90 










aag 


gag 


aag 


gat 


aag 


teg 


aag 


ggg 


aag 


gcg 


cct 


gaa 


gag 


gac 


gaa 


ggt 


Lys 


Glu 


Lys 


Asp 


Lys 


Ser 


Lys 


Gly 


Lys 


Ala 


Pro 


Glu 


Glu 


Asp 


Glu 


Gly 


95 










100 










105 










110 



194 



242 



290 



338 



386 



ata ttc ate tgatgttctt cagtcagtag ctgcctctgg atgtetttae 435 
He Phe He 

atttctgttt tccttttagc aaggtgaaac cagtctggaa aatggggaga tgggccgggt 4 95 

gcagtggctc acacttgtaa tcgaaacgct ttgggaggce caggtggaag gatcacttga 555 

ggcetatace acatagcgag accctgtcte actgcaaatt aaaaggctgg gcgtggtggc 615 

tcaeacetgt aatcccagca ctttgggagg ctgaggeagg cggatcacct gcaccetggc 675 

caacatggtg aaaccccgtc tttactaaaa atagaaaatt agccgggcgt gatggcacac 735 

gcctgtaatc ccagctaetc gggaggctga ggcaggagaa ttgettgaac ctgggaggtg 795 

gaggttgctg tgagtggaga tcatgecatt gcactccagc ctgagcaaca agagcaaaac 855 

tccatctcaa aaaaaaaaaa aaaa 879 



<210> 74 

<211> 879 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 9. .395 



<220> 

<221> sig_peptide 
<222> 9. .56 

<223> Von Heijne matrix 

score 3.61791418904325 
LLLLLRALRRGPG/PG 



<220> 

<221> polyA site 
<222> 864.. 879 



<400> 74 

aggccaac atg gcc gtg ctg ctg ctg ctg etc cgt gcc etc cgc egg ggt 50 
Met Ala Val Leu Leu Leu Leu Leu Arg Ala Leu Arg Arg Gly 
-15 -10 -5 

cca ggc ccg ggt cct egg ccg ctg tgg ggc cca ggc ccg gee tgg agt 98 
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62 



Pro 


Gly 


Pro 
1 


Gly 


Pro 


Arg 


Pro 
5 


Leu 


Trp 


Gly 


Pro 


Gly 
10 


Pro 


Ala 


Trp 


Ser 
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ggg 


ttc 


ccc 


gcc 


agg 


ccc 


ggg 


agg 


ggg 


egg 


ccg 


tac 


atg 


gcc 


age 


146 


Pro 


Gly 


Phe 


Pro 


Ala 


Arg 


Pro 


Gly 


Arg 


Gly 


Arg 


Pro 


Tyr 


Met 


Ala 


Ser 




15 










20 










25 










30 




agg 


cct 


ccg 


ggg 


gac 


etc 


gcc 


gag 


get 


gga 


ggc 


ega 


get 


ctg 


cag 


age 


194 


Arg 


Pro 


Pro 


Gly 


Asp 
35 


Leu 


Ala 


Glu 


Ala 


Gly 
40 


Gly 


Arg 


Ala 


Leu 


Gin 
45 


Ser 




tta 


caa 


ttg 


aga 


ctg 


eta 


acc 


cct 


acc 


ttt 


gaa 


ggg 


ate 


aac 


gga 


ttg 


242 


Leu 


Gin 


Leu 


Arg 
50 


Leu 


Leu 


Thr 


Pro 


Thr 
55 


Phe 


Glu 


Gly 


lie 


Asn 
60 


Gly 


Leu 




ttg 


ttg 


aaa 


caa 


cat 


tta 


gtt 


cag 


aat 


cca 


gtc 


aga 


etc 


tgg 


caa 


ett 


290 


Leu 


Leu 


Lys 
65 


Gin 


His 


Leu 


Val 


Gin 
70 


Asn 


Pro 


Val 


Arg 


Leu 
75 


Trp 


Gin 


Leu 




tta 


ggt 


ggt 


act 


ttc 


tat 


ttt 


aac 


acc 


tea 


agg 


ttg 


aag 


cag 


aag 


aat 


338 


Leu Gly 


Gly 


Thr 


Phe 


Tyr 


Phe 


Asn 


Thr 


Ser 


Arg 


Leu 


Lys 


Gin 


Lys 


Asn 






80 










85 










90 












aag 


gag 


aag 


gat 


aag 


teg 


aag 


ggg 


aag 


geg 


cct 


gaa 


gag 


gac 


gaa 


ggt 


386 


Lys 


Glu 


Lys 


Asp 


Lys 


Ser 


Lys 


Gly 


Lys 


Ala 


Pro 


Glu 


Glu 


Asp 


Glu 


Gly 




95 










100 










105 










110 





ata ttc ate tgatgttctt cagtcagtag ctgcctctgg atgtetttac 435 
He Phe He 
rtttctgttt wccttttagc 
gcagtggctc acaettgtaa 
ggcctatacc acatagetag 
teaeaeetgt aateceagca 
eaaeatggtg aaaeeecgtc 
gectgtaate ceagctactc 
gaggttgctg tgagtggaga 
tccatcecaa aaaaaaaaaa 

<210> 75 
<211> 464 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .342 

<220> 

<221> sig peptide 
<222> 16.7114 
<223> Von Heijne matrix 

score 3.66786754931606 

seq LAVFQMLKSMCAG/QR 

<220> 

<221> polyA_site 
<222> 446.. 464 

<220> 

<221> polyA signal 
<222> 412. .417 

<400> 75 

gaatgegcet gegtt atg gcg gee tec ggc gee cca agg ate ctg gtg gac 51 
Met Ala Ala Ser Gly Ala Pro Arg He Leu Val Asp 
-30 -25 
ctg ctg aag ctg aac gtg gee ccc etc gcc gtc ttc cag atg etc aag 99 
Leu Leu Lys Leu Asn Val Ala Pro Leu Ala Val Phe Gin Met Leu Lys 

-20 -15 -10 

tec atg tgt gee ggg cag agg eta gcg age gag ccc cag gac cct gcg 14 7 



aaggtgaaac cagtctggam aatggggaga tgggccgggt 495 

tcgaaacget ttgggaggce eaggtggaag gatcacttga 555 

aeectgtctc actgeaaatt aaaaggctgg gegtggtgge 615 

etttgggagg ctgaggcagg eggatcacct geaeeetggc 675 

tttactaaaa atagaaaatt ageegggegt gatggcacac 735 

gggaggctga ggeaggagaa ttgcttgaae ctgggaggtg 795 

tcatgeeatt gcactecagc etgageaaca agagcaaaac 855 

aaaa 879 
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63 
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Arg 
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10 
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tct 
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age 
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cga 


ggg 


aga 


aac 


195 


Ala 


Val 


Ser 


Leu 


Pro 


Thr 


Ser 


Ser 


Val 


Pro 


Glu 


Thr 


Arg 


Gly 


Arg 


Asn 










15 










20 










25 








aaa 


ggc 


age 


get 


gcc 


etc 


ggg 


gga 


gca 


ttg 


gcc 


ctg 


gcg 


gaa 


cgm 


rrc 


243 


Lys Gly 


Ser 


Ala 


Ala 


Leu 


Gly 


Gly 


Ala 


Leu 


Ala 


Leu 


Ala 


Glu 


Arg 


Xaa 








30 










35 










40 










age 


cgc 


gaa 


gga 


tec 


age 


cag 


agg 


atg 


cca 


cgc 


cag 


ccc 


age 


get 


ace 


291 


Ser 


Arg 


Glu 


Gly 


Ser 


Ser 


Gin 


Arg 


Met 


Pro 


Arg 


Gin 


Pro 


Ser 


Ala 


Thr 






45 










50 










55 












agg 


ctg 


ccc 


aag 


ggg 


ggc 


ggg 


cct 


ggg 


aag 


age 


cct 


aca 


egg 


ggc 


age 


339 


Arg 


Leu 


Pro 


Lys 


Gly 


Gly 


Gly 


Pro 


Gly 


Lys 


Ser 


Pro 


Thr 


Arg 


Gly 


Ser 




60 










65 










70 










75 





ace taggatgggg cagagacttg ttgeatcttt gtceecagca aaggetaeat 392 

Thr 

gttacctcct tcaattgata ataaacettt ctgagatgea gagggtceag gtcaaaaaaa 452 

aaaaaaaaaa aa 464 

<210> 76 
<211> 624 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 59. .367 

<220> 

<221> sig_peptide 

<222> 59. .166 

<223> Von Heijne matrix 

score 3.66643500261364 

seq RALSTFLFGSIRG/AA 

<220> 

<221> polyA__site 
<222> 609., 624 

<220> 

<221> polyA__signal 
<222> 588.. 593 

<400> 76 

agtgcgcaga cgcaggggte ggcgcegggt gagagcgtgc ggccggattc accacaac 58 



atg 


gca 


aat 


ett 


ttt 


ata 


agg 


aaa 


atg 


gtg 


aac 


cct 


ctg 


etc 


tat 


etc 


106 


Met 


Ala 
-35 


Asn 


Leu 


Phe 


He 


Arg 
-30 


Lys 


Met 


Val 


Asn 


Pro 
-25 


Leu 


Leu 


Tyr 


Leu 




cgt 


cgt 
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cga 
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etc 
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Arg 


Arg 


His 
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Arg 


Ala 


Leu 


Ser 
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Phe 


Leu 


Phe 


Gly 
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-10 










-5 
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gtg 


get 
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He 
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Gly 
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1 
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Val 
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5 
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10 
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ett 
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Arg 


Ser 


Leu 
15 
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Leu 
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His 


Leu 


Leu 

25 


Pro 


Ala 


Leu 
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aag 


gae 
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tac 


298 


Gly 


Phe 


Lys 


Asn 


Lys 


Thr 


Val 


Leu 


Lys 


Lys 


Arg Cys 


Lys 
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Cys 


Tyr 
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35 
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ctg 


gtg 


aag 


agg 


egg 


ggt 


egg 
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tac 


gtc 


tac 


tgt 


aaa 


ace 


eat 


ecg 


346 


Leu 


Val 


Lys 


Arg 


Arg 


Gly 


Arg 


Trp 


Tyr 


Val 


Tyr 


Cys 


Lys 


Thr 


His 


Pro 




45 










50 










55 










60 
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64 

agg cac aag cag aga cag atg tagacccttt ccctccagag tcacgcacat 397 
Arg His Lys Gin Arg Gin Met 
65 

actcgtcatc gcatcacttg ggagaatggt tgtatcttat ggaaggaatt atcacatcaa 4 57 

ggagtcaggg gaaagtgact ggaagcaaac gccctaaaag ttacccatca cgtttcagtg 517 

taaatgagta actatagaag acattgcgtt atcttatttc maaaacgttc caactaaaaa 577 

acattttcct attaaaatag accttccgaa caaaaaaaaa aaaaaaa 624 

<210> 77 
<211> 1158 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 58. .315 



<220> 

<221> sig_peptide 

<222> 58.. 165 

<223> Von Heijne matrix 

score 3.68684848854945 
seq SVSLSGFTVGTLS/ET 



<220> 

<221> polyA_site 
<222> 1143.. 1158 



<400> 77 

ctgtgggaga ggccaggatg ctgcattagg cacaggataa 
atg ggt cct get etc cga agt ctg caa gte aag 
Met Gly Pro Ala Leu Arg Ser Leu Gin Val Lys 

-35 -30 
gcc gac cct etc cct ttc ccc tct gtc tct ctt 
Ala Asp Pro Leu Pro Phe Pro Ser Val Ser Leu 
-20 -15 -10 

ggt acc ctg tea gaa acc age act ggg ggc cct 
Gly Thr Leu Ser Glu Thr Ser Thr Gly Gly Pro 



cctgggaacc caggcac 
aag gga aca gag cac 
Lys Gly Thr Glu His 
-25 

agt ggc ttt aca gtg 
Ser Gly Phe Thr Val 

-5 

gcc acc ccc aca tgg 
Ala Thr Pro Thr Trp 
10 

get gag agt gac aag 
Ala Glu Ser Asp Lys 
25 

ttt ttc age acc cag 
Phe Phe Ser Thr Gin 
40 

ca tgcacaaata 



aag gag tgt cct ate tgt aag gag cge ttt cct 
Lys Glu Cys Pro lie Cys Lys Glu Arg Phe Pro 

15 20 
gat gcc ctg gag gac cac atg gat gga cac ttc 
Asp Ala Leu Glu Asp His Met Asp Gly His Phe 

30 35 
gac ccc ttc acc ttt gag tgatcttact ccctcgta 
Asp Pro Phe Thr Phe Glu 
45 50 

cacactcatg cacacacaca ctcacacaca tgcatacact taggtttcat gcccattttc 
tatcacactg ggctccatga tattctgttc cctaagaact gcttctgtgt gccctgtttt 
catcccaaga tttctcactt catcctctec tacetggetc ttttgtecca gggaggggtc 
ctgttcggaa geagtggetg aatttatecc ctgaaagtgg ttttggagga aecgggatgg 
aggaggcctt cccctgtggg aatagaatcg tccactccta gccctggttg cttctgatac 
acagccactg cacacacaca etcacactca cactcecttg tytgatgccc caaagccaat 
tcctggggca ccctaccctc tcttatttgg agtttccgtt ggtttacctg agttttctct 
ggggtctgca cagaggcagc agcatggaca tcatggcctc tcaggtccct tttggttctc 
agtttcattg gttcctcttt ctgttccccc attgacttct gtgccccacc ctagcctttt 
ccataacctt aggtattcag tttggagggg ttttttgtat ttttgaggat tcctgtattc 
tgtatcctct cctcgcatct ectcacatgg aaagaaataa tgtatttgtg ccttctgtga 
ggaatggggg gaacaagtgg tcccaggtat ccccatttcc aaggcccccc tccctctcca 
ggtcccccca cagcaataaa agcttccccc tgatatccat ccctttgtag tttgaacaaa 
tatatttata tgatatgaaa aaaaaaaaaa aaa 



57 
105 



153 



201 



249 



297 



345 



405 
465 
525 
585 
645 
705 
765 
825 
885 
945 
1005 
1065 
1125 
1158 
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<210> 78 
<211> 641 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 187.. 342 



<220> 

<221> sig_peptide 

<222> 187.. 276 

<223> Von Heijne matrix 

score 3.57432920610252 
seq VKLLLKMVSSATT/ER 



<220> 

<221> polyA^site 
<222> 626.. 641 



<400> 78 

attctaaatc aataaatata gcttacccac agacttttta tgggatgtaa agtatgtatt 60 

acaacaggta aatccatttc tctgttttgg gaaagcaggt taatgttttg tttgttttgt 120 

tgctgcatat aatgttgcac tatatatata ttctaatgca gttggtctcc taattctaca 180 
ggataa atg tct age tgt ggc att gtt gga tea tea gtt teg ttt eag 228 
Met Ser Ser Cys Gly He Val Gly Ser Ser Val Ser Phe Gin 
-30 -25 -20 

tta gat get gtg aaa ttg etc ttg aaa atg gtg tec tet gee ace aca 276 

Leu Asp Ala Val Lys Leu Leu Leu Lys Met Val Ser Ser Ala Thr Thr 

-15 -10 -5 

gaa egg tgt tgt aat gga agt gee aat ttt cat aaa aac ttg tgt gca 324 

Glu Arg Cys Cys Asn Gly Ser Ala Asn Phe His Lys Asn Leu Cys Ala 
1 5 10 .15 
aca ggt att aag aat ttt tgaggtcggg cgcagtagct eacgcetgta 372 
Thr Gly He Lys Asn Phe 
20 

ateccagcac tttgggaggc ctaggcaggt ggatcacctg aggteaggag ttceagacea 432 

gcctgaccaa catggtgaaa teccatctcc actaaataea aaaaattagc tgggegtggt 4 92 

ggcatacace tgtaatccea gctacttggg aggctaagac aggagaattg ettgaaccca 552 

ggaggcggag gttgcagtga gecgagattg caecactaca ctecagcctg ggtgatgage 612 
gaaactccat ctcaaaaaaa aaaaaaaaa 641 



<210> 79 

<211> 660 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 2.. 658 



<220> 

<221> sig_peptide 

<222> 2. .121 

<223> Von Heijne matrix 

score 6.4143157541603 
ITMSATVVDAVNA/AP 



<400> 79 

t cet awt cga aaa att aeg aga ttg get tgg att etg teg gat gga ett 4 9 
Pro Xaa Arg Lys He Thr Arg Leu Ala Trp He Leu Ser Asp Gly Leu 
-40 -35 -30 -25 

ggg get age tgc ggc ggg get gga gga ggc cag ata ace atg tea gee 97 
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Met 




10 
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aag 
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gaa 


cag 


cga 


aaa 


Lys 


Lys 


Glu 
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Glu 


Glu 


Gin 


Arg 


Lys 


25 










30 








gtt 


gat 


gaa 


gaa 


gga 


aaa 


gac 


ate 


aac 


Val Asp 


Glu 


Glu 


Gly Lys Asp 

A e; 


He 


Asn 


tct 


tea 


gtg 


cca 


tgg 


tat 


att 


gat 


cct 


Ser 


Ser 


Val 


Pro 


Trp 


Tyr 


He 


Asp 


Pro 








60 










65 


cac 


cag 


aga 


cca 


caa 


cca 


gaa 


aaa 


caa 


His 


Gin 


Arg 


Pro 


Gin 


Pro 


Glu 


Lys 


Gin 






75 










80 




gaa 


tgg 


tac 


aag 


agg 


ggt 


gta 


aaa 


gag 


Glu 


Trp 


Tyr Lys 


Arg Gly Val 


Lys 


Glu 




90 










95 






cgc 


aaa 


gga 


gca 


tgt 


gaa 


aat 


tgt 


ggg 


Arg 


Lys 


Gly Ala 


Cys 


Glu 


Asn 


Cys 


Gly 


105 










110 








gac 


tgc 


ttt 


gag 


aga 


cct 


agg 


cga 


gtt 


Asp 


Cys 


Phe 


Glu 


Arg 


Pro Arg 


Arg 


Val 










125 










aat 


ata 


get 


eca 


gat 


gaa 


eat 


gtc 


cag 


Asn 


lie 


Ala 


Pro 


Asp Glu His 


Val 


Gin 








140 










145 


gat 


ggg 


aag 


agg 


gat 


egg 


tgg 


aat 


ggc 


Asp Gly 


Lys 


Arg 


Asp Arg 


Trp 


Asn 


Gly 






155 










160 




aaa 


att 


gtt 


gaa 


gag 


tat 


gee 


aaa 


gtt 


Lys 


He 


Val 


Glu 


Glu 


Tyr 


Ala 


Lys 


Val 




170 










175 







<210> 80 
<211> 647 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 1. .645 

<220> 

<221> sig^peptide 

<222> 1. .108 

<223> Von Heijne matrix 

score 5.52542399647426 

ITMSATVXDAVNA/AP 

<400> 80 

att acg aga ttg get tgg att ctg tck 

He Thr Arg Leu Ala Trp He Leu Ser 

-35 -30 

ggc ggg get gga gga ggc cag ata ace 

Gly Gly Ala Gly Gly Gly Gin He Thr 
-20 -15 

gca gtt aat get gca ccc eta teg ggg 
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Gly 


Gin 


He 


Thr 


Met 


Ser 


Ala 




-15 










-10 






ccc 


eta 


teg 


ggg 


tee 


aaa 


gaa 


145 


Pro 


Leu 


Ser 


Gly 
5 


Ser 


Lys 


Glu 




ace 


aga 


gag 


gac 


tgg 


aga 


aag 


193 


Thr 


Arg 


Glu 


Asp 


Trp 


Arg 


Lys 








20 












ttg 


ggc 


aat 


get 


cct 


gea 


gaa 


241 


Leu 


Gly 


Asn 


Ala 


Pro 


Ala 


Glu 






35 










40 




ccc 


cat 


att 


cct 


cag 


tat 


att 


289 


Pro 


His 


He 


Pro 


Gin 


Tyr 


He 




50 










55 






tea 


aaa 


aga 


cct 


act 


tta 


aaa 


337 


Ser 


Lys 


Arg 


Pro 


Thr 


Leu 


Lys 












70 








aag 


cag 


ttc 


age 


tea 


tct 


gga 


385 


Lys 


Gin 


Phe 


Ser 


Ser 


Ser 


Gly 










85 










aat 


tec 


ata 


att 


act 


aag 


tac 


433 


Asn 


Ser 


He 


He 


Thr 


Lys 


Tyr 








100 












gcc 


atg 


aca 


cac 


aaa 


aag 


aaa 


481 


Ala 


Met 


Thr 


His 


Lys 


Lys 


Lys 






115 










120 




gga 


gee 


aaa 


ttt 


aca 


ggt 


act 


529 


Gly 


Ala 


Lys 


Phe 


Thr 


Gly 


Thr 




130 










135 






cct 


caa 


ctg 


atg 


ttt 


gac 


tat 


577 


Pro 


Gin 


Leu 


Met 


Phe 


Asp 


Tyr 












150 








tac 


aat 


cca 


gaa 


gaa 


cac 


atg 


625 


Tyr 


Asn 


Pro 


Glu 


Glu 


His 


Met 










165 










gat 


ttg 


gc 










660 


Asp 


Leu 














gat 


gga 


ett 


ggg 


get 


age 


tgc 


48 


Asp 


Gly 


Leu 


Gly 


Ala 


Ser 


Cys 








-25 












atg 


tea 


gcc 


aca 


gtt 


gtm 


gat 


96 


Met 


Ser 


Ala 


Thr 


Val 


Val 


Asp 






-10 










-5 




tec 


aaa 


gaa 


atg 


agt 


ttg 


gaa 


144 



wo 02/083898 



Ala 


Val 


Asn 


Ala 


Ala 
1 


Pro 


Leu 


Ser 


Gly 
5 


gaa 


cca 


aag 


aag 


atg 


acc 


aga 


gag 


gac 


Glu 


Pro 


Lys 


Lys 


Met 


Thr 


Arg 


Glu 


Asp 






15 










20 




gaa 


gaa 


cag 


cga 


aaa 


ttg 


ggc 


aat 


get 


Glu 


Glu 


Gin 


Arg 


Lys 


Leu 


Gly 


Asn 


Ala 




30 










35 






gga 


aaa 


gac 


ate 


aac 


ccc 


cat 


att 


cct 


Gly Lys 


Asp 


He 


Asn 


Pro 


His 


He 


Pro 


45 










50 








tgg 


tat 


att 


gat 


cct 


tea 


aaa 


aga 


cct 


Trp 


Tyr 


lie 


Asp 


Pro 


Ser 


Lys 


Arg 


Pro 










65 










caa 


cca 


gaa 


aaa 


caa 


aag 


cag 


ttc 


age 


Gin 


Pro 


Glu 


Lys 


Gin 


Lys 


Gin 


Phe 


Ser 








80 










85 


agg 


ggt 


gta 


aaa 


gag 


aat 


tec 


ata 


att 


Arg 


Gly 


Val 


Lys 


Glu 


Asn 


Ser 


He 


He 






95 










100 




tgt 


gaa 


aat 


tgt 




gcc 


atg 


aca 


cae 


Cys 


Glu 


Asn 


Cys 


Gly 


Ala 


Met 


Thr 


His 




110 










115 






aga 


act 


agg 


cga 


gtt 


gga 


gcc 


aaa 


ttt 


Arg 


Pro 


Arg 


Arg 


Val 


Gly 


Ala 


Lys 


Phe 


125 










130 








gat 


gaa 


cat 


gtc 


cag 


cct 


caa 


ctg 


atg 


Asp 


Glu 


His 


Val 


Gin 


Pro 


Gin 


Leu 


Met 










145 










gat 


egg 


tgg 


aat 


ggc 


tac 


aat 


cca 


gaa 


Asp Arg 


Trp 


Asn 


Gly 


Tyr 


Asn 


Pro 


Glu 








160 










165 


gag 


tat 


gcc 


aaa 


gtt 


gat 


ttg 


gc 




Glu 


Tyr 


Ala 


Lys 


Val 


Asp 


Leu 







175 



<210> 81 

<211> 643 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 1. .642 

<220> 

<221> sig_peptide 

<222> 1..108 

<223> Von Heijne matrix 

score 6.4143157541603 

seq ITMSATVVDAVNA/AP 

<400> 81 

att aeg aga ttg get tgg att ctg teg 
He Thr Arg Leu Ala Trp He Leu Ser 

-35 -30 
ggc ggg get gga gga ggc cag ata acc 
Gly Gly Ala Gly Gly Gly Gin He Thr 
-20 -15 
gca gtt aat get gea cee eta teg ggg 
Ala Val Asn Ala Ala Pro Leu Ser Gly 

1 5 
gaa cca aag aag atg acc aga gag gac 
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Ser 


Lys 


Glu 


Met 


Ser 


Leu 


Glu 












10 








tgg 


aga 


aag 


aag 


aag 


gag 


eta 


192 


Trp 


Arg 


Lys 


Lys 


Lys 


Glu 


Leu 










25 










cct 


gca 


gaa 


gtt 


gat 


gaa 


gaa 


240 


Pro 


Ala 


Glu 


Val 


Asp 


Glu 


Glu 








40 












cag 


tat 


att 


tct 


tea 


gtg 


cca 


288 


Gin 


Tyr 


He 


Ser 


Ser 


Val 


Pro 






55 










60 




act 


tta 


aaa 


cac 


cag 


aga 


cca 


336 


Thr 


Leu 


Lys 


His 


Gin 


Arg 


Pro 




70 










75 






tea 


tct 


gga 


gaa 


tgg 


tac 


aag 


384 


Ser 


Ser 


Gly 


Glu 


Trp 


Tyr 


Lys 












90 








act 


aag 


tac 


egc 


aaa 


gga 


gca 


432 


Thr 


Lys 


Tyr 


Arg 


Lys 


Gly 


Ala 










105 










aaa 


aag 


aaa 


gac 


tgc 


ttt 


gag 


480 


Lys 


Lys 


Lys 


Asp 


Cys 


Phe 


Glu 








120 












aca 


ggt 


act 


aat 


ata 


get 


cca 


528 


Thr 


Gly 


Thr 


Asn 


He 


Ala 


Pro 






135 










140 




ttt 


gac 


tat 


gat 


ggg 


aag 


agg 


576 


Phe 


Asp 


Tyr 


Asp 


Gly 


Lys 


Arg 




150 










155 






gaa 


cac 


atg 


aaa 


att 


gtt 


gaa 


624 


Glu 


His 


Met 


Lys 


He 


Val 


Glu 












170 






















647 


gat 


gga 


ctt 


ggg 


get 


age 


tgc 


48 


Asp 


Gly 


Leu 


Gly 


Ala 


Ser 


Cys 








-25 












atg 


tea 


gcc 


aca 


gtt 


gta 


gat 


96 


Met 


Ser 


Ala 


Thr 


Val 


Val 


Asp 






-10 










-5 




tec 


aaa 


gaa 


atg 


agt 


ttg 


gaa 


144 


Ser 


Lys 


Glu 


Met 


Ser 


Leu 


Glu 












10 








tgg 


aga 


aag 


aag 


aag 


gag 


eta 


192 
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Glu Pro 


Lys 


Lys 


Met 


Thr 


Arg 


Glu 


Asp 


Trp Arg 


Lys 


Lys 


Lys Glu 


Leu 






15 










20 










25 








gaa gaa 


cag 


cga 


aaa 


ttg 


ggc 


aat 


get 


cct 


gea 


gaa 


gtt 


gat gaa 


gaa 


240 


Glu Glu 


Gin 


Arg 


Lys 


Leu 


Gly Asn 


Ala 


Pro 


Ala 


Glu 


Val 


Asp Glu Glu 




30 










35 










40 










gga aaa 


gac 


ate 


aac 


ccc 


eat 


att 


cct 


cag 


tat 


att 


tct 


tea gtg 


cca 


288 


Gly Lys 


Asp 


He 


Asn 


Pro 


His 


He 


Pro 


Gin 


Tyr 


He 


Ser 


Ser Val 


Pro 




45 








50 










55 








60 




tgg tat 


att 


gat 


cct 


tea 


aaa 


aga 


cct 


act 


tta 


aaa 


cae 


cag aga 


cca 


336 


Trp Tyr 


He 


Asp 


Pro 


Ser 


Lys Arg 


Pro 


Thr 


Leu 


Lys 


His 


Gin Arg 


Pro 










65 










70 








75 






caa cca 


gaa 


aaa 


caa 


aag 


caa 


ttc 


age 


tea 


tet 


gga 


gaa 


tgg tac 


aag 


384 


Gin Pro 


Glu 


Lys 


Gin 


Lys 


Gin 


Phe 


Ser 


Ser 


Ser Gly Glu 


Trp Tyr 


Lys 








80 










85 










90 






agg ggt 


gta 


aaa 


gag 


aat 


tec 


ata 


att 


act 


aag 


tac 


cgc 


aaa gga 


gca 


432 


Arg Gly 


Val 


Lys 


Glu 


Asn 


Ser 


He 


He 


Thr 


Lys 


Tyr Arg 


Lys Gly Ala 






95 










100 










105 








tgt gaa 


aat 


tgt 




gcc 


atg 


aca 


cae 


aaa 


aag 


raa 


gac 


tge ttt 


gag 


480 


Cys Glu 


Asn 


Cys 


Gly 


Ala 


Met 


Thr 


His 


Lys 


Lys 


Xaa Asp Cys Phe 


Glu 




110 










115 










120 










aga cct 


agg 


cga 


gtt 


gga 


gcc 


aaa 


ttt 


aca 


ggt 


act 


aat 


ata get 


cca 


528 


Arg Pro 


Arg 


Arg 


Val 


Gly 


Ala 


Lys 


Phe 


Thr 


Gly Thr Asn 


He Ala 


Pro 




125 








130 










135 








140 




gat gaa 


cat 


gtc 


cag 


cct 


caa 


ctg 


atg 


ttt 


gac 


tat 


gat 


ggg aag 


agg 


576 


Asp Glu 


His 


Val 


Gin 


Pro 


Gin 


Leu 


Met 


Phe 


Asp 


Tyr Asp 


Gly Lys 


Arg 










145 










150 








155 






gat egg 


tgg 


aat 


ggc 


tac 


aat 


cca 


gaa 


gaa 


cae 


atg 


aaa 


att gtt 


gaa 


624 


Asp Arg 


Trp 


Asn 
160 


Gly 


Tyr 


Asn 


Pro 


Glu 
165 


Glu 


His 


Met 


Lys 


He Val 
170 


Glu 




gag tat 


gcc 


aaa 


gtt 


gat 


t 


















643 


Glu Tyr 


Ala 
175 


Lys 


Val 


Asp 























<210> 82 

<211> 1200 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 29. .469 

<22C> 

<221> sig_peptide 

<222> 29. .73 

<223> Von Heijne matrix 

score 4.5958301326157 

seq VLVSXCSARLLQQ/EE 

<220> 

<221> polyA_site 
<222> 1185. .1200 

<400> 82 

aettggegag tgagacgctg atgggagg atg gac gta ctg gtg tct rag tge 52 

Met Asp Val Leu Val Ser Xaa Cys 
-15 -10 
tec gcg egg ctg ctg cag cag gaa gaa gag att aaa tct ctg act get 100 
Ser Ala Arg Leu Leu Gin Gin Glu Glu Glu He Lys Ser Leu Thr Ala 

-5 15 
gaa att gac egg ttg aaa aac tgt ggc tgt tta gga get tet cca aat 148 
Glu He Asp Arg Leu Lys Asn Cys Gly Cys Leu Gly Ala Ser Pro Asn 
10 15 20 25 
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ttg gag cag tta caa gaa gaa aat tta aaa tta 
Leu Glu Gin Leu Gin Glu Glu Asn Leu Lys Leu 

30 35 
att ctt cga aag agt ctt cag gca gaa agg aac 
lie Leu Arg Lys Ser Leu Gin Ala Glu Arg Asn 

45 50 
atg att aac att att age cgc eta caa gag gtc 
Met lie Asn lie He Ser Arg Leu Gin Glu Val 



aag tat cga 
Lys Tyr Arg 



aaa 
Lys 

ttt 



60 

aag get gca tat 
Lys Ala Ala Tyr 
75 



65 

cca gat ttg gaa aat cct cct 
Pro Asp Leu Glu Asn Pro Pro 
80 



agt cag cag gcc aag ttt ggg gac tat cag tgt 

Ser Gin Gin Ala Lys Phe Gly Asp Tyr Gin Cys 

90 95 100 

att tct cag gtg atg tat tgt cat gac tet tgg 

He Ser Gin Val Met Tyr Cys His Asp Ser Trp 



110 



115 



aag tat tat tat cat cat tgc cat tta cag aaa 

Lys Tyr Tyr Tyr His His Cys His Leu Gin Lys 

125 130 
tccttagtga aaaggacatt tgccacagtt tgaaaaactt 
atatgtttta acttttttag gcacaatttt taaggtttgg 
caatatttaa gggcaatcat tggtactctt ttgtttaggt 
aggaktgctg tgtggtggtr atgagtgctg ggaggtgaaa 
cagtcagcat cccaattaaa tatttgatgt aactgtgatc 
tcattttcaa gcagaggagt tccccatttt aaatagaggc 
ttaactgcat ctggcttggg tctttctgtt ttcctttett 
ctctgaagag tccaggtctt acagtgtggt ttatttctca 
tcatggcttg aaaattagaa atgtaaatat gctggaacca 
tgttatgtca tcattaaaaa gttcgaggtg ttgtaatccc 
caggagaatt gcttgaaccc aggggcggag gttacagtga 
ctccagcctg ggcaacagag cgagactcca tctctaaaaa 



cca act 
Pro Thr 

55 
ggt cat 
Phe Gly His 
70 

ctg eta gtg 
Leu Leu Val 
85 

aat agt get 
Asn Ser Ala 

ctg ttt gat 
Leu Phe Asp 

taatactatt 



ctg aat 

Leu Asn 
40 

aaa aat 

Lys Asn 

gca att 
Ala He 

aca cca 
Thr Pro 

atg ggt 
Met Gly 
105 
ttt ttt 
Phe Phe 
120 
acaagttgta 



gagaaaggag 
ttaaatttta 
atttccctcc 
aattaaaata 
tttgagccag 
attgtctgat 
tgctgaattg 
agtgtgaata 
gacttcataa 
agctactagg 
gccgagatca 
aaaaaaaaaa 



ttgggggggt 
tatgtattet 
tgctgtgtcc 
agccatttac 
gcttatatat 
gtgtttatgg 
gaaggggtta 
ttgcacacct 
ctgtagactt 
gaggctgagg 
caccactgca 
a 



196 



244 



292 



340 



388 



436 



489 



549 
609 
669 
729 
789 
849 
909 
969 
1029 
1089 
1149 
1200 



<210> 83 
<211> 351 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 1..351 



<220> 

<221> sig_peptide 

<222> 1,,129 

<223> Von Heijne matrix 

score 4.10872570449087 
LGQVAAAAAAANA/AS 



<220> 

<221> polyA_site 
<222> 336.. 351 



<220> 

<221> polyA_signal 
<222> 306. .311 



<400> 83 

ctt tct cca get caa gcg tct ggt att tgg tgt agg gac age get ttt 48 

Leu Ser Pro Ala Gin Ala Ser Gly He Trp Cys Arg Asp Ser Ala Phe 

-40 -35 -30 

tec ggg tgg age gag egt gga tec agt teg egg egg ggt tgt ttg ggt 96 
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Ser 


Gly 


Trp 


Ser 


Glu 


Arg 


Gly Ser Ser 


Ser Arg Arg Gly Cys 


Leu Gly 








-25 








-20 








-15 










caa 


gtt 


get 


get 


get 


get 


gcg get gcg 


aat 


get 


get 


tet 


cct 


cct 


cct 


144 


Gin 


Val 
-10 


Ala 


Ala 


Ala 


Ala 


Ala Ala Ala 
-5 


Asn 


Ala 


Ala 
1 


Ser 


Pro 


Pro 


Pro 
5 




tea 


gcg 


aae 


age 


ctg 


gga 


teg ggt ggt 


cge 


tgc 


aag 


ctg 


agg 


get 


ccg 


192 


Ser 


Ala 


Asn 


Ser 


Leu 


Gly 


Ser Gly Gly 


Arg 


Cys 


Lys 


Leu Arg 


Ala 


Pro 












10 






15 










20 






acg 


age 


egc 


ecg 


tee 


eag 


teg egg ccc 


ega 


tec 


ctg 


ccc 


eag 


gee 


egg 


240 


Thr 


Ser 


Arg 


Pro 
25 


Ser 


Gin 


Ser Arg Pro 
30 


Arg 


Ser 


Leu 


Pro 


Gin 
35 


Ala 


Arg 




ttc 


ccc 


ccg 


teg 


ccg 


ctg 


ccg cct tet 


cct 


gca 


gga 


acg 


agt 


tgc 


aca 


288 


Phe 


Pro 


Pro 


Ser 


Pro 


Leu 


Pro Pro Ser 


Pro 


Ala 


Gly Thr 


Ser 


Cys 


Thr 








40 








45 








50 










tgt 


tac 


tgc 


ttc 


eta 


eta 


ata aat get 


gae 


ctg 


ate 


aaa 


tgg 


age 


eca 


336 


Cys 


Tyr 
55 


Cys 


Phe 


Leu 


Leu 


lie Asn Ala 
60 


Asp 


Leu 


He 
65 


Lys 


Trp 


Ser 


Pro 




aaa 


aaa 


aaa 


aaa 


aaa 




















351 


Lys 


Lys 


Lys 


Lys 


Lys 























70 



<210> 84 

<211> 438 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 15.. 278 

<220> 

<221> sig_peptide 

<222> 15., 122 

<223> Von Heijne matrix 

score 3.79790641648006 
seq ILMRDFSPSGIFG/AF 

<220> 

<221> polyA site 
<222> 423.. 438 

<220> 

<221> polyA__signal 
<222> 396, .401 

<400> 84 



accaggactc caaa 


atg 


gcg 


tea 


gtt 


gta 


eca 


gtg 


aag 


gae aag aaa 


ctt 


50 










Met 


Ala 


Ser 


Val 


Val 


Pro 


Val 


Lys 


Asp Lys Lys 


Leu 














-35 










-30 






-25 




ctg 


gag 


gtc 


aaa 


ctg 


ggg 


gag 


ctg 


cca 


age 


tgg 


ate 


ttg atg egg 


gae 


98 


Leu 


Glu 


Val 


Lys 


Leu 


Gly 


Glu 


Leu 


Pro 


Ser 


Trp 


He 


Leu Met Arg 


Asp 












-20 










-15 






-10 






ttc 


agt 


cct 


agt 


ggc 


att 


ttc 


gga 


gcg 


ttt 


caa 


aga 


ggt tac tac 


egg 


146 


Phe 


Ser 


Pro 


Ser 
-5 


Gly 


He 


Phe 


Gly Ala 


Phe Gin Arg Gly Tyr Tyr Arg 




tac 


tac 


aae 


aag 


tac 


ate 


aat 


gtg 


aag 


aag 


ggg 


age 


ate teg ggg 


att 


194 


Tyr 


Tyr 


Asn 


Lys 


Tyr 


He 


Asn 


Val 


Lys 


Lys Gly Ser 


He Ser Gly 


He 






10 










15 










20 








ace 


atg 


gtg 


ctg 


gca 


tgc 


tac 


gtg 


etc 


ttt 


age 


tac 


tee ttt tec 


tac 


242 


Thr 


Met 


Val 


Leu 


Ala 


Cys 


Tyr 


Val 


Leu 


Phe 


Ser 


Tyr 


Ser Phe Ser 


Tyr 




25 










30 










35 






40 




aag 


cat 


etc 


aag 


cac 


gag 


egg 


etc 


egc 


aaa 


tac 


cac 


tgaagaggac 




288 


Lys 


His 


Leu 


Lys 


His 


Glu 


Arg 


Leu Arg 


Lys 


Tyr 


His 
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45 50 

acactctgca cccccccacc ccacgacctt ggcccgagcc cctccgtgag gaacacaatc 348 

tcaatcgttg ctgaatcctt tcatatccta ataggaatta acctccaaat aaaacatgac 408 
tggtacgtgt gctgaaaaaa aaaaaaaaaa 438 

<210> 85 
<211> 419 
<212> DNA 

<213> Homo sapiens 



<220> . 
<221> CDS 
<222> 74.. 259 



<220> 

<221> sig_peptide 

<222> 74. .199 

<223> Von Heijne matrix 

score 3.56683458636822 
seq CXXSCCSCCPVGC/AK 



<220> 

<221> polyA_site 
<222> 404. .419 



<220> 

<221> polyA_signal 
<222> 388.. 393 



<400> 85 

cactccgcct tccacgtgca cccactgcct cttcccttct cgcttgggaa ctctagtctc 60 
gcctcgggtt gca atg gac ccc aac tgc tec tgt gcc get gca ggt gtc 109 
Met Asp Pro Asn Cys Ser Cys Ala Ala Ala Gly Val 
-40 -35 
tec tgc ace tgc gcc age tee tgc aag tgc aaa gag tgc aaa tgc aec 157 
Ser Cys Thr Cys Ala Ser Ser Cys Lys Cys Lys Glu Cys Lys Cys Thr 
-30 -25 -20 -15 

tec tgc aar rag age tgc tgc tec tgc tgc cct gtg ggc tgt gcc aag 205 
Ser Cys Lys Xaa Ser Cys Cys Ser Cys Cys Pro Val Gly Cys Ala Lys 

-10 -5 1 

tgt gcc cag ggc tgc ate tgc aaa ggg gca teg gag aag tgc age tgc 253 
Cys Ala Gin Gly Cys lie Cys Lys Gly Ala Ser Glu Lys Cys Ser Cys 

5 10 15 

tgc gcc tgatgteggg acagcectge tcccaagtac aaatagagtg aeecgtaaaa 309 
Cys Ala 
20 

tecaggattt tttgtttttt gctacaatet tgaceecttt gctacattec tttttttctg 369 
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<210> 89 

<211> 544 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 141. .494 



<220> 
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<221> polyA^site 
<222> 529. .544 
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<221> polyA^signal 
<222> 508. .513 
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agtctcgcga taactgcgca ggcgcggacc aaagcgatct cttctgagga tccggcaaga 60 
tggcagaagt atagcagaag aagaagcgga ccttccgcaa gttcacctac cgcggcgtgg 120 
acctcgacca gctgctggac atg tec tac gag cag ctg atg cag ctg tac agt 173 

Met Ser Tyr Glu Gin Leu Met Gin Leu Tyr Ser 
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gcg cgc cag egg egg egg ctg aac egg ggc ctg egg egg aag cag cac 221 
Ala Arg Gin Arg Arg Arg Leu Asn Arg Gly Leu Arg Arg Lys Gin His 
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<210> 91 
<211> 302 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 3. .302 

<220> 

<221> poiyA_site 
<222> 281. .302 

<220> 

<221> polyA_signal 
<222> 230. .235 

<400> 91 

ac gta egg cag ggc ggg gtc gge ceg cag tgg cea get ggg age tac 47 
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Glu 








90 










95 










100 










aag 


gcg 


gag 


ttg 


gcc 


tea 


tec 


age 


age 


aag 


tta 


ggg 


ctg 


aag 


cct 


gcg 


510 


Lys 


Ala 


Glu 


Leu 


Ala 


Ser 


Ser 


Ser 


Ser 


Lys 


Leu 


Gly 


Leu 


Lys 


Pro 


Ala 




105 










110 










115 












agt 


cct 


ggg 


cct 


eea 


agt 


get 


9gg 


ccc 


teg 


atg 


aag 


agt 


gac 


gag 


gat 


558 


Ser 


Pro 


Gly 


Pro 


Pro 


Ser 


Ala 


Gly 


Pro 


Ser 


Met 


Lys 


Ser 


Asp 


Glu 


Asp 





wo 02/083898 



77 



120 125 130 

aag gat gat gta aca ggg aca ata aca gaa gcc gaa ga 
Lys Asp Asp Val Thr Gly Thr lie Thr Giu Ala Glu 
140 145 

<210> 94 

<211> 1413 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 24. .470 

<220> 

<221> polyA site 
<222> 1398.71413 



PCT/IBOl/00914 
135 

596 



<400> 94 
gggttcccgt 



tccccgcgga gcc atg 
Met 

1 



egg tac aac 
Arg Tyr Asn 



gag 

Glu 
5 



aag 
Lys 



gag 
Glu 



ctg 
Leu 



cag get 
Gin Ala 

10 



53 



ctg 


tec 


egg 


eag 


ceg 


gcc 


gag 


atg 


geg 


gcc 


gag 


ctg 


ggc 


atg 


agg 


ggc 


101 


Leu 


Ser 


Arg 


Gin 


Pro 


Ala 


Glu 


Met 


Ala 


Ala 


Glu 


Leu 


Gly 


Met 


Arg 


Gly 












15 










20 










25 






ccc 


aag 


aag 


ggc 


age 


gtg 


ctg 


aag 


egg 


egg 


ctg 


gtg 


aag 


ctg 


gtg 


gtg 


149 


Pro 


Lys 


Lys 


Gly 


Ser 


Val 


Leu 


Lys 


Arg 


Arg 


Leu 


Val 


Lys 


Leu 


Val 


Val 










30 










35 










40 








aat 


ttc 


etc 


ttc 


tae 


ttt 


egg 


aca 


gac 


gag 


gcc 


gag 


ccc 


gtc 


gga 


gcc 


197 


Asn 


Phe 


Leu 


Phe 


Tyr 


Phe 


Arg 


Thr 


Asp 


Glu 


Ala 


Glu 


Pro 


Val 


Gly 


Ala 








45 










50 










55 










ctg 


ctg 


ctg 


gag 


cgc 


tgc 


aga 


gtc 


gtc 


egg 


gaa 


gag 


ccc 


ggc 


ace 


ttc 


245 


Leu 


Leu 


Leu 


Glu 


Arg 


Cys 


Arg 


Val 


Val 


Arg 


Glu 


Glu 


Pro 


Gly 


Thr 


Phe 






60 










65 










70 












tec 


ate 


age 


ttc 


att 


gag 


gac 


cct 


gag 


agg 


aag 


tat 


cac 


ttt 


gag 


tgc 


293 


Ser 


He 


Ser 


Phe 


He 


Glu 


Asp 


Pro 


Glu 


Arg 


Lys 


Tyr 


His 


Phe 


Glu 


Cys 




75 










80 










85 










90 




age 


age 


gag 


gag 


eag 


tgt 


cag 


gag 


tgg 


atg 


gag 


get 


ctg 


cgt 


egg 


gee 


341 


Ser 


Ser 


Glu 


Glu 


Gin 


Cys 


Gin 


Glu 


Trp 


Met 


Glu 


Ala 


Leu 


Arg 


Arg 


Ala 












95 










100 










105 






age 


tac 


gag 


ttc 


atg 


egg 


aga 


age 


etc 


ate 


ttc 


tac 


agg 


aac 


gaa 


ate 


389 


Ser 


Tyr 


Glu 


Phe 


Met 


Arg 


Arg 


Ser 


Leu 


He 


Phe 


Tyr 


Arg 


Asn 


Glu 


He 










110 










115 










120 








egg 


aag 


gtg 


acg 


ggc 


aag 


gac 


ccc 


ctg 


gaa 


eag 


ttc 


ggc 


ata 


tee 


gag 


437 


Arg 


Lys 


Val 


Thr 


Gly 


Lys 


Asp 


Pro 


Leu 


Glu 


Gin 


Phe 


Gly 


He 


Ser 


Glu 








125 










130 










135 










gag 


gee 


agg 


ttc 


eag 


ctg 


agt 


ggc 


ttg 


eag 


geg 


tgagcgcagg gcacggtggt 


490 


Glu 


Ala 


Arg 


Phe 


Gin 


Leu 


Ser 


Gly 


Leu 


Gin 


Ala 















140 
eagcgtgcag 
tctgttttgt 
gtgcagtggt 
cteagettec 
ggattttcag 
gggcagggge 
ecacctaact 
tgtcttgttt 
gtaaagettt 
ttgactttat 
aatcaetett 
eaetttggaa 



145 

cgggaeggga etggccctgc 
tttggttttt ggtttttggg 
geeacagetc actgtgacct 
caagtagegg ggatcacagg 
tgggaggttt ttggtttgga 



tgcattttat gaaactgccc 
ggcttcaget tttgtcatat 
atgagaeeat tcagtggtga 
etagtactct taaaataact 
tattttagta gtatgcctat 
aagaatecat geagcaggec 
ggecgagaea ggeggatcac 
caacacagtg aaacectgtc tetactaaaa 
gcgeetgtaa tcccagctac tcaggaggct 



ccageeatga ategcttgge catgcctgga 550 

tcagggtttc actgtgttgc ccaggctaga 610 

tgaecttctg gaetcaagtg atcctcctgc 670 

catgagtege cacacccgge eatcaeacet 730 

gacatceaaa gcctgaagcc aggtgggtgt 790 

agcaagetge gctecctggg gccccaggat 850 

gggtaaaaat taaagatgtc attgaactac 910 

aetgtttctg gctgataggt tatgagatat 970 

aaatggagta ttatatatea attcatatea 1030 

agaaaatatt atggactcag agtgtcataa 1090 

aggeacagtg gctcacaeet gtaatgcctg 1150 

ttgaggtcag gagtttgaaa ceagceagge 1210 

atacaaaagg ttagccgggc atggtggeag 1270 

gaggeaggag aattgcttga acgcaggagg 1330 



wo 02/083898 PCT/IBO 1/009 14 

78 

caaaggttgc agtgagctga gatcacgcca ctgcactcca gcctgggcaa cagacctcga 1390 
ctccatcaaa aaaaaaaaaa aaa 1413 



<210> 95 

<211> 519 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 211. .519 



<220> 

<221> polyA_site 
<222> 504, .519 



<400> 95 

agattcttaa ctacgtgtct tacagcaaag ttgcccatgg gaacaaaggc tgtagggtgt 60 

cattgatctt gtaatacaac acatatttga tgggtgaatc ttattagcac tcctttgaaa 120 

atgttccagt atggagtcag ttgctgataa agaggaagtt cacagctagg aatccgaagt 180 

taagatttct caattgccgc ctgggcactc atg cct gta gtc cca gca ctt ggg 234 

Met Pro Val Val Pro Ala Leu Gly 



agg 


cca 


agg 


tgg gca gat 


cac 


ctg 


1 

agg 


teg 


gga 


gtt 


5 

egg 


gac 


cag 


cct 


282 


Arg 


Pro 


Arg Trp Ala Asp 


His 


Leu Arg 


Ser Gly Val 


Arg 


Asp 


Gin 


Pro 






10 






15 










20 












ggc 


caa 


ect 


ggt gaa gcc 


ccc 


cca 


tct 


eta 


eta 


aaa 


ata 


caa 


aaa 


ttg 


330 


Gly 


Gin 


Pro 


Gly Glu Ala 


Pro 


Pro 


Ser 


Leu 


Leu 


Lys 


He 


Gin 


Lys 


Leu 




25 






30 










35 










40 




gcc 


ggg 


tat 


ggt ggt ggc 


tgc 


ctg 


tgg 


tec 


eag 


eta 


etc 


ggg 


agg 


ctg 


378 


Ala 


Gly 


Tyr Gly Gly Gly 


Cys 


Leu 


Trp 


Ser 


Gin 


Leu 


Leu 


Gly 


Arg 


Leu 










45 








50 










55 






agg 


egg 


gag 


aat cac ttg 


age 


ccg 


gga 


ggc 


gga 


ggt 


tgc 


agt 


gag 


ccg 


426 


Arg 


Arg 


Glu 


Asn His Leu 


Ser 


Pro Gly Gly Gly Gly 


Cys 


Ser 


Glu 


Pro 










60 






65 










70 








aga 


ttg 


tgc 


cac tgc act 


cca 


gcc 


tgg 


gtg 


aca 


gag 


caa 


gac 


tec 


ate 


474 


Arg 


Leu 


Cys 


His Cys Thr 


Pro 


Ala 


Trp Val 


Thr 


Glu 


Gin 


Asp 


Ser 


He 








75 






80 










85 










tea 


aaa 


ata 


gaa aag att 


tct 


caa 


ttg 


cca 


aaa 


aaa 


aaa 


aaa 


aaa 




519 


Ser 


Lys 


lie 


Glu Lys He 


Ser 


Gin 


Leu 


Pro 


Lys 


Lys 


Lys 


Lys 


Lys 







90 95 100 



<210> 96 
<211> 347 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 73.. 306 



<220> 

<221> polyA_site 
<222> 332.. 347 



<220> 

<221> polyA_signal 
<222> 305.. 310 



<400> 96 

actccgcctt ccacgtgcac ccactgcctc ttcccttctc gcttgggaac tctagtctcg 60 
cctcgggttg ca atg gac ecc aac tgc tec tgt gcc get ggt gtc tec tgc 111 
Met Asp Pro Asn Cys Ser Cys Ala Ala Gly Val Ser Cys 
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1 








5 








10 








acc 


tgc 


gcc 


age 


tec 


tgc 


aag 


tgc 


aaa 


gag tgc 


aaa 


tgc 


acc 


tec 


tgc 


159 


Thr 


Cys 


Ala 


Ser 


Ser 


Cys 


Lys 


Cys 


Lys 


Glu Cys 


Lys 


Cys 


Thr 


Ser Cys 






15 










20 








25 












aag 


aag 


age 


tge 


tgc 


tec 


tgg 


tge 


act 


ggc ccc 


acc 


etc 


gtg 


gac 


ace 


207 


Lys Lys 


Ser 


Cys 


Cys 


Ser 


Trp 


Cys 


Thr 


Gly Pro 


Thr 


Leu 


Val 


Asp 


Thr 




30 










35 








40 










45 




tgc 


cct 


gcc 


ctg 


cca 


cct 


gtc 


tgt 


ctg 


tec caa 


aga 


agt 


tet 


ggt 


atg 


255 


Cys 


Pro 


Ala 


Leu 


Pro 


Pro 


Val 


Cys 


Leu 


Ser Gin 


Arg 


Ser 


Ser 


Gly Met 












50 










55 








60 






aac 


ttg 


agg 


aca 


cat 


gtc 


cag 


tgg 


gag 


gtg aga 


cca 


cct 


etc 


aat 


att 


303 


Asn 


Leu 


Arg 


Thr 
65 


His 


Val 


Gin 


Trp 


Glu 
70 


Val Arg 


Pro 


Pro 


Leu 
75 


Asn 


lie 





caa taaagctget gagaatetag cetcgaadaa aaaaaaaaaa a 347 - 

Gin 



<210> 97 
<211> 477 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 120.. 395 

<220> 

<221> polyA__site 
<222> 462. .477 

<400> 97 

gcttctttet agtgtggacc agaggtgggg gcaatgagta aacatcttca ctgtaagggt 60 
tttgggactg cccctcaggg ettgaggtet eggeategtg tcteagtcag tccttggeg 119 



atg 


tee 


cct 


cag 


aeg 


cac 


tea 


cag 


act 


tgc 


ata 


egt 


aat 


tta 


gta 


aca 


167 


Met 


Ser 


Pro 


Gin 


Thr 


His 


Ser 


Gin 


Thr 


Cys 


He 


Arg 


Asn 


Leu 


Val 


Thr 




1 








5 










10 










15 






tgc 


ate 


aac 


tat 


cct 


aga 


aca 


tet 


aca 


ggg 


tgc 


aaa 


gga 


acc 


acc 


act 


215 


Cys 


lie 


Asn 


Tyr 
20 


Pro 


Arg 


Thr 


Ser 


Thr 
25 


Gly 


Cys 


Lys 


Gly 


Thr 
30 


Thr 


Thr 




cag 


aga 


ate 


atg 


gag 


cca 


gtg 


gag 


tta 


gaa 


gtg 


gaa 


ggg 


aca 


gaa 


caa 


263 


Gin Arg 


He 


Met 


Glu 


Pro 


Val 


Glu 


Leu 


Glu 


Val 


Glu 


Gly 


Thr 


Glu 


Gin 








35 










40 










45 










gae 


aat 


get 


aaa 


acc 


tgt 


ggt 


tea 


eta 


gga 


agg 


ggg 


aat 


gaa 


aac 


ace 


311 


Asp Asn 


Ala 


Lys 


Thr 


Cys 


Gly 


Ser 


Leu 


Gly 


Arg 


Gly 


Asn 


Glu 


Asn 


Thr 






50 










55 










60 












atg 


etc 


cga 


ggt 


gga 


ttc 


age 


atg 


aac 


aca 


act 


gtg 


ggg 


caa 


gga 


att 


359 


Met 


Leu 


Arg 


Gly 


Gly 


Phe 


Ser 


Met 


Asn 


Thr 


Thr 


Val 


Gly 


Gin 


Gly 


He 




65 










70 










75 










80 




tec 


aag 


caa 


aca 


cae 


cac 


act 


agt 


ace 


act 


tet 


tee 


taaacagaaa 




405 


Ser 


Lys 


Gin 


Thr 


His 
85 


His 


Thr 


Ser 


Thr 


Thr 

90 


Ser 


Ser 













agggaaagag catcettcga sttccccctc ectccatcag caccgggttt tectccaaaa 465 
aaaaaaaaaa aa 477 



<210> 98 

<211> 363 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 117. .269 



<220> 
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80 

<221> polyA_site 
<222> 348. .363 



<220> 

<221> polyA_signal 
<222> 329. .334 



<400> 98 

cctttctcgt tccccggcca tcttatcggc tgctgttggt tgggggccgt cccgctccta 60 
aggcaggaag atggtggccg caaagaagac ggaaatctga aatagagtac tatgct atg 119 

Met 
1 

ttg get aaa act ggt gtc cat cac tac agt ggc aat aat att gaa ctg 167 
Leu Ala Lys Thr Gly Val His His Tyr Ser Gly Asn Asn lie Glu Leu 

5 10 15 

ggc aca gca tgc gga aaa tac tac aga gtg tgc aca ctg get ate att 215 
Gly Thr Ala Cys Gly Lys Tyr Tyr Arg Val Cys Thr Leu Ala lie lie 

20 25 30 

gat cca ggt gac tct gac ate att aga age atg cca gaa cag act ggt 263 
Asp Pro Gly Asp Ser Asp lie lie Arg Ser Met Pro Glu Gin Thr Gly 

35 40 45 

gaa aag taaacetttt cacctacaaa atttcacctg caaaecttaa acetgcaaaa 319 
Glu Lys 
50 

ttttcettta ataaaatttg ettgttetaa aaaaaaaaaa aaaa 363 



<210> 99 
<211> 606 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 115. .411 



<220> 

<221> polyA_site 
<222> 591,, 606 

<220> 

<221> polyA_signal 
<222> 573. .578 



<400> 99 

aagaaagggg tgaggcctaa gggacaatca ggatgttttt cagagagaag tgtggatgct 60 
ggacaggaag aaccaeagat accagatacg ggtaetgttg taactetgtt ctce atg 117 

Met 
1 



aaa 


aaa 


aag 


gaa 


gaa 


aca 


aca 


ett 


tea 


gag 


atg 


gag 


cet 


gtt 


gag 


cca 


165 


Lys 


Lys 


Lys 


Glu 


Glu 


Thr 


Thr 


Leu 


Ser 


Glu 


Met 


Glu 


Pro 


Val 


Glu 


Pro 










5 










10 










15 








cag 


tac 


caa 


eta 


gtc 


aat 


get 


gaa 


teg 


act 


tct 


ccc 


ttt 


eta 


cat 


tgc 


213 


Gin 


Tyr 


Gin 


Leu 


Val 


Asn 


Ala 


Glu 


Ser 


Thr 


Ser 


Pro 


Phe 


Leu 


His 


Cys 








20 










25 










30 










ctg 


aga 


gaa 


gtc 


att 


ggg 


gaa 


tac 


tct 


gta 


cac 


gaa 


ttt 


tea 


ctg 


ttg 


261 


Leu 


Arg 


Glu 


Val 


He 


Gly 


Glu 


Tyr 


Ser 


val 


His 


Glu 


Phe 


Ser 


Leu 


Leu 






35 










40 










45 












ggg 


aaa 


aca 


gag 


agt 


caa 


ggg 


att 


gga 


ttg 


tgg 


att 


gca 


ttg 


gtg 


gtt 


309 


Gly Lys 


Thr 


Glu 


Ser 


Gin 


Gly 


He 


Gly 


Leu 


Trp 


lie 


Ala 


Leu 


Val 


Val 




50 










55 










60 










65 




ttc 


etc 


agt 


ttc 


etc 


ate 


ttc 


tec 


aca 


agt 


ttc 


tac 


ata 


teg 


aat 


gca 


357 


Phe 


Leu 


Ser 


Phe 


Leu 


He 


Phe 


Ser 


Thr 


Ser 


Phe 


Tyr 


He 


Ser 


Asn 


Ala 





70 75 80 
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8i 

gag cag ccc ttc ttc aaa gaa cct cct acg gaa get get aag gaa ete 405 
Glu Gin Pro Phe Phe Lys Glu Pro Pro Thr Glu Ala Ala Lys Glu Leu 

85 90 95 

agt ctg tagctctgcg tggagccatg tgtaaacact gaactgagac ctgccacctc 4 61 
Ser Leu 

ctactaccta agggcccatt tteatctgat atcatccccc agaaacaaac tcatgatgac 521 

ttccatgttt tttttagatt agataeatgg agaattttcc tttcccttag aattaaaatc 581 

ctgcattcta aaaaaaaaaa aaaat 606 



<210> 100 
<211> 648 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 125.. 508 



<220> 

<221> polyA_site 
<222> 633.. 648 



<220> 

<221> polyA_signal 
<222> 610. .615 



<4a0> 100 

agetgeetgg gtaaggeeca agatggetgt cttegcctta gtactegtgt gaagttggea 60 
gggaeggttc etgteatett ettgggetta tttggtgtgc tgttgaaggg gggagaetag 120 
agaa atg gca ggg aac etc tta tec ggg gca ggt agg cgc ctg tgg gae 169 
Met Ala Gly Asn Leu Leu Ser Gly Ala Gly Arg Arg Leu Trp Asp 





1 








5 










10 










15 




tgg 


gtg 


cct 


ctg 


gcg 


tgc 


aga 


age 


ttc 


tct 


Ctt 


ggt 


gtg 


cct 


aga 


ttg 


217 


Trp Val 


Pro 


Leu 


Ala 


Cys 


Arg 


Ser 


Phe 


Ser 


Leu 


Gly 


Val 


Pro 


Arg 


Leu 












20 










25 










30 






ate 


ggt 


ata 


agg 


etc 


act 


etc 


eeg 


ccc 


eee 


aaa 


gtg 


gtt 


gat 


cgt 


tgg 


265 


He 


Gly 


He 


Arg 


Leu 


Thr 


Leu 


Pro 


Pro 


Pro 


Lys 


Val 


Val 


Asp 


Arg 


Trp 










35 










40 










45 








aac 


gag 


aaa 


agg 


gcc 


atg 


ttc 


gga 


gtg 


tat 


gae 


aac 


ate 


ggg 


ate 


ctg 


313 


Asn 


Glu 


Lys 


Arg 


Ala 


Met 


Phe 


Gly 


Val 


Tyr 


Asp 


Asn 


He 


Gly 


He 


Leu 








50 










55 










60 










gga 


aac 


ttt 


gaa 


aag 


cac 


ccc 


aaa 


gaa 


Ctg 


ate 


agg 


ggg 


ccc 


ata 


tgg 


361 


Gly Asn 


Phe 


Glu 


Lys 


His 


Pro 


Lys 


Glu 


Leu 


He 


Arg 


Gly 


Pro 


He 


Trp 






65 










70 










75 












ctt 


cga 


ggt 


tgg 


aaa 


ggg 


aat 


gaa 


ttg 


caa 


cgt 


tgt 


ate 


ega 


aag 


agg 


409 


Leu 


Arg 


Gly 


Trp 


Lys 


Gly 


Asn 


Glu 


Leu 


Gin 


Arg 


Cys 


He 


Arg 


Lys 


Arg 




80 










85 










90 










95 




aaa 


atg 


gtt 


gga 


agt 


aga 


atg 


ttc 


get 


gat 


gae 


ctg 


cac 


aac 


ctt 


aat 


457 


Lys 


Met 


Val 


Gly 


Ser 


Arg 


Met 


Phe 


Ala 


Asp 


Asp 


Leu 


His 


Asn 


Leu 


Asn 












100 










105 










110 






aaa 


cgc 


ate 


cgc 


tat 


etc 


tac 


aaa 


cac 


ttt 


aac 


cga 


cat 


ggg 


aag 


ttt 


505 


Lys 


Arg 


lie 


Arg 


Tyr 


Leu 


Tyr 


Lys 


His 


Phe 


Asn 


Arg 


His 


Gly 


Lys 


Phe 










115 










120 










125 









cga tagaagagaa agctgagaac ttcggaaaag gctcatctgt caccctggag 558 
Arg 

aagggaaact gtacttttcc ctgtgaggaa acggctttgt attttctctg taataaaatg 618 

gggcttcttt ggacaaaaaa aaaaaaaaaa 648 



<210> 101 
<211> 491 
<212> DNA 
<213> Homo sapiens 



wo 02/083898 
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82 

<220> 
<221> CDS 
<222> 50.. 205 

<220> 

<221> polyA site 
<222> 469.. 491 



<400> 101 

actctttcct tggctgctca ggtcataggt gctgtgatct agagacaaa atg ttc ctt 58 

Met Phe Leu 
1 



aca 


ctg gca gat act tgc 


aaa eta agg gga 


atg 


age 


ttc 


ctt 


ctg 


aat 


106 


Thr 


Leu Ala Asp Thr Cys 


Lys Leu Arg Gly Met 


Ser 


Phe 


Leu 


Leu 


Asn 






5 


10 




15 












gtt 


tat gaa gga gag gcc 


act gtg tea tct 


gtc 


tta 


gag 


eta 


ttg 


gaa 


154 


Val 


Tyr Glu Gly Glu Ala 


Thr Val Ser Ser 


Val 


Leu 


Glu 


Leu 


Leu 


Glu 




20 


25 




30 










35 




tec 


tgg ate att gtg gga 


aat gaa aga tac 


ttc 


gat 


gga 


ate 


age 


age 


202 


Ser Trp He He Val Gly Asn Glu Arg Tyr 


Phe 


Asp Gly 


He 


Ser 


Ser 






40 


45 










50 







cat tgatccaatg ccaactccaa gactggaacg tcgcaatgat agttccaagg 255 

His 



cggaaatttg aegtaattet tttcgaeaca gttttacagg tctggatgcc cattttaatt 315 

cttctgaaag eatgcctect ecttctgget tcaggaetcc atctecagec tettgatcta 375 

aaaataatcc ceaaaccaaa aaattagata etatttccte aaaattaggt attttaatca 4 35 

aaacatctta acataaatae attattatca eccaaaaaaa aaaaaaaatg cgaaaa 4 91 



<210> 102 
<211> 565 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 90. .341 



<220> 

<221> polyA_site 
<222> 534.. 565 



<220> 

<221> polyA_signal 
<222> 515. .520 



<400> 102 

gttgagggag tcgggccgcg actgtggtcg tttttatacc ttcccgcgcg gaegccggcg 60 



ctgccaacgg aagggcgggc tctgctgcc 


atg 


aag 


gtg 


aag 


att 


aag 


tgc 


tgg 


113 


















Met 
1 


Lys 


Val 


Lys 


He 

5 


Lys 


Cys 


Trp 




aac 


ggc 


gtg 


gcc 


act 


tgg 


etc 


tgg 


gtg 


gcc 


aac 


gat 


gag 


aac 


tgt 


ggc 


161 


Asn 


Gly 


Val 


Ala 


Thr 


Trp 


Leu 


Trp 


Val 


Ala 


Asn 


Asp 


Glu 


Asn 


Cys 


Gly 






10 










15 










20 












ate 


tgc 


agg 


atg 


gca 


ttt 


aac 


gga 


tgc 


tgc 


cet 


gac 


tgc 


aag 


gtg 


cec 


209 


He 


Cys 


Arg 


Met 


Ala 


Phe 


Asn 


Gly 


Cys 


Cys 


Pro 


Asp 


Cys 


Lys 


Val 


Pro 




25 










30 










35 










40 




ggc 


gac 


gac 


tgc 


ccg 


ctg 


gtg 


tgg 


ggc 


cag 


tgc 


tec 


cac 


tgc 


ttc 


cac 


257 


Gly Asp 


Asp 


Cys 


Pro 


Leu 


Val 


Trp 


Gly 


Gin 


Cys 


Ser 


His 


Cys 


Phe 


His 












45 










50 










55 






atg 


cat 


tgc 


ate 


etc 


aag 


tgg 


ctg 


cac 


gca 


cag 


cag 


gtg 


eag 


cag 


cac 


305 


Met 


His 


Cys 


He 


Leu 


Lys 


Trp 


Leu 


His 


Ala 


Gin 


Gin 


Val 


Gin 


Gin 


His 










60 










65 










70 








tgc 


cec 


atg 


tgc 


cgc 


cag 


gaa 


tgg 


aag 


ttc 


aag 


gag 


tgaggcccga 




351 
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Cys Pro Met Cys Arg Gin Glu Trp Lys Phe Lys Glu 

75 80 
cctggctctc gctggagggg catcctgaga ctccttcctc atgctggcgc cgatggctgc 411 
tggggacagc gcccctgagc tgcaacaagg tggaaacaag ggctggagct gcgtttgttt 471 
tgccatcact atgttgacac ttttatccaa taagtgaaaa ctcattaaac tactcaaatc 531 
ttaaaaaaaa aaaaaaagaa aaaaaaaaaa aaaa 565 

<210> 103 
<211> 661 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 169.. 420 



<220> 

<221> polyA^site 
<222> 614.. 661 



<220> 

<221> poiyA_signal 
<222> 594. .599 



<400> 103 

gagactcggc gggcgctgtt gagggagtcg gcccgcgact gtggtcgttt ttataccttc 60 
ccgcgcggac gccggcctgc caacggaagg gcggagacgg agtttcgtca tgttggccag 120 
gcccatttga gatctttgaa gatatcctca acgtgaggct ctgctgcc atg aag gtg 177 

Met Lys Val 



aag 


att 


aag 


tgc 


tgg 


aac 


ggc 


gtg 


gcc 


act 


tgg 


etc 


tgg 


1 

gtg 


gcc 


aac 


225 


Lys 


He 


Lys 


Cys 


Trp 


Asn 


Gly 


Val 


Ala 


Thr 


Trp Leu 


Trp 


Val 


Ala 


Asn 






5 










10 










15 












gat 


gag 


aac 


tgt 


ggc 


ate 


tgc 


agg 


atg 


gca 


ttt 


aac 


gga 


tgc 


tgc 


cct 


273 


Asp Glu 


Asn 


Cys 


Gly 


He 


Cys 


Arg 


Met 


Ala 


Phe 


Asn 


Gly 


Cys 


Cys 


Pro 




20 










25 










30 










35 




gac 


tgc 


aag 


gtg 


ccc 


ggc 


gac 


gac 


tgc 


ccg 


ctg 


gtg 


tgg 


ggc 


cag 


tgc 


321 


Asp Cys 


Lys 


Val 


Pro 


Gly 


Asp 


Asp 


Cys 


Pro 


Leu 


Val 


Trp 


Gly 


Gin 


Cys 












40 










45 










50 






tec 


cac 


tgc 


ttc 


cac 


atg 


cat 


tgc 


ate 


etc 


aag 


tgg 


ctg 


cac 


gca 


cag 


369 


Ser 


His 


Cys 


Phe 
55 


His 


Met 


His 


Cys 


He 
60 


Leu 


Lys 


Trp 


Leu 


His 
65 


Ala 


Gin 




cag 


gtg 


cag 


cag 


cac 


tgc 


ccc 


atg 


tgc 


cgc 


cag 


gaa 


tgg 


aag 


ttc 


aag 


417 


Gin 


Val 


Gin 
70 


Gin 


His 


Cys 


Pro 


Met 
75 


Cys 


Arg 


Gin 


Glu 


Trp 
80 


Lys 


Phe 


Lys 




gag 


tgaggcccga cctggctctc gctggagggg catcctgaga 


ctccttcctc 




470 



Glu 

atgctggcgc cgatggctgc tggggacagc gcccctgagc tgcaacaagg tggaaacaag 530 

ggctggagct gcgtttgttt tgccatcact atgttgacac ttttatccaa taagtgaaaa 590 

ctcattaaac tactcaaatc ttgaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaagaaaaa 650 

aaaaaaaaaa a 661 



<210> 104 
<211> 609 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 144. .395 



<220> 

<221> polyA_site 



wo 02/083898 
<222> 594. .609 
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<220> 

<221> polyA signal 
<222> 569. ,574 



<400> 104 

gtttttatac cttcccgcgc ggacgccggc gctgccaacg gaagggcggg tagggcggtg 60 
cgtgattaga gacggagttt cgtcatgttg gccaggccca tttgagatct ttgaagatat 120 
cctcaacgtg aggctctgct gcc atg aag gtg aag att aag tgc tgg aac ggc 173 

Met Lys Val Lys lie Lys Cys Trp Asn Gly 
15 10 



gtg 


gcc 


act 


tgg 


etc 


tgg 


gtg 


gcc 


aac 


gat 


gag 


aac 


tgt 


ggc 


ate 


tgc 


221 


Val 


Ala 


Thr 


Trp 


Leu 


Trp 


Val 


Ala 


Asn 


Asp 


Glu 


Asn 


Cys 


Gly 


He 


Cys 












15 










20 










25 






agg 


atg 


gca 


ttt 


aac 


gga 


tgc 


tgc 


cct 


gac 


tgc 


aag 


gtg 


ecc 


ggc 


gac 


269 


Arg 


Met 


Ala 


Phe 


Asn 


Gly 


Cys 


Cys 


Pro 


Asp 


Cys 


Lys 


Val 


Pro 


Gly 


Asp 










30 










35 










40 








gac 


tgc 


ccg 


ctg 


gtg 


tgg 


ggc 


cag 


tgc 


tec 


cac 


tgc 


ttc 


cac 


atg 


cat 


317 


Asp 


Cys 


Pro 


Leu 


Val 


Trp 


Gly 


Gin 


Cys 


Ser 


His 


Cys 


Phe 


His 


Met 


His 








45 










50 










55 










tgc 


ate 


etc 


aag 


tgg 


ctg 


cac 


gca 


cag 


cag 


gtg 


cag 


cag 


cac 


tgc 


ccc 


365 


Cys 


He 


Leu 


Lys 


Trp 


Leu 


His 


Ala 


Gin 


Gin 


Val 


Gin 


Gin 


His 


Cys 


Pro 






60 










65 










70 












atg 


tgc 


cgc 


cag 


gaa 


tgg 


aag 


ttc 


aag 


gag 


tgaggcccga cctggctctc 


415 


Met 


Cys 


Arg 


Gin 


Glu 


Trp 


Lys 


Phe 


Lys 


Glu 

















75 80 

gctggagggg catectgaga etccttccte atgctggcge egatggctgc tggggacagc 475 

gcccctgage tgcaacaagg tggaaacaag ggctggaget gcgtttgttt tgceatcact 535 

atgttgacac ttttatccaa taagtgaaaa etcattaaac tactcaaatc ttgaaaagaa 595 

aaaaaaaaaa aaaa 609 



<210> 105 
<211> 635 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 177. .428 



<220> 

<221> polyA_site 
<222> 620.. 635 



<220> 

<221> polyA_signal 
<222> 602. .607 



<400> 105 

aagactcggc gggcgctgtt gagggagteg ggccgcgact gtggtcgttt ttatacettc 60 
ccgcgcggac gecggcgctg ccaacggaag ggcgggtagg gcgagacgga gtttcgtcat 120 
gttggccagg cccatttgag atctttgata tatcctcaac gtgaggctct gctgcc atg 179 

Met 
1 

aag gtg aag att aag tgc tgg aac ggc gtg gcc act tgg etc tgg gtg 227 
Lys Val Lys He Lys Cys Trp Asn Gly Val Ala Thr Trp Leu Trp Val 

5 10 15 

gcc aac gat gag aac tgt ggc ate tgc agg atg gca ttt aac gga tgc 275 
Ala Asn Asp Glu Asn Cys Gly He Cys Arg Met Ala Phe Asn Gly Cys 

20 25 30 

tgc cct gac tgc aag gtg ccc ggc gac gac tgc ccg ctg gtg tgg ggc 323 
Cys Pro Asp Cys Lys Val Pro Gly Asp Asp Cys Pro Leu Val Trp Gly 
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35 40 45 

cag tgc tec cac tgc ttc cac atg cat tgc ate etc aag tgg ctg cac 371 
Gin Cys Ser His Cys Phe His Met His Cys lie Leu Lys Trp Leu His 
50 55 60 65 

gca cag cag gtg cag cag cac tgc ccc atg tgc cgc cag gaa tgg aag 419 
Ala Gin Gin Val Gin Gin His Cys Pro Met Cys Arg Gin Glu Trp Lys 

70 75 80 

ttc aag gag tgaggcccga cctggctctc gctggagggg catcctgaga 4 68 

Phe Lys Glu 

ctccttcctc atgctggcgc cgatggctgc tggggacagc gcccctgagc tgcaacaagg 528 
tggaaacaag ggctggagct gcgtttgttt tgccatcact atgttgacac ttttatccaa 588 
taagtgaaaa ctcattaaac tactcaaatc caaaaaaaaa aaaaaaa 635 



<210> 106 
<211> 573 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 114. .365 

<220> 

<221> polyA__site 
<222> 558. .573 



<220> 

<221> polyA_signal 
<222> 539. .544 



<400> 106 

ggcgtgcgag aetcggcggg cgctgttgag ggagtcgggc cgcgactgtg gtcgttttta 60 
taccttcccg cgcggacgcc ggcgctgcca acggaagggc gggctctgct gcc atg 116 

Met 
1 



aag 


gtg 


aag 


att 


aag 


tgc 


tgg 


aac 


ggc 


gtg 


gcc 


act 


tgg 


etc 


tgg 


gtg 


164 


Lys 


Val 


Lys 


He 


Lys 


Cys 


Trp 


Asn 


Gly 


Val 


Ala 


Thr 


Trp 


Leu 


Trp 


Val 










5 










10 










15 








gcc 


aac 


gat 


gag 


aac 


tgt 


gge 


ate 


tgc 


agg 


atg 


gca 


ttt 


aac 


gga 


tgc 


212 


Ala 


Asn 


Asp 


Glu 


Asn 


Cys 


Gly 


He 


Cys 


Arg 


Met 


Ala 


Phe 


Asn 


Gly 


Cys 








20 










25 










30 










tgc 


cct 


gac 


tgc 


aag 


gtg 


ccc 


ggc 


gac 


gac 


tgc 


ccg 


ct'g 


gtg 


tgg 


ggc 


260 


Cys 


Pro 


Asp 


Cys 


Lys 


Val 


Pro 


Gly 


Asp 


Asp 


Cys 


Pro 


Leu 


Val 


Trp 


Gly 






35 










40 










45 












cag 


tgc 


tec 


cac 


tgc 


ttc 


cac 


atg 


cat 


tgc 


ate 


etc 


aag 


tgg 


ctg 


cac 


308 


Gin 


Cys 


Ser 


His 


Cys 


Phe 


His 


Met 


His 


Cys 


He 


Leu 


Lys 


Trp 


Leu 


His 




50 










55 










60 










65 




gca 


cag 


cag 


gtg 


cag 


cag 


cac 


tgc 


ccc 


atg 


tgc 


cgc 


cag 


gaa 


tgg 


aag 


356 


Ala 


Gin 


Gin 


Val 


Gin 


Gin 


His 


Cys 


Pro 


Met 


Cys 


Arg 


Gin 


Glu 


Trp 


Lys 












70 










75 










80 







ttc aag gag tgaggcccga cctggctctc gctggagggg catcctgaga 405 
Phe Lys Glu 

ctccttcctc atgctggcgc cgatggctgc tggggacagc gcccctgagc tgcaacaagg 4 65 

tggaaacaag ggctggagct gcgtttgttt tgccatcact atgttgacac ttttatccaa 525 

taagtgaaaa ctcattaaac tactcaaatc ctaaaaaaaa aaaaaaaa 573 



<210> 107 

<211> 689 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 



wo 02/083898 
<222> 44. .283 



86 
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<220> 

<221> polyA^site 
<222> 674. .689 

<220> 

<221> polyA_signal 
<222> 644. .649 

<400> 107 

tttcgccgcc aaagcatcca gcagccccct gctccggccc age atg gcg acc ccg 55 

Met Ala Thr Pro 
1 



acc 


cag 


acc 


ccc 


aca 


aag 


get 


cct 


gag 


gaa 


cct 


gac 


cea ttt 


tac 


tat 


103 


Thr 


Gin 


Thr 


Pro 


Thr 


Lys 


Ala 


Pro 


Glu 


Glu 


Pro 


Asp 


Pro Phe 


Tyr 


Tyr 




5 










10 










15 








20 




gac 


tac 


aac 


acg 


gtg 


cag 


act 


gtg 


gge 


atg 


act 


ctg 


gca aee 


ate 


ttg 


151 


Asp Tyr 


Asn 


Thr 


Val 


Gin 


Thr 


Val 


Gly 


Met 


Thr 


Leu 


Ala Thr 


He 


Leu 












25 










30 








35 






ttc 


ctg 


ctg 


ggt 


ate 


etc 


ate 


gtc 


ate 


age 


aag 


aag 


gtg aag 


tge 


agg 


199 


Phe 


Leu 


Leu 


Gly 
40 


He 


Leu 


He 


Val 


He 
45 


Ser 


Lys 


Lys 


Val Lys 
50 


Cys 


Arg 




aag 


gcg 


gac 


tec 


agg 


tct 


gag 


age 


cea 


acc 


tgc 


aaa 


tec tgt 


aag 


tct 


247 


Lys 


Ala 


Asp 
55 


Ser 


Arg 


Ser 


Glu 


Ser 
60 


Pro 


Thr 


Cys 


Lys 


Ser Cys 
65 


Lys 


Ser 




gag 


ctt 


ccc 


tot 


tea 


gee 


ect 


ggt 


gge 


gge 


gge 


gtg 


taaeacctte 




293 


Glu 


Leu 


Pro 


Ser 


Ser 


Ala 


Pro 


Gly 


Gly 


Gly 


Gly Val 












70 










75 










80 











ccgaggaaac tccgctgccg accctgeetg agegegggag ectgaggacc gggtggaggc 353 

ggtggggacc cagccgcgcg cegggagege teeeeggaat gagcegcccc acccacccca 413 

aggetggagc cgctgeaccc tgccgtccct ctecaggeet tggcaatgac gateceeeaa 473 

agageecgte tgeaecceag aeeeagggee teaggeetec agetectggg atecgggagt 533 

eeateeeggc ceagcacccc cagcatccce gtgtatggcc cccctgcacc tccttgtctc 593 

atccccgaag atccgtcece etggeecctc agtgteeatg tcttgagctt aataaatgtg 653 

eatttggttt tttcetctge aaaaaaaaaa aaaaaa 689 

<210> 108 
<211> 561 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 99. .479 

<220> 

<221> polyA_site 
<222> 546.. 561 

<220> 

<221> polyA_signal 
<222> 526. .531 

<400> 108 

tttcgaaggg aagggagece cetataaaae ageetacagt ggaeagtctg gtcggeagag 60 
ccgcaggtca gtcgtgaaga gggagctcta ttgccaee atg agt ttc tee gge aag 116 

Met Ser Phe Ser Gly Lys 
1 5 

tac caa ctg cag age cag gaa aac ttt gaa gee tte atg aag gca ate 164 
Tyr Gin Leu Gin Ser Gin Glu Asn Phe Glu Ala Phe Met Lys Ala He 

10 15 20 

ggt etg ccg gaa gag etc ate cag aag ggg aag gat ate aag ggg gtg 212 
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Gly Leu 


Pro 


Glu 


Glu 


Leu 


He 


Gin 


Lys 


Gly 


Lys 


Asp 


He 


Lys 


Gly 


Val 






25 










30 










35 










teg gaa 


ate 


gtg 


cag 


aat 


ggg 


aag 


cae 


ttc 


aag 


ttc 


acc 


ate 


acc 


get 


260 


Ser Glu 


He 


Val 


Gin 


Asn 


Gly 


Lys 


His 


Phe 


Lys 


Phe 


Thr 


He 


Thr 


Ala 




40 










45 










50 












ggg tec 


aaa 


gtg 


ate 


eaa 


aac 


gaa 


ttc 


aeg 


gtg 


ggg 


gag 


gaa 


tgt 


gag 


308 


Gly Ser 


Lys 


Val 


He 


Gin 


Asn 


Glu 


Phe 


Thr 


Val 


Gly 


Glu 


Glu 


Cys 


Glu 




55 








60 










65 










70 




ctg gag 


aca 


atg 


aca 


ggg 


gag 


aaa 


gtc 


aag 


aca 


gtg 


gtt 


cag 


ttg 


gaa 


356 


Leu Glu 


Thr 


Met 


Thr 


Gly 


Glu 


Lys 


Val 


Lys 


Thr 


Val 


Val 


Gin 


Leu 


Glu 










75 










80 










85 






ggt gac 


aat 


aaa 


ctg 


gtg 


aca 


act 


ttc 


aaa 


aac 


ate 


aag 


tet 


gtg 


ace 


404 


Gly Asp 


Asn 


Lys 


Leu 


Val 


Thr 


Thr 


Phe 


Lys 


Asn 


He 


Lys 


Ser 


Val 


Thr 








90 










95 










100 








gaa etc 


aae 


ggc 


gac 


ata 


ate 


acc 


aat 


acc 


atg 


aca 


ttg 


ggt 


gac 


att 


452 


Glu Leu 


Asn 


Gly 


Asp 


He 


He 


Thr 


Asn 


Thr 


Met 


Thr 


Leu 


Gly 


Asp 


He 






105 










110 










115 










gtc ttc 


aag 


aga 


ate 


age 


aag 


aga 


att 


taaacaagtc tgcatttcat 




499 


Val Phe 


Lys 


Arg 


lie 


Ser 


Lys 


Arg 


He 



















120 125 
attattttag tgtgtaaaat taatgtaata aagtgaactt tgttttaaaa aaaaaaaaaa 559 
aa 561 



<210> 109 
<211> 738 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 1..261 



<220> 

<221> polyA_site 
<222> 724. .738 



<220> 

<221> polyA_signal 
<222> 712. .717 



<400> 109 


























gac gea 


gta 


gee 


gtg 


ggg 


aag 


ttt teg 


eaa 


tgg 


ccg 


ccg 


gaa egg 


teg 


48 


Asp Ala 


Val 


Ala 


Val 


Gly 


Lys 


Phe Ser 


Gin 


Trp 


Pro 


Pro Glu Arg 


Ser 




1 






5 








10 








15 






eeg gee 


gat 


get 


etc 


tgc 


gac 


gga aag 


aaa 


gee 


gee 


gga 


cec act 


tec 


96 


Pro Ala 


Asp 


Ala 


Leu 


Cys 


Asp Gly Lys 


Lys 


Ala 


Ala 


Gly 


Pro Thr 


Ser 








20 








25 










30 






ggt gcg 


ttt 


tac 


aat 


gaa 


etc 


egg gea 


etc 


gag 


teg 


etc 


ccc gga 


aac 


144 


Gly Ala 


Phe 


Tyr 


Asn 


Glu 


Leu Arg Ala 


Leu 


Glu 


Ser 


Leu 


Pro Gly Asn 






35 










40 








45 








tat agt 


tct 


get 


tec 


ace 


gtt 


gtc eta 


ett 


tgg 


ata 


gea 


tea aga 


eaa 


192 


Tyr Ser 


Ser 


Ala 


Ser 


Thr 


Val 


Val Leu Leu Trp 


He 


Ala 


Ser Arg 


Gin 




50 










55 








60 










aaa tct 


cag 


tct 


teg 


agg 


aac 


tea cag 


tgt 


age 


aga 


aaa 


gac aaa 


gga 


240 


Lys Ser 


Gin 


Ser 


Ser 


Arg Asn 


Ser Gin 


Cys 


Ser Arg 


Lys 


Asp Lys 


Gly 




65 








70 








75 








80 




ggt aaa 


cag 


gag 


gag 


cat 


gaa 


taattcatct tgatgtaatc atgcaagact 


291 


Gly Lys 


Gin 


Glu 


Glu 


His 


Glu 



















85 



cctaagaaga tgttaggaga ccctgtcatc attettttgg cctctagtgt ectttatctt 351 

catgtttgct ggcttccttt tactaetatg gaaatacaac atatgeaaaa gtagtgtttt 411 

cgctgtttaa aaacatcgte agcagcatea tcatcattaa ctgeaccagg aagecatgae 471 

tcctttctgg acaaggtgca cttgcetgtc atccttccat etetttcttc ctccttgaag 531 
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ttcacaatac ccagtccagg agcattgcag agaatgctag gtgtgtctgc agacttgact 591 

ttaaaagaaa acaagttcct tcaagtgagt tcatcatgat gggaatgata atttgtttga 651 

gagaatgtgt gttctttggc aaaggaactc ctgttatacc atgacagtaa taatagcagt 711 
aataaaatca ccaaaaaaaa aaaaaaa 738 



<210> 110 
<211> 584 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 88. .375 



<220> 

<221> polyA_site 
<222> 567.. 584 



<220> 

<221> polyA_signal 
<222> 546.. 551 



<400> 110 

gagtcagtct cgccagctgc cggtctttcg ggggctccgt aactttctat ccgtccgcgt 60 
cagcgcyttg ccaccctcat ctccaat atg cct ggt ccg acc ccc agt ggc act 114 

Met Pro Gly Pro Thr Pro Ser Gly Thr 



aac 


gtg 


gga 


tec 


tea 


ggg 


cgc 


1 

tct 


ccc 


age 


aaa 


5 

gca 


gtg 


gee 


gee 


egg 


162 


Asn 


Val 


Gly 


Ser 


Ser 


Gly 


Arg 


Ser 


Pro 


Ser 


Lys 


Ala 


Val 


Ala 


Ala 


Arg 




10 










15 










20 










25 




gcg 


gcg 


gga 


tec 


act 


gtc 


egg 


cag 


agg 


aaa 


aat 


gee 


age 


tgt 


ggg 


aca 


210 


Ala 


Ala 


Gly 


Ser 


Thr 


Val 


Arg 


Gin 


Arg 


Lys 


Asn 


Ala 


Ser 


Cys 


Gly 


Thr 












30 










35 










40 






agg 


agt 


gca 


ggc 


cgc 


aca 


acc 


teg 


gca 


ggc 


acc 


ggg 


ggg 


atg 


tgg 


cga 


258 


Arg 


Ser 


Ala 


Gly 


Arg 


Thr 


Thr 


Ser 


Ala 


Gly 


Thr 


Gly 


Gly 


Met 


Trp 


Arg 










45 










50 










55 








ttc 


tac 


aca 


gaa 


gat 


tea 


cct 


ggg 


etc 


aaa 


gtt 


ggc 


cct 


gtt 


cea 


gta 


306 


Phe 


Tyr 


Thr 


Glu 


Asp 


Ser 


Pro 


Gly 


Leu 


Lys 


Val 


Gly 


Pro 


Val 


Pro 


Val 








60 










65 










70 










ttg 


gtt 


atg 


agt 


ctt 


ctg 


ttc 


ate 


get 


tct 


gta 


ttt 


atg 


ttg 


cac 


att 


354 


Leu 


Val 


Met 


Ser 


Leu 


Leu 


Phe 


lie 


Ala 


Ser 


Val 


Phe 


Met 


Leu 


His 


He 






75 










80 










85 












tgg 


ggc 


aag 


tac 


act 


cgt 


teg 


tagattcagt tacatecatc tgtcatctga 


405 


Trp Gly 


Lys 


Tyr 


Thr 


Arg 


Ser 























90 95 

agaaggagga aaaaaeceaa catttcttgg aceaaaagta tagtgactat ctgttcatga 465 

gagaaatttt etgtaagett gctgttttac aggggattta teaataattg attttgagga 525 

atcagttttt ttctatggct aataaacttt ttaatteact tataaaaaaa aaaaaaaaa 584 



<210> 111 
<211> 588 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16. .183 



<220> 

<221> polyA site 
<222> 289. .588 



<220> 
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<221> polyA_signal 
<222> 267. .272 



<400> 111 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg eaa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 

15 10 

gga aat act aaa gca aga eta etc tat gaa gee aat ett eea gag aac 99 

Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 25 

ttt ega aga eea eag aea gat eaa gea gtg gaa ttt tte ate aga gat 147 

Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tae gaa aaa atg eea tagetattac 193 

Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 

45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctetcctt etetgeaagc 253 

tgetgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 

agaaaaggag ceagaaaagc cggcaaaace acttacagct gaaaagctgc agaagaaaga 373 

tcageaactg gageetaaaa aaagtaecag ccctaaaaaa gctgcggage eaetgtggat 433 

ettttaggae ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 493 

gtggaagagt tgaeettcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 553 

aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 588 



<210> 112 
<211> 589 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16.. 183 

<220> 

<221> polyA_site 
<222> 289. .589 



<220> 

<221> polyA_signal 
<222> 267.. 272 



<400> 112 

tttattttgc ettct atg tea tct tgg age tta atg tgc atg eaa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 

15 10 

gga aat act aaa gca aga eta etc tat gaa gee aat ett eea gag aae 99 

Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 25 

ttt cga aga eea eag aca gat eaa gca gtg gaa ttt tte ate aga gat 147 

Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tae tae gaa aaa atg cca tagctattac 193 

Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 

45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcettttgta tcctetcctt etetgeaagc 253 

tgetgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 

agaaaaggag ceagaaaagc cggcaaaace acttacagct gaaaagctgc agaagaaaga 373 

tcageaactg gageetaaaa aaagtaecag ccctaaaaaa gctgcggage eeaetgtgga 433 

tcttttagga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 493 

agtggaagag ttgaccttea tgtaetgett teeaaaaaat tgatectagt tgttattgaa 553 

taaattgttg tetgttttaa gaaaaaaaaa aaaaaa 589 



<210> 113 
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<211> 588 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16.. 183 



<220> 

<221> polyA_site 
<222> 289. .588 



<220> 

<221> polyA_signal 
<222> 267.. 272 



<400> 113 
tttattttgc 



gga aat act 

Gly Asn Thr 
15 

ttt cga aga 
Phe Arg Arg 
30 

aaa tat gaa 
Lys Tyr Glu 
45 

aaatatttcc 
tgctgttgac 
agaaaaggag 
tcagcaactg 
cttttaggac 
gtggaagagt 
aaattgttgt 



cttct atg tea tct tgg age tta atg tge atg caa gat atg 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 
aaa gea aga eta cte tat gaa gee aat ctt eea gag aac 
Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 



20 



25 



cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 
Pro Gin Thr Asp Gin Ala Val Glu Phe Phe He Arg Asp 



35 

aag aag aaa tac tac gaa aaa 
Lys Lys Lys Tyr Tyr Glu Lys 
50 



40 

atg cca tagctattac 
Met Pro 

55 



tcctctgatg ctcctcttca gectttggta tcctctcett ctctgcaagc 
aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 
ceagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 
gagcetaaaa aaagtaccag ccctaaaaaa gctgcggasc cactgtggat 
ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 
tgacettcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 
ctgttttaag aaaaaaaaaa aaaaa 



51 



99 



147 



193 



253 
313 
373 
433 
493 
553 
588 



<210> 114 
<211> 590 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16. .183 



<220> 

<221> polyA_site 
<222> 289.. 590 



<220> 

<221> polyA_signal 
<222> 267.. 272 



<400> 114 

tttattttgc cttct atg tea tct tgg age tta atg tge atg caa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 
gga aat act aaa gca aga eta cte tat gaa gee aat ctt cca gag aac 99 
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 147 
Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe He Arg Asp 
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30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253 

tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 

agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373 

tcagcaactg gagcctaaaa aaagtaccag ccctaa'aaaa gctgcggagc ccactgtgga 433 

tcttttagga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 4 93 

agtggaagag ttgaccttca tgtactgctt tccaaaaaat tgatcctagt tgttattgaa 553 

taaattgttg tctgttttaa gaaaaaaaaa aaaaaat 590 



<210> 115 
<211> 595 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 

<220> 

<221> polyA_site 
<222> 572.. 595 

<220> 

<221> polyA_signal 
<222> 552.. 557 

<400> 115 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 

gga aat act aaa gca aga eta etc tat gaa gcc aat ctt cca gag aac 99 

Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt tte ate aga gat 147 

Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe He Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193 

Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253 

tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 

agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373 

tcagcaactg gagcctaaaa aaagtaccag cectaaaaaa gctgcggagc ccactgtgga 4 33 

tcttttagga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 4 93 

agtggaagag ttgaccttca tgtactgctt tccaaaaaat tgatcctagt tgttattgaa 553 

taaattgttg tctgttttaa gaaaaaaaaa aaaaaatgeg aa 595 

<210> 116 
<211> 588 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 

<220> 

<221> polyA_site 
<222> 289.. 588 
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<220> 

<221> polyA^signal 
<222> 267.. 272 



<400> 116 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 
gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 99 
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 147 
Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagetattac 193 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
45 50 55 

aaatatttcc teetetgatg ctcetcttca gcctttggta teetctectt ctctgcaagc 253 
tgctgttgae aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 
agaaaaggag ccagaaaage cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373 
teagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggasc eactgtggat 433 
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 4 93 
gtggaagagt tgacettcat gtactgettt ecaaaaaatt gatectagtt gttattgaat 553 
aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 588 

<210> 117 
<211> 588 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16. .183 



<220> 

<221> polyA_site 
<222> 289. .588 



<220> 

<221> polyA_signal 
<222> 267. .272 



<400> 117 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 

15 10 

gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 99 

Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 147 

Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagetattac 193 

Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 

45 50 55 

aaatatttcc teetetgatg ctcetcttca gcctttggta teetctectt ctctgcaagc 253 

tgctgttgae aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 

agaaaaggag ccagaaaage cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373 

teagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc eactgtggat 433 

cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 493 

gtggaagagt tgacettcat gtactgettt ecaaaaaatt gatectagtt gttattgaat 553 

aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 588 
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<210> 118 
<211> 588 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 



<220> 

<221> polyA_site 
<222> 289. .588 



<220> 

<221> polyA_signal 
<222> 267. .272 



<400> 118 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg eaa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
1 5 10 ' 

gga aat act aaa gca aga eta etc tat gaa gee aat ett cea gag aac 99 

Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 
15 20 25 

ttt ega aga eea eag aca gat caa gca gtg gaa ttt ttc ate aga gat 147 

Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe He Arg Asp 
30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cea tagctattac 193 

Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 

45 50 55 

aaatatttcc teetetgatg etectcttea geetttggta tcctctcctt ctctgcaage 253 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 
agaaaaggag ecagaaaagc cggeaaaaec acttaeagct gaaaagctgc agaagaaaga 373 
teagcaaetg gagcetaaaa aaagtaccag ecetaaaaaa gctgeggagc eaetgtggat 433 
ettttaggae ttgtttggag aaagattgag aaggaaaaga gtgaaaggea gttagttgaa 493 
gtggaagagt tgaccttcat gtactgcttt ecaaaaaatt gatcetagtt gttattgaat 553 
aaattgttgt etgttttaag aaaaaaaaaa aaaaa 588 



<210> 119 
<211> 595 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16.. 183 



<220> 

<221> polyA_site 
<222> 571.. 595 



<220> 

<221> polyA_signal 
<222> 551. .556 



<400> 119 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg eaa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
1 5 10 

gga aat act aaa gca aga eta etc tat gaa gee aat ett cea gag aac 99 

Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 
15 20 25 

ttt ega aga cea eag aca gat eaa gca gtg gaa ttt ttc ate aga gat 147 



wo 02/083898 PCT/IBO 1/009 14 

94 

Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373 
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc cactgtggat 4 33 
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 4 93 
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 553 
aaattgttgt ctgttttaag aaaaaaaaaa aaaaatgcga aa 595 

<210> 120 
<211> 601 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16.. 183 



<220> 

<221> polyA^site 
<222> 572.. 601 



<220> 

<221> polyA_signal 
<222> 552.. 557 



<400> 120 

tttattttgc cttct atg tea tot tgg age tta atg tgc atg caa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 

15 10 

gga aat act aaa gca aga eta etc tat gaa gee aat ett cca gag aac 99 

Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 147 

Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe He Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193 

Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253 

tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 

agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373 

tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc ccactgtgga 433 

tcttttagga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 493 

agtggaagag ttgaccttca tgtactgctt tccaaaaaat tgatcctagt tgttattgaa 553 

taaattgttg tctgttttaa gaaaaaaaaa aaaaaatgcg aaaagctt 601 



<210> 121 
<211> 589 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16.. 183 



<220> 

<221> polyA site 
<222> 289.. 589 
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<220> 

<221> polyA_signal 
<222> 267.. 272 



<400> 121 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg eaa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 
gga aat act aaa gca aga eta etc tat gaa gee aat ett cca gag aac 99 
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 147 
Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cea tagctattac 193 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 
agaaaaggag ceagaaaagc cggeaaaacc acttaeagct gaaaagctgc agaagaaaga 373 
tcageaaetg gagectaaaa aaagtaccag ccctaaaaaa getgcggage eeaetgtgga 433 
tctttttgga cttgtttgga gaaagattga gaaggaaaag agtgaaagge agttagttga 493 
agtggaagag ttgaccttca tgtactgctt tccaaaaaat tgatcctagt tgttattgaa 553 
taaattgttg tctgttttaa gaaaaaaaaa aaaaaa 589 



<210> 122 
<211> 595 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16. .183 



<220> 

<221> polyA_site 
<222> 571. .595 



<220> 

<221> polyA_signal 
<222> 551.. 556 



<400> 122 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 

15 10 

gga aat act aaa gca aga eta etc tat gaa gee aat ett cca gag aac 99 

Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 147 

Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe He Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193 

Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 

45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253 

tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 

agaaaaggag ceagaaaagc cggeaaaacc acttaeagct gaaaagctgc agaagaaaga 373 

tcageaaetg gagectaaaa aaagtaccag ccctaaaaaa gctgeggasc caetgtggat 433 

cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 4 93 

gtggaagagt tgaecttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 553 

aaattgttgt ctgttttaag aaaaaaaaaa aaaaatgcga aa 595 
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<210> 123 
<211> 588 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16. .183 



<220> 

<221> polyA_site 
<222> 289.. 588 



<220> 

<221> polyA_signal 
<222> 267. .272 



<400> 123 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 

15 10 

gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 99 

Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 

15 20 25 

ttt cga aga eca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 147 

Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattae 193 

Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 

45 50 55 

aaatatttce tcctctgatg ctcctcttca geetttggta tectctcctt ctctgcaagc 253 

tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 

agaaaaggag ccagaaaagc eggcaaaace acttacagct gaaaagetgc agaagaaaga 373 

teagcaactg gagcetaaaa aaagtaccag cectaaaaaa gctgcggagc cactgtggat 433 

cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 4 93 

gtggaagagt tgaectteat gtactgcttt ccaaaaaatt gatcetagtt gttattgaat 553 

aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 588 



<210> 124 
<211> 588 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16.. 183 



<220> 

<221> polyA_site 
<222> 289. .588 



<220> 

<221> polyA_signal 
<222> 267.. 272 

<400> 124 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 
gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 99 
Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 
15 20 25 
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ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 147 
Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 

30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193 
Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 
45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373 
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc cactgtggat 433 
cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 4 93 
gtggaagagt tgaccttcat gtactgcttt ccaaaaaatt gatcctagtt gttattgaat 553 
aaattgttgt ctgttttaag aaaaaaaaaa aaaaa 588 



<210> 125 
<211> 589 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 16.. 183 

<220> 

<221> polyA_site 
<222> 289.. 589 

<220> 

<221> polyA_signal 
<222> 267. ,272 

<400> 125 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
1 5 10 

gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 99 

Gly Asn Thr Lys Ala Arg. Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 
15 20 25 

ttt cga aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 147 

Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp 
30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattac 193 

Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 

45 50 55 

aaatatttcc tcctctgatg ctcctcttca gcctttggta tcctctcctt ctctgcaagc 253 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 
agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373 
tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggagc ccactgtgga 433 
tettttagga cttgtttgga gaaagattga gaaggaaaag agtgaaaggc agttagttga 493 
agtggaagag ttgaccttca tgtactgctt tccaaaaaat tgatcctagt tgttattgaa 553 
taaattgttg tctgttttaa gaaaaaaaaa aaaaaa 589 

<210> 126 
<211> 663 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 86. .451 



<220> 

<221> polyA_site 
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<222> 622. .663 



<220> 

<221> polyA_signal 
<222> 597. .602 



<400> 126 

acacatccaa gcttaagacg gtgaggtcag cttcacattc tcaggaactc tccttctttg 60 

ggccacggaa ttaacccgag caggc atg gag gcc tct get etc acc tea tea 112 

Met Glu Ala Ser Ala Leu Thr Ser Ser 

1 5 

gca gtg acc agt gtg gcc aaa gtg gtc agg gtg gcc tct ggc tct gcc 160 

Ala Val Thr Ser Val Ala Lys Val Val Arg Val Ala Ser Gly Ser Ala 

10 15 20 25 

gta gtt ttg ccc ctg gcc agg att get aca gtt gtg att gga gga gtt 208 

Val Val Leu Pro Leu Ala Arg He Ala Thr Val Val He Gly Gly Val 

30 35 40 

gtg gcc atg gcg get gtg ccc atg gtg etc agt gcc atg ggc ttc act 256 

Val Ala Met Ala Ala Val Pro Met Val Leu Ser Ala Met Gly Phe Thr 

45 50 55 

geg gcg gga ate gee teg tec tec ata gca gcc aag atg atg tec gcg 304 

Ala Ala Gly He Ala Ser Ser Ser lie Ala Ala Lys Met Met Ser Ala 

60 65 70 

gcg gcc att gcc aat ggg ggt gga gtt gcc teg ggc age ett gtg get 352 

Ala Ala He Ala Asn Gly Gly Gly Val Ala Ser Gly Ser Leu Val Ala 

75 80 85 

act ctg cag tea ctg gga gca act gga etc tec gga ttg acc aag ttc 400 

Thr Leu Gin Ser Leu Gly Ala Thr Gly Leu Ser Gly Leu Thr Lys Phe 
90 95 100 105 

ate ctg ggc tec att ggg tct gme att geg get gtc att geg agg ttc 448 

He Leu Gly Ser He Gly Ser Xaa He Ala Ala Val He Ala Arg Phe^ 

110 115 120 

tac tagctcectg cccctcgccc tgcagagaag agaaccatgc caggggagaa 501 
Tyr 

ggcacccagc catcctgacc cagegaggag ccaactatcc caaatatacc tggggtgaaa 561 

tataccaaat tctgcatctc cagaggaaaa taagaaataa agatgaattg ttgcaactct 621 

aaaaaaaaaa aaaaaaaaaa aaaaacaaaa aaaaaaaaaa aa 663 



<210> 127 
<211> 633 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 88. .453 



<220> 

<221> polyA_site 
<222> 618.. 633 



<220> 

<221> polyA^signal 
<222> 599.. 604 



<400> 127 

aaacacatcc aagcttaaga cggtgaggtc agctteacat tctcaggaac tctccttctt 
tgggecaegg aattaacccg agcagge atg gag gcc tct get etc ace tea tea 

Met Glu Ala Ser Ala Leu Thr Ser Ser 

1 5 
gca gtg acc agt gtg gcc aaa gtg gtc agg gtg gcc tct ggc tct gcc 
Ala Val Thr Ser Val Ala Lys Val Val Arg Val Ala Ser Gly Ser Ala 
10 15 20 25 
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gta 


gtt 


ttg 


ccc 


ctg 


gem 


agg 


att 


get 


aca 


gtt 


gtg 


att 


gga 


gga 


gtt 


210 


Val 


Val 


Leu 


Pro 


Leu 
30 


Ala 


Arg 


He 


Ala 


Thr 
35 


Val 


Val 


He 


Gly 


Gly 
40 


Val 




gtg 


gcc 


atg 


gcg 


get 


gtg 


ccc 


atg 


gtg 


etc 


agt 


gcc 


atg 


ggc 


ttc 


act 


258 


Val 


Ala 


Met 


Ala 
45 


Ala 


Val 


Pro 


Met 


Val 
50 


Leu 


Ser 


Ala 


Met 


Gly 

55 


Phe 


Thr 




gcg 


gcg 


gga 


ate 


gcc 


teg 


tec 


tec 


ata 


gca 


gee 


aag 


atg 


atg 


tec 


gcg 


306 


Ala 


Ala 


Gly 
60 


He 


Ala 


Ser 


Ser 


Ser 
65 


He 


Ala 


Ala 


Lys 


Met 
70 


Met 


Ser 


Ala 




gcg 


gcc 


att 


gcc 


aat 


sgg 


ggt 


gga 


gtt 


gcc 


teg 


ggc 


age 


ctt 


gtg 


get 


354 


Ala 


Ala 
75 


He 


Ala 


Asn 


Xaa 


Gly 
80 


Gly 


Val 


Ala 


Ser 


Gly 
85 


Ser 


Leu 


Val 


Ala 




act 


ctg 


cag 


tea 


ctg 


gga 


gea 


act 


gga 


etc 


tec 


gga 


ttg 


ace 


aag 


ttc 


402 


Thr 


Leu 


Gin 


Ser 


Leu 


Gly 


Ala 


Thr 


Gly 


Leu 


Ser 


Gly 


Leu 


Thr 


Lys 


Phe 




90 










95 










100 










105 




ate 


ctg 


ggc 


tec 


att 


ggg 


tct 


gcc 


att 


gcg 


get 


gtc 


att 


gcg 


agg 


ttc 


450 


He 


Leu 


Gly 


Ser 


He 
110 


Gly 


Ser 


Ala 


He 


Ala 
115 


Ala 


Val 


He 


Ala 


Arg 
120 


Phe 





tac tagctccctg cccctcgccc tgcagagaag agaaccatgc caggggagaa 503 
Tyr 

ggeacccage catcetgacc eagcgaggag ccaactatcc caaatataec tggggtgaaa 563 

tataccaaat tetgeatctc cagaggaaaa taagaaataa agatgaattg ttacaaaaaa 623 

aaaaaaaaaa 633 

<210> 128 
<211> 588 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 15.. 236 



<220> 

<221> polyA site 
<222> 573. .588 



<220> 

<221> polyA^signal 
<222> 551. .556 



<400> 128 

gaaaattttc cact atg get tec age act gtc eeg gtg age get get ggc 50 
Met Ala Ser Ser Thr Val Pro Val Ser Ala Ala Gly 
15 10 



teg 


get 


aat 


gaa 


act 


ccc gaa 


ata eeg gac aac gtg 


gga gat 


tgg 


ctt 


98 


Ser 


Ala 


Asn 


Glu 


Thr 


Pro Glu 


He Pro Asp Asn Val 


Gly Asp 


Trp 


Leu 








15 








20 


25 








egg 


ggc 


gtc 


tac 


cgc 


ttt gcc 


act gat agg aat gac 


ttc egg 


agg 


aac 


146 


Arg 


Gly 


Val 


Tyr 


Arg 


Phe Ala 


Thr Asp Arg Asn Asp 


Phe Arg Arg Asn 






30 








35 


40 










ttg 


ata 


eta 


aat 


ttg 


gga etc 


ttt get gcg gga gtt 


tgg ctg 


gee 


agg 


194 


Leu 


He 


Leu 


Asn 


Leu 


Gly Leu 


Phe Ala Ala Gly Val 


Trp Leu 


Ala 


Arg 




45 










50 


55 






60 




aac 


ttg 


agt 


gac 


att 


gac etc 


atg gca cct cag cca 


ggg gtg 






236 


Asn 


Leu 


Ser 


Asp 


He Asp Leu Met Ala Pro Gin Pro Gly Val 









65 70 



tagceaagta gacaaatgga atcctgtgct gaaccegaat cttccaaaaa acagectaca 296 

atctgtgacc accaeaagat gtgecetgat ggeagetgaa gtttgattea gatgggcact 356 

tttcttccce ttccetgect agtttecttt tgtteettga gtecacgeag aattceattc 416 

tctggtcagc agacaggctt aagctaaagt attgectcta ttctgtaaag ttctgtacat 476 

agttcccaag ettctgcagg gggtgatttt tgctcttgtc ctgagaaata acagtgetgt 536 

tttaaaaaac ataaaataaa taccgcacac aaagacaaaa aaaaaaaaaa aa 588 
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<210> 129 
<211> 608 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 39. .260 

<220> 

<221> polyA site 
<222> 589.. 608 

<220> 

<221> polyA_signal 
<222> 577.. 582 

<400> 129 

acggttcctc caccgaggcc catgcgaagc tttccact atg get tec age act gtc 56 

Met Ala Ser Ser Thr Val 



ccg gtg 


age 


get 


get 


gge 


teg 


get 


aat 


gaa 


1 

act 


cec 


gaa 


ata 


5 

ccg 


gac 


104 


Pro Val 


Ser 


Ala 
10 


Ala 


Gly 


Ser 


Ala 


Asn 
15 


Glu 


Thr 


Pro 


Glu 


He 
20 


Pro 


Asp 




aac gtg 


gga 


gat 


tgg 


ctt 


egg 


gge 


gte 


tac 


egc 


ttt 


gee 


act 


gat 


agg 


152 


Asn Val 


Gly 


Asp 


Trp 


Leu 


Arg Gly Val 


Tyr 


Arg 


Phe 


Ala 


Thr 


Asp 


Arg 






25 










30 










35 










aat gac 


ttc 


egg 


agg 


aac 


ttg 


ata 


eta 


aat 


ttg 


gga 


etc 


ttt 


get 


gcg 


200 


Asn Asp 


Phe 


Arg 


Arg 


Asn 


Leu 


He 


Leu 


Asn 


Leu 


Gly 


Leu 


Phe 


Ala 


Ala 




40 










45 










50 












gga gtt 


tgg 


etg 


gee 


agg 


aac 


ttg 


agt 


gac 


att 


gac 


etc 


atg 


gea 


cet 


248 


Gly Val 


Trp 


Leu 


Ala 


Arg 


Asn 


Leu 


Ser 


Asp 


He 


Asp 


Leu 


Met 


Ala 


Pro 




55 








60 










65 










70 





cag eca ggg gtg tagceaagta gaeaaatgga atcctgtgct gaaeecgaat 300 

Gin Pro Gly Val 

ettecaaaaa acagcctaca atctgtgacc accacaagat gtgccctgat ggcagctgaa 360 

gtttgattca gatgggcact tttcttcccc ttccctgect agtttccttt tgttccttga 420 

gtceacgeag aatteeatte tetggteage agaeaggett aagetaaagt attgccteta 480 

ttctgtaaag ttctgtacat agttyccaag cttctgeagg gggtgattkt tgctcttgtc 540 

etgagaaata aeagtgctgt tttaaaaaae atttgaaata aataecgcae acaaaaaaaa 600 

aaaaaaaa 608 

<210> 130 

<211> 1200 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 29. .469 

<220> 

<221> polyA_site 
<222> 1185. .1200 

<400> 130 

aettggegag tgagacgetg atgggagg atg gac gta etg gtg tet gag tgc 52 

Met Asp Val Leu Val Ser Glu Cys 
1 5 

tec gcg egg etg etg cag cag gaa gaa gag att aaa tet etg act get 100 
Ser Ala Arg Leu Leu Gin Gin Glu Glu Glu He Lys Ser Leu Thr Ala 
10 15 20 
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gaa 


att 


gac 


egg 


ttg 


aaa 


aac 


tgt 


ggc 


tgt 


tta 


gga 


get 


tct 


cca 


aat 


148 


Glu 


He 


Asp 


Arg 


Leu 


Lys 


Asn 


Cys 


Gly 


Cys 


Leu 


Gly 


Ala 


Ser 


Pro 


Asn 




25 










30 










35 










40 




ttg 


gag 


cag 


tta 


caa 


gaa 


gaa 


aat 


tta 


aaa 


tta 


aag 


tat 


cga 


ctg 


aat 


196 


Leu 


Glu 


Gin 


Leu 


Gin 


Glu 


Glu 


Asn 


Leu 


Lys 


Leu 


Lys 


Tyr 


Arg 


Leu 


Asn 












45 










50 










55 






att 


ctt 


cga 


aag 


agt 


ctt 


cag 


gca 


gaa 


agg 


aac 


aaa 


cca 


act 


aaa 


aat 


244 


He 


Leu 


Arg 


Lys 


Ser 


Leu 


Gin 


Ala 


Glu 


Arg 


Asn 


Lys 


Pro 


Thr 


Lys 


Asn 










60 










65 










70 








atg 


att 


aac 


att 


att 


age 


cgc 


eta 


caa 


gag 


gtc 


ttt 


ggt 


cat 


gca 


att 


292 


Met 


He 


Asn 


He 


He 


Ser 


Arg 


Leu 


Gin 


Glu 


Val 


Phe 


Gly 


His 


Ala 


He 








75 










80 










85 










aag 


get 


gca 


tat 


oca 


gat 


ttg 


gaa 


aat 


cct 


cct 


ctg 


eta 


gtg 


aca 


cca 


340 


Lys Ala 


Ala 


Tyr 


Pro 


Asp 


Leu 


Glu 


Asn 


Pro 


Pro 


Leu 


Leu 


Val 


Thr 


Pro 






90 










95 










100 












agt 


cag 


cag 


gcc 


aag 


ttt 


ggg 


gac 


tat 


cag 


tgt 


aat 


agt 


get 


atg 


ggt 


388 


Ser 


Gin 


Gin 


Ala 


Lys 


Phe 


Gly 


Asp 


Tyr 


Gin 


Cys 


Asn 


Ser 


Ala 


Met 


Gly 




105 










110 










115 










120 




att 


tct 


cag 


gtg 


atg 


tat 


tgt 


cat 


gac 


tct 


tgg 


ctg 


ttt 


gat 


ttt 


ttt 


436 


He 


Ser 


Gin 


Val 


Met 


Tyr 


Cys 


His 


Asp 


Ser 


Trp 


Leu 


Phe 


Asp 


Phe 


Phe 












125 










130 










135 






aag 


tat 


tat 


tat 


cat 


cat 


tgc 


cat 


tta 


cag 


aaa 


taatactatt acaagttgta 


489 


Lys 


Tyr 


Tyr 


Tyr 


His 


His 


Cys 


His 


Leu 


Gin 


Lys 















140 



145 



tccttagtga aaaggacatt tgccacagtt tgaaaaactt 

atatgtttta acttttttag gcacaatttt taaggtttgg 

caatatttaa gggcaatcat tggtactctt ttgtttaggt 

aggattgctg tgtggtggtg atgagtgctg ggaggtgaaa 

cagtcagcat cccaattaaa tatttgatgt aactgtgatc 

tcattttcaa gcagaggagt tccccatttt aaatagaggc 

ttaactgeat ctggcttggg tctttctgtt ttcctttctt 

ctctgaagag tccaggtctt acagtgtggt ttatttctca 

tcatggcttg aaaattagaa atgtaaatat gctggaacca 

tgttatgtca tcattaaaaa gttcgaggtg ttgtaatccc 

caggagaatt gcttgaaccc aggggcggag gttacagtga 

ctccagcctg ggcaacagag cgagactcca tctctaaaaa 



gagaaaggag 
ttaaatttta 
atttccctcc 
aattaaaata 
tttgagccag 
attgtctgat 
tgctgaattg 
agtgtgaata 
gacttcataa 
agctactagg 
gccgagatca 
aaaaaaaaaa 



ttgggggggt 
tatgtattct 
tgctgtgtcc 
agccatttac 
gcttatatat 
gtgtttatgg 
gaaggggtta 
ttgcacacct 
ctgtagactt 
gaggctgagg 
caccactgca 
a 



549 
609 
669 
729 
789 
849 
909 
969 
1029 
1089 
1149 
1200 



<210> 131 
<211> 646 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 50,.340 



<220> 

<221> polyA_site 
<222> 631.. 646 



<220> 

<221> polyA_signal 
<222> 614. .619 



<400> 131 

gaggagaaga ggtgaccccg ctaaagataa taaatcatca ttgatcaag atg tct tea 58 

Met Ser Ser 
1 

gca cct gag cct cca aca ttc aaa aag gaa cca ccc aaa gaa aaa gag 106 
Ala Pro Glu Pro Pro Thr Phe Lys Lys Glu Pro Pro Lys Glu Lys Glu 

5 10 15 

ttt caa age cca ggg etc aga ggg gtg cgc aca aca ace tta ttt cgt 154 
Phe Gin Ser Pro Gly Leu Arg Gly Val Arg Thr Thr Thr Leu Phe Arg 
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20 25 30 

get gtg aat cca gag etc ttc att aaa cct aac 

Ala Val Asn Pro Glu Leu Phe lie Lys Pro Asn 

40 45 

ttc gga ttg gta act ctt tea ctt tgc gtg gca 

Phe Gly Leu Val Thr Leu Ser Leu Cys Val Ala 



55 



60 



cat gca ata caa gag aat aaa aag gac etc tat 
His Ala lie Gin Glu Asn Lys Lys Asp Leu Tyr 

70 75 
gag ggg cac agt tat atg agg egg aaa aca tec 
Glu Gly His Ser Tyr Met Arg Arg Lys Thr Ser 

85 90 
tagtagtgct ggttagtgca gatggacctt tattaaaggt 
gaccttgtga gtgtacagta tcatgtttct tgttctagaa 
atagcagttg caaccagaca actgtcgtaa attttgtcct 
etcattcttt ttccacagag tgagcgtcat aatatttttc 
gtgccatgag aatggaagtt gtttttgtta attattaaat 
aaaaat 
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35 

aaa cct gta atg get 202 
Lys Pro Val Met Ala 
50 

tat att ggt tat eta 250 
Tyr lie Gly Tyr Leu 
65 

gaa get att gat agt 298 
Glu Ala lie Asp Ser 
80 

aaa tgg gat 340 
Lys Trp Asp 

95 

tctgaaatct tcaaatgaaa 400 
catgctaatg aagagagaag 460 
ttcacagctg cagceattat 520 
tttccttace tcttataaaa 580 
tctgtataat aaaaaaaaaa 640 

646 



<210> 132 
<211> 575 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 177. .470 



<220> 

<221> polyA_site 
<222> 561. .575 

<220> 

<221> polyA_signal 
<222> 543. .548 



<400> 132 

atcgaccacg gaaatttgac acctccggge ttggaagcag ctetctcctc cttccccget 60 
gcttataaac etcagcectg aggctccagc tcactctacc ecatctcett gccgggtcag 120 
ccctgacaaa ggtcagctag ccccttgagg acatcagctt tggcctcagg gtccta atg 179 

Met 
1 



gca 


gca 


gaa 


cca 


ctg 


aca 


gag 


eta 


gag 


gag 


tec 


att 


gag 


aac 


gtg 


gtc 


227 


Ala 


Ala 


Glu 


Pro 
5 


Leu 


Thr 


Glu 


Leu 


Glu 
10 


Glu 


Ser 


He 


Glu 


Asn 
15 


Val 


Val 




ace 


acc 


ttc 


ttc 


acc 


ttt 


gca 


agg 


eag 


gag 


ggc 


egg 


aak 


gat 


age 


etc 


275 


Thr 


Thr 


Phe 


Phe 


Thr 


Phe 


Ala 


Arg 


Gin 


Glu 


Gly Arg 


Xaa 


Asp 


Ser 


Leu 








20 










25 










30 










age 


gtc 


aac 


gag 


ttc 


aaa 


gag 


ctg 


gtt 


ace 


eag 


cag 


ttg 


cec 


cat 


ctg 


323 


Ser 


Val 
35 


Asn 


Glu 


Phe 


Lys 


Glu 
40 


Leu 


Val 


Thr 


Gin 


Gin 
45 


Leu 


Pro 


His 


Leu 




etc 


aag 


gat 


gtg 


ggc 


tet 


ctt 


gat 


gag 


aag 


atg 


aag 


age 


ttg 


gat 


gtg 


371 


Leu 


Lys 


Asp 


Val 


Gly 


Ser 


Leu 


Asp 


Glu 


Lys 


Met 


Lys 


Ser 


Leu 


Asp 


Val 




50 










55 










60 










65 




aat 


eag 


gac 


teg 


gag 


etc 


aag 


ttc 


aat 


gag 


tac 


tgg 


aga 


ttg 


att 


ggg 


419 


Asn 


Gin 


Asp 


Ser 


Glu 
70 


Leu 


Lys 


Phe 


Asn 


Glu 
75 


Tyr 


Trp 


Arg 


Leu 


He 
80 


Gly 




gag 


ctg 


gee 


aag 


gaa 


ate 


agg 


aag 


aag 


aaa 


gac 


ctk 


aag 


ate 


agg 


aag 


4 67 


Glu 


Leu 


Ala 


Lys 


Glu 


He 


Arg 


Lys 


Lys 


Lys 


Asp Leu 


Lys 


He 


Arg 


Lys 










85 










90 










95 








aag 


taaagecgcc tggctgagat ggggtgggca gggcagaget 


gatcagggcc 




520 



Lys 
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gagcagaacc gcactcttcc caaataaagc ttcctccttg aaaaaaaaaa aaaaa 575 

<210> 133 
<211> 458 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 75. .419 

<400> 133 

ctttcctttc tcgttccccg gccatcttag cggctgctgt tggttggggg ccgtcccgct 60 

cctaaggcag gaag atg gtg gcc gca aag aag acg aaa aag teg ctg gag 110 

Met Val Ala Ala Lys Lys Thr Lys Lys Ser Leu Glu 
15 10 

teg ate aaa tct agg etc eaa cte gtt atg aaa agt ggg aag tac gtc 158 

Ser lie Lys Ser Arg Leu Gin Leu Val Met Lys Ser Gly Lys Tyr Val 

15 20 25 

ctg ggg tac aag cag act ctg aag atg ate aga caa ggc aaa gcg aaa 206 

Leu Gly Tyr Lys Gin Thr Leu Lys Met He Arg Gin Gly Lys Ala Lys 

30 35 40 

ttg gtc att etc get aac aac tge cea get ttg agg aaa tct gaa ata 254 

Leu Val He Leu Ala Asn Asn Cys Pro Ala Leu Arg Lys Ser Glu He 
45 50 55 60 



gag 


tac 


tat get atg 


ttg 


get 


aaa 


act 


ggt 


gtc 


cat 


cac 


tac 


agt 


ggc 


302 


Glu Tyr 


Tyr Ala Met 


Leu 


Ala 


Lys 


Thr 


Gly 


Val 


His 


His 


Tyr 


Ser 


Gly 








65 










70 










75 






aat 


aat 


att gaa ctg 


ggc 


aca 


gca 


tge 


gga 


aaa 


tac 


tac 


aga 


gtg 


tge 


350 


Asn 


Asn 


He Glu Leu 


Gly 


Thr 


Ala 


Cys 


Gly 


Lys 


Tyr 


Tyr 


Arg 


Val 


Cys 








80 








85 










90 








aca 


ctg 


get ate att 


gat 


cea 


rgt 


gac 


tct 


kac 


ate 


att 


aga 


age 


atg 


398 


Thr 


Leu 


Ala He He 


Asp 


Pro 


Xaa 


Asp 


Ser 


Xaa 


He 


He 


Arg 


Ser 


Met 








95 






100 










105 










cea 


gaa 


cag act ggt 


gaa 


aag 


taaacctttt cacctacaaa atttcacctg 


449 


Pro 


Glu 


Gin Thr Gly 


Glu 


Lys 























110 115 
caaacctta 458 

<210> 134 
<211> 588 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 144. .449 



<220> 

<221> polyA_site 
<222> 573. .588 



<220> 

<221> polyA_signal 
<222> 552. .557 



<400> 134 

gaggggcgag ggggctagcg gcgacggctg gggcgagcgc gectgcgcge tgggtgattt 60 
tttcacgtgt cgccagggcc ggactgcgag tctctttgcg gcgctacact agagcagagt 120 
aegagtctga ggcggaggga gta atg gca gga caa gcg ttt aga aag ttt ett 173 

Met Ala Gly Gin Ala Phe Arg Lys Phe Leu 
15 10 
cea etc ttt gac cga gta ttg gtt gaa agg agt cct get gaa act gta 221 



wo 02/083898 PCT/IBO 1/009 14 

104 



Pro 


Leu 


Phe 


Asp 


Arg 


Val 


Leu 


Val 


Glu 


Arg 


Ser 


Pro 


Ala 


Glu 


Thr 


Val 












15 










20 










25 






acc 


aaa 


gga 


ggc 


att 


atg 


ctt 


cca 


gaa 


aaa 


tct 


caa 


gga 


aaa 


gta 


ttg 


269 


Thr 


Lys 


Gly 


Gly 


He 


Met 


Leu 


Pro 


Glu 


Lys 


Ser 


Gin 


Gly 


Lys 


Val 


Leu 










30 










35 










40 








caa 


gca 


aca 


gta 


gtc 


get 


gtt 


gga 


teg 


ggt 


tct 


aaa 


gga 


aag 


ggt 


gga 


317 


Gin 


Ala 


Thr 


Val 


Val 


Ala 


Val 


Gly 


Ser 


Gly 


Ser 


Lys 


Gly 


Lys 


Gly 


Gly 








45 










50 










55 










gag 


att 


caa 


cca 


gtt 


age 


gtg 


aaa 


gtt 


gga 


gat 


aaa 


gtt 


ctt 


etc 


cca 


365 


Glu 


lie 


Gin 


Pro 


Val 


Ser 


Val 


Lys 


Val 


Gly 


Asp Lys 


Val 


Leu 


Leu 


Pro 






60 










65 










70 












gaa 


tat 


gga 


ggc 


acc 


aaa 


gta 


gtt 


yta 


gat 


aac 


aag 


gat 


tat 


ttc 


eta 


413 


Glu Tyr 


Gly 


Gly 


Thr 


Lys 


Val 


Val 


Leu 


Asp 


Asn Lys 


Asp 


Tyr 


Phe 


Leu 




75 










80 










85 










90 




ttt 


aga 


gat 


ggt 


gac 


att 


ctt 


gga 


aag 


tac 


gta 


gac 


tgaaataagt 




459 


Phe Arg 


Asp 


Gly 


Asp 


He 


Leu 


Gly 


Lys 


Tyr 


Val 


Asp 













95 100 



caetattgaa atggcatcaa catgatgctg cccattccac tgaagttetg aaatctttcg 519 
teatgtaaat aatttccata tttctctttt ataataaact aatgataact aacaaaaaaa 579 
aaaaaaaaa 588 

<210> 135 
<211> 538 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 93.-398 

<220> 

<221> polyA_site 
<222> 523,. 538 

<220> 

<221> polyA_signal 
<222> 501. .506 



<400> 135 

gggtgatttt ttcacgtgtc gccagggccg gactgcgagt ctetttgcgg cgetacaeta 60 
gagcagagta cgagtctgag gcggagggag ta atg gca gga caa gcg ttt aga 113 





















Met 
1 


Ala 


Gly 


Gin 


Ala 

5 


Phe 


Arg 




aag 


ttt 


ctt 


cca 


etc 


ttt 


gac 


cga 


gta 


ttg 


gtt 


gaa 


agg 


agt 


get 


get 


.161 


Lys 


Phe 


Leu 


Pro 


Leu 


Phe 


Asp 


Arg 


Val 


Leu 


Val 


Glu 


Arg 


Ser 


Ala 


Ala 








10 










15 










20 










gaa 


act 


gta 


acc 


aaa 


gga 


ggc 


att 


atg 


ctt 


cca 


gaa 


aaa 


tct 


caa 


gga 


209 


Glu 


Thr 


Val 


Thr 


Lys 


Gly 


Gly 


He 


Met 


Leu 


Pro 


Glu 


Lys 


Ser 


Gin 


Gly 






25 










30 










35 












aaa 


gta 


ttg 


caa 


gca 


aca 


gta 


gtc 


get 


gtt 


gga 


teg 


ggt 


tct 


aaa 


gga 


257 


Lys 


Val 


Leu 


Gin 


Ala 


Thr 


Val 


Val 


Ala 


Val 


Gly 


Ser 


Gly 


Ser 


Lys 


Gly 




40 










45 










50 










55 




aag 


ggt 


gga 


gag 


att 


caa 


cca 


gtt 


age 


gtg 


aaa 


gtt 


gga 


gat 


aaa 


gtt 


305 


Lys 


Gly 


Gly 


Glu 


He 


Gin 


Pro 


Val 


Ser 


Val 


Lys 


Val 


Gly 


Asp 


Lys 


Val 












60 










65 










70 






ctt 


etc 


cca 


gaa 


tat 


gga 


ggc 


acc 


aaa 


gta 


gtt 


eta 


gat 


gac 


aag 


gat 


353 


Leu 


Leu 


Pro 


Glu 


Tyr 


Gly 


Gly 


Thr 


Lys 


Val 


Val 


Leu 


Asp 


Asp 


Lys 


Asp 










75 










80 










85 








tat 


ttc 


eta 


ttt 


aga 


gat 


ggt 


gac 


att 


ctt 


gga 


aag 


tac 


gta 


gac 




398 


Tyr 


Phe 


Leu 


Phe 


Arg 


Asp 


Gly 


Asp 


He 


Leu 


Gly Lys 


Tyr 


Val 


Asp 










90 










95 










100 











tgaaataagt caetattgaa atggcatcaa catgatgctg cccattccac tgaagttetg 458 
aaatctttcg teatgtaaat aatttccata tttctctttt ataataaact aatgataact 518 



wo 02/083898 
aacgaaaaaa aaaaaaaaaa 



105 



PCT/IBOl/00914 
538 



<210> 136 
<211> 912 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 97. .444 



<220> 

<221> polyA site 
<222> 897.. 912 



<400> 136 

atctttccgt cccgggcagc cagcgccagt cggagccagc 
ctgccgctgc caagtcctcc acccgctgcc cccgcc atg 

Met 
1 

acg gcc ccc cct get gcc ccg get ggg gag ggt 
Thr Ala Pro Pro Ala Ala Pro Ala Gly Glu Gly 

10 15 
cct oca aac etc ace agt aac agg aga etg eag 
Pro Pro Asn Leu Thr Ser Asn Arg Arg Leu Gin 

25 30 
gtg gat gag gtg gtg gac ate atg agg gtg aac 
Val Asp Glu Val Val Asp He Met Arg Val Asn 

40 45 
gag ega gac cag aag ctg teg gag etg gac gac 
Glu Arg Asp Gin Lys Leu Ser Glu Leu Asp Asp 
55 60 65 

cag gcg ggg ccc tec cag ttt gaa aca age gea 
Gin Ala Gly Pro Ser Gin Phe Glu Thr Ser Ala 

75 80 
aaa tac tgg tgg aaa aac etc aag atg atg ate 
Lys Tyr Trp Trp Lys Asn Leu Lys Met Met lie 

90 95 
tge gee ate ate etc ate ate ate ata gtt tac 
Cys Ala lie lie Leu He He He He Val Tyr 

105 110 
taaatccccg aggagtctgc cctgcctaga gaagggcetc 
ectccaccte teagecatat ctttcagccc cecctcccet 
ccgtgtgtgt gtccccctgt aaatagccag ctgttattta 
tatttggtct gtttgtagtt ttattactag atgattttte 
tectgaggtt eccttcacec ctetctcttg ecttccttec 
agecccaagt cecttcattt gcatctgcta tgcaatagtc 
ecctcagatt tagctgatcc tteetcccac cctggccttc 
tetcccegtc atgctccctc tgcecegccc tcaaaaaaaa 



gcgagcegee gecgecatea 
tct get ace get gcc 
Ser Ala Thr Ala Ala 
5 

ggt ccc cct gea ccc 
Gly Pro Pro Ala Pro 

20 

cag ace cag gcc cag 
Gin Thr Gin Ala Gin 
35 

gtg gac aag gtc ctg 
Val Asp Lys Val Leu 
50 

cgt gea gat gea etc 
Arg Ala Asp Ala Leu 
70 

gcc aag etc aag cge 
Ala Lys Leu Lys Arg 
85 

ate ttg gga gtg att 
He Leu Gly Val He 
100 

ttc age act 
Phe Ser Thr 
115 

tceeccaacc ctcagccgtt 
ggatecgtgt gtgtgtgtgt 
tacatatata atattatata 
cggttgtcct taacacccct 
cttteeettt ettcctgact 
ectctccttt ccttcttctt 
ctttcctctt tcctcctcac 
aaaaaaaa 



60 
114 



162 



210 



258 



306 



354 



402 



444 



504 
564 
624 
684 
744 
804 
864 
912 



<210> 137 
<211> 562 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 
<222> 45.. 335 



<220> 

<221> polyA_site 
<222> 545.. 562 
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<220> 

<221> polyA__signal 
<222> 526. .531 

<400> 137 

tgttattttg tagctttaat gatttttgtt ttctaatagg gcaa atg tct eta age 56 

Met Ser Leu Ser 



ttg gtg 


ttt 


aga 


get 


get 


tea 


tat 


ttt 


aaa 


eta 


gtt 


1 

cca 


ttc 


cae 


agt 


104 


Leu 


Val 


Phe 


Arg 


Ala 


Ala 


Ser 


Tyr 


Phe 


Lys 


Leu 


Val 


Pro 


Phe 


His 


Ser 




5 










10 










15 










20 




tct 


agt 


tea 


aac 


cao 


ttt 


tta 


caa 


ect 


cet 


ggg 


tgg 


gtc 


gtc 


ttg 


aec 


152 


Ser 


Ser 


Ser 


Asn 


Gin 


Phe 


Leu 


Gin 


Pro 


Pro 


Gly Trp 


Val 


Val 


Leu 


Thr 












25 










30 










35 






caa 


act 


ctt 


gtg 


ttg 


tta 


cat 


ttt 


gag 


agg 


ttt 


tea 


tac 


cag 


aat 


gta 


200 


Gin 


Thr 


Leu 


Val 
40 


Leu 


Leu 


His 


Phe 


Glu 
45 


Arg 


Phe 


Ser 


Tyr 


Gin 
50 


Asn 


Val 




cca 


aaa 


agt 


gca 


caa 


ggt 


aaa 


ggt 


aat 


tta 


cag 


cca 


gaa 


aca 


aat 


ata 


248 


Pro Lys 


Ser 


Ala 


Gin 


Gly 


Lys 


Gly 


Asn 


Leu 


Gin 


Pro 


Glu 


Thr 


Asn 


He 








55 










60 










65 










cat 


ttg 


ttt 


cat 


ttc 


ttg 


act 


ttc 


ect 


aag 


cag 


ata 


age 


aga 


aac 


ttg 


296 


His 


Leu 
70 


Phe 


His 


Phe 


Leu 


Thr 

75 


Phe 


Pro 


Lys 


Gin 


He 
80 


Ser 


Arg 


Asn 


Leu 




ttt 


aac 


tea 


tta 


ctt 


tgt 


ttg 


atg 


tgt 


ctt 


aca 


tat 


ttt 


tgactaaaag 


345 


Phe 


Asn 


Ser 


Leu 


Leu 


Cys 


Leu 


Met 


Cys 


Leu 


Thr 


Tyr 


Phe 










85 










90 










95 















ttatagaatg ttattcctct gggggaatet tttaeaggtg gaggaatggg gatagcagta 405 

ttgcctcaga ttcaaactgg catcaceata aaccctgtag gccarggtgg aatgaagtca 4 65 

gctccttttt atagttgaaa tacaattttt tattcaccat tgtetgeaca aatccttgaa 525 

aataaacctt tttttcccta caaaaaaaaa aaaaaaa 562 

<210> 138 
<211> 566 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 32,. 361 

<220> 

<221> polyA_site 
<222> 551. .566 

<220> 

<221> polyA_signal 
<222> 532, .537 

<400> 138 

aaettgctct gcccgeteaa acagttgcag g atg teg atg aca gac ttg ctg 52 

Met Ser Met Thr Asp Leu Leu 



aac 


get 


gag 


gac 


ate 


aag 


aag 


gcg 


1 

gtg gga 


gee 


ttt 


age 


get 


aec 


gac 


100 


Asn 


Ala 


Glu 


Asp 


He 


Lys 


Lys 


Ala 


Val Gly 


Ala 


Phe 


Ser 


Ala 


Thr Asp 








10 










15 








20 










tec 


ttc 


gac 


cae 


aaa 


aag 


ttc 


ttc 


caa atg 


gtc 


ggc 


ctg 


aag 


aaa 


aag 


148 


Ser 


Phe 
25 


Asp 


His 


Lys 


Lys 


Phe 
30 


Phe 


Gin Met 


Val 


Gly 
35 


Leu 


Lys 


Lys 


Lys 




agt 


gcg 


gat 


gat 


gtg 


aag 


aag 


gtg 


ttt cae 


atg 


ctg 


gac 


aag 


gac 


aaa 


196 


Ser 


Ala 


Asp 


Asp 


Val 


Lys 


Lys 


Val 


Phe His 


Met 


Leu 


Asp 


Lys 


Asp 


Lys 




40 










45 








50 










55 




agt 


ggc 


ttc 


ate 


gag 


gag 


gat 


gag 


ctg gga 


ttc 


ate 


eta 


aaa 


ggc 


ttc 


244 


Ser Gly 


Phe 


He 


Glu 


Glu 


Asp Glu 


Leu Gly 


Phe 


He 


Leu Lys 


Gly 


Phe 
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60 65 70 

tec cca gat gcc aga gac ctg tct get aaa gaa acc aag atg ctg atg 292 
Ser Pro Asp Ala Arg Asp Leu Ser Ala Lys Glu Thr Lys Met Leu Met 

75 80 85 

get get gga gac aaa gat ggg gac ggc aaa att ggg gtt gac gaa ttc 340 
Ala Ala Gly Asp Lys Asp Gly Asp Gly Lys lie Gly Val Asp Glu Phe 

90 95 100 

tee act ctg gtg get gaa age taagaagcac tgactgcccc tggtcttcca 391 
Ser Thr Leu Val Ala Glu Ser 
105 110 
cetctctgce etcaacaecc aateteggcc ectcttgcca ccctcctgca tttctgttca 451 
gttcgtttat gttatttttt actcccccat cccctgtggc cctctaatga caccattctt 511 
ctggaaaatg ctggagaagc aataaaggtt gtaccagtca aaaaaaaaaa aaaaa 566 

<210> 139 
<211> 567 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 31.. 360 

<220> 

<221> polyA_site 
<222> 552.. 567 

<220> 

<221> polyA_signal 
<222> 531. .536 

<400> 139 

aettgctctg cccgctcaaa cagttgcagg atg teg atg aca gac ttg ctg aac 54 

Met Ser Met Thr Asp Leu Leu Asn 



get gag 


gac 


ate 


aag 


aag 


geg 


gtg 


1 

gga 


gee 


ttt 


age 


5 

get 


ace 


gac 


tee 


102 


Ala Glu 


Asp 


He 


Lys 


Lys 


Ala 


Val 


Gly 


Ala 


Phe 


Ser 


Ala 


Thr 


Asp 


Ser 




10 










15 










20 












ttc gac 


cac 


aaa 


aag 


ttc 


ttc 


caa 


atg 


gtc 


ggc 


ctg 


aag 


aaa 


aag 


agt 


150 


Phe Asp 


His 


Lys 


Lys 


Phe 


Phe 


Gin 


Met 


Val 


Gly 


Leu 


Lys 


Lys 


Lys 


Ser 




25 








30 










35 










40 




geg gat 


gat 


gtg 


aag 


aag 


gtg 


ttt 


cac 


atg 


ctg 


gac 


aag 


gac 


aaa 


agt 


198 


Ala Asp 


Asp 


Val 


Lys 
45 


Lys 


Val 


Phe 


His 


Met 
50 


Leu 


Asp 


Lys 


Asp 


Lys 
55 


Ser 




gge ttc 


ate 


gag 


gag 


gat 


gag 


ctg 


gga 


ttc 


ate 


eta 


aaa 


gge 


ttc 


tec 


246 


Gly Phe 


lie 


Glu 
60 


Glu 


Asp 


Glu 


Leu 


Gly 
65 


Phe 


He 


Leu 


Lys 


Gly 
70 


Phe 


Ser 




cca gat 


gee 


aga 


gac 


ctg 


tct 


get 


aaa 


gaa 


acc 


aag 


atg 


ctg 


atg 


get 


294 


Pro Asp 


Ala 

75 


Arg 


Asp 


Leu 


Ser 


Ala 

80 


Lys 


Glu 


Thr 


Lys 


Met 

85 


Leu 


Met 


Ala 




get gga 


gac 


aaa 


gat 


ggg 


gac 


gge 


aaa 


att 


ggg 


gtt 


gac 


gaa 


ttc 


tec 


342 


Ala Gly 


Asp 


Lys 


Asp 


Gly 


Asp 


Gly 


Lys 


He 


Gly 


Val 


Asp 


Glu 


Phe 


Ser 




90 










95 










100 












act ctg 


gtg 


get 


gaa 


age 


taagaagcac tgactgcccc tggtcttcca 




390 


Thr Leu 


Val 


Ala 


Glu 


Ser 
























105 








110 

























cetctctgce etgaacacce aateteggcc cetetegcea ccctcctgca tttctgttca 450 
gttcgtttat gttatttttt actcececat cccctgtggc cctctaatga caccattctt 510 
ctggaaaatg ctggagaagc aataaaggtt gtaccagtca caaaaaaaaa aaaaaaa 567 



<210> 140 
<211> 566 
<212> DNA 
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<220> 
<221> CDS 
<222> 32. .361 



<220> 

<221> polyA_site 
<222> 551. .566 



<220> 

<221> polyA__signal 
<222> 532. .537 



<400> 140 

aacttgctct gcccgctcaa acagttgcag g atg teg atg aca gac ttg ctg 52 

Met Ser Met Thr Asp Leu Leu 
1 5 



aac 


get 


gag 


gac 


ate 


aag 


aag 


gcg 


gtg 


gga 


gcc 


ttt 


age 


get 


ace 


gac 


100 


Asn 


Ala 


Glu 


Asp 


He 


Lys 


Lys 


Ala 


Val 


Gly 


Ala 


Phe 


Ser 


Ala 


Thr 


Asp 








10 










15 










20 










tec 


ttc 


gac 


cac 


aaa 


aag 


ttc 


ttc 


caa 


atg 


gtc 


ggc 


ctg 


aag 


aaa 


aag 


148 


Ser 


Phe 


Asp 


His 


Lys 


Lys 


Phe 


Phe 


Gin 


Met 


Val 


Gly 


Leu 


Lys 


Lys 


Lys 






25 










30 










35 












agt 


gcg 


gat 


gat 


gtg 


aag 


aag 


gtg 


ttt 


cac 


atg 


ctg 


gac 


aag 


gac 


aaa 


196 


Ser 


Ala 


Asp 


Asp 


Val 


Lys 


Lys 


Val 


Phe 


His 


Met 


Leu 


Asp 


Lys 


Asp 


Lys 




40 










45 










50 










55 




agt 


gge 


ttc 


ate 


gag 


gag 


gat 


gag 


ctg 


gga 


ttc 


ate 


eta 


aaa 


ggc 


ttc 


244 


Ser 


Gly 


Phe 


He 


Glu 


Glu 


Asp 


Glu 


Leu 


Gly 


Phe 


He 


Leu 


Lys 


Gly 


Phe 












60 










65 










70 






tec 


cea 


gat 


gcc 


aga 


gac 


ctg 


tct 


get 


aaa 


gaa 


ace 


aag 


atg 


ctg 


atg 


292 


Ser 


Pro 


Asp 


Ala 


Arg 


Asp 


Leu 


Ser 


Ala 


Lys 


Glu 


Thr 


Lys 


Met 


Leu 


Met 










75 










80 










85 








get 


get 


gga 


gac 


aaa 


gat 


ggg 


gac 


ggc 


aaa 


att 


ggg 


gtt 


gac 


gaa 


ttc 


340 


Ala 


Ala 


Gly 


Asp 


Lys 


Asp 


Gly 


Asp 


Gly 


Lys 


He 


Gly 


Val 


Asp 


Glu 


Phe 








90 










95 










100 










tec 


act 


ctg 


gtg 


get 


gaa 


age 


taagaagcae tgactgcccc tggtcttcca 


391 


Ser 


Thr 


Leu 


Val 


Ala 


Glu 


Ser 























105 110 

ectctctgee cteaacacce aatcteggee cetcttgeca cectcctgea tttctgttca 451 

gttcgtttat gttatttttt actcccecat cccctgtgge cetctaatga caccattctt 511 

ctggaaaatg ctggagaagc aataaaggtt gtaccagtca aaaaaaaaaa aaaaa 566 



<210> 141 
<211> 625 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 85. .420 



<220> 

<221> polyA_site 
<222> 610. .625 



<400> 141 

cecttctgeg cggtcacgcc gagccagcgc ctgggcctgg aaccgggccg tagcccccce 60 
agtttcgcec accacctccc tacc atg gac cec egc aaa gtg aac gak ett 111 

Met Asp Pro Arg Lys Val Asn Xaa Leu 

1 5 

egg gcc ttt gtg aaa atg tgt aag cag gat ccg age gtt ctg cac acc 159 
Arg Ala Phe Val Lys Met Cys Lys Gin Asp Pro Ser Val Leu His Thr 
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10 










15 










20 










25 




gag 


gaa 


atg 


cge 


ttc 


ctg 


agg 


gas 


tgg 


gtg 


gag 


age 


atg 


ggt 


ggt 


aaa 


207 


Glu 


Glu 


Met 


Arg 


Phe 
30 


Leu 


Arg 


Xaa 


Trp 


Val 

35 


Glu 


Ser 


Met 


Gly 


Gly 
40 


Lys 




gta 


cca 


cct 


get 


act 


caa 


aaa 


get 


aaa 


tea 


gaa 


gaa 


aat 


acc 


aag 


gaa 


255 


Val 


Pro 


Pro 


Ala 
45 


Thr 


Gin 


Lys 


Ala 


Lys 
50 


Ser 


Glu 


Glu 


Asn 


Thr 

55 


Lys 


Glu 




gaa 


aaa 


cet 


gat 


agt 


aag 


aag 


gtg 


gag 


gaa 


gac 


tta 


aag 


gca 


gac 


gaa 


303 


Glu 


Lys 


Pro 
60 


Asp 


Ser 


Lys 


Lys 


Val 
65 


Glu 


Glu 


Asp 


Heu 


Lys 
70 


Ala 


Asp 


Glu 




cca 


tea 


agt 


gag 


gaa 


agt 


gat 


eta 


gaa 


att 


gat 


aaa 


gaa 


ggt 


gtg 


att 


351 


Pro 


Ser 
75 


Ser 


Glu 


Glu 


Ser 


Asp 
80 


Leu 


Glu 


He 


Asp 


Lys 

85 


Glu 


Gly 


Val 


He 




gaa 


cca 


gac 


act 


gat 


get 


cct 


caa 


gaa 


atg 


gga 


gat 


gaa 


aat 


gcg 


gag 


399 


Glu 


Pro 


Asp 


Thr 


Asp 


Ala 


Pro 


Gin 


Glu 


Met 


Gly 


Asp 


Glu 


Asn 


Ala 


Glu 




90 










95 










100 










105 




ata 


a eg 


get 


caa 


cat 


ttt 


ttg 


tagccacagg cttccacgga cttcacgtca 


450 


He 


Thr 


Ala 


Gin 


His 


Phe 


Leu 























110 

ttattggctc aactttecte actatetget teatccgcea aetaatattt caetttaeat 510 

ccaaacatca ctttggctte gaagcegceg cetgataetg geattttgta gatgtggttt 570 

gactatttct gtatgtctcc atctattgat gagggtctta aaaaaaaaaa aaaat 625 



<210> 142 
<211> 685 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 22.. 375 

<220> 

<221> polyA_site 
<222> 670. .685 

<220> 

<221> polyA_signal 
<222> 650. .655 

<400> 142 



agtegagcea gcgtegccgc g 


atg 


gtg 


ttg 


ttg 


gag 


age 


gag 


eag 


ttc 


ctg 


51 














Met 


Val 


Leu 


Leu 


Glu 


Ser 


Glu 


Gin 


Phe 


Leu 
















1 








5 










10 




acg 


gag 


ctg 


ace 


aga 


ctt 


ttc 


cag 


aag 


tge 


egg 


aeg 


teg 


ggc 


age 


gte 


99 


Thr 


Glu 


Leu 


Thr 


Arg 


Leu 


Phe 


Gin 


Lys 


Cys 


Arg 


Thr 


Ser 


Gly 


Ser 


Val 












15 










20 










25 






tat 


ate 


acc 


ttg 


aag 


aag 


tat 


gac 


ggt 


cga 


acc 


aaa 


ccc 


att 


cca 


aag 


147 


Tyr 


He 


Thr 


Leu 


Lys 


Lys 


Tyr 


Asp 


Gly 


Arg 


Thr 


Lys 


Pro 


He 


Pro 


Lys 










30 










35 










40 








aaa 


ggt 


act 


gtg 


gag 


gge 


ttt 


gag 


cee 


gca 


gac 


aac 


aag 


tgt 


ctg 


tta 


195 


Lys 


Gly 


Thr 


Val 


Glu 


Gly 


Phe 


Glu 


Pro 


Ala 


Asp 


Asn 


Lys 


Cys 


Leu 


Leu 








45 










50 










55 










aga 


get 


acc 


gat 


ggg 


aag 


aag 


aag 


ate 


age 


act 


gtg 


gtg 


age 


tec 


aag 


243 


Arg 


Ala 


Thr 


Asp 


Gly 


Lys 


Lys 


Lys 


He 


Ser 


Thr 


Val 


Val 


Ser 


Ser 


Lys 






60 










65 










70 












gaa 


gtg 


aat 


aag 


ttt 


eag 


atg 


get 


tat 


tea 


aac 


etc 


ctt 


aga 


get 


aac 


291 


Glu 


Val 


Asn 


Lys 


Phe 


Gin 


Met 


Ala 


Tyr 


Ser 


Asn 


Leu 


Leu 


Arg 


Ala 


Asn 




75 










80 










85 










90 




atg 


gat 


ggg 


ttg 


aag 


aag 


aga 


gac 


aaa 


aag 


aac 


gaa 


act 


aag 


aag 


ace 


339 


Met 


Asp 


Gly 


Leu 


Lys 


Lys 


Arg 


Asp 


Lys 


Lys 


Asn 


Glu 


Thr 


Lys 


Lys 


Thr 












95 










100 










105 






aaa 


gca 


gca 


gca 


gca 


gca 


gca 


gca 


aea 


gca 


gca 


cag 


taaagggcat 




385 
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110 

Lys Ala Ala Ala Ala Ala Ala Ala Thr Ala Ala Gin 

110 115 
acatttcctg ctttcaccaa ttaaccactg aattgctatt ttttcctttt ggccagatag 445 
ctaggtttct ggttccccca cagtaggtgt tttcacataa gattagggtc cttttggaaa 505 
gaatagttgc agtgtttata ggatagttgt ggtaagaatc tagtttattt tgcatttggc 565 
taattggtct gtgctgcatg gttatatact cctggattat agattaaaag tctctgtaga 625 
catctctgtg aagagcaagc tatcattaaa catgtctgtt tatcaaaaaa aaaaaaaaaa 685 

<210> 143 
<211> 553 
<212> DNA , 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 9, .335 

<220> 

<221> polyA_site 
<222> 538.. 553 

<220> 

<221> polyA_signal 
<222> 520. .525 

<400> 143 

tcaagaat atg gat agt tta cct tct ggt aaa ata cat cga aaa gtg aaa 50 

Met Asp Ser Leu Pro Ser Gly Lys lie His Arg Lys Val Lys 
15 10 



ata 


ata 


tta 


gga 


cga 


aat 


aga 


aaa 


gaa 


aat 


ctg 


gaa 


cea 


aat 


get 


gaa 


98 


lie 


He 


Leu 


Gly 


Arg 


Asn 


Arg 


Lys 


Glu 


Asn 


Leu 


Glu 


Pro 


Asn 


Ala 


Glu 




15 










20 










25 










30 




ttt 


gat 


aaa 


aga 


act 


gaa 


ttt 


att 


aca 


caa 


gaa 


gaa 


aac 


aga 


att 


tgt 


146 


Phe Asp 


Lys 


Arg 


Thr 


Glu 


Phe 


He 


Thr 


Gin 


Glu 


Glu 


Asn 


Arg 


He 


Cys 












35 










40 










45 






agt 


tea 


ccg 


gta 


cag 


tct 


tta 


eta 


gac 


ttg 


ttt 


cag 


act 


agt 


gaa 


gag 


194 


Ser 


Ser 


Pro 


Val 
50 


Gin 


Ser 


Leu 


Leu 


Asp 
55 


Leu 


Phe 


Gin 


Thr 


Ser 
60 


Glu 


Glu 




aaa 


tea 


gaa 


ttt 


ttg 


ggt 


ttc 


aca 


age 


tac 


aca 


gaa 


aag 


agt 


ggt 


ata 


242 


Lys 


Ser 


Glu 
65 


Phe 


Leu 


Gly 


Phe 


Thr 

70 


Ser 


Tyr 


Thr 


Glu 


Lys 
75 


Ser 


Gly 


He 




tgc 


aat 


gtt 


tta 


gat 


att 


tgg 


gaa 


gag 


gaa 


aat 


tea 


gat 


aat 


ctg 


tta 


290 


Cys 


Asn 
80 


Val 


Leu 


Asp 


He 


Trp 
85 


Glu 


Glu 


Glu 


Asn 


Ser 
90 


Asp 


Asn 


Leu 


Leu 




aca 


gcg 


ttt 


ttc 


teg 


tec 


cct 


tea 


act 


tct 


aca 


ttt 


act 


ggc 


ttt 




335 


Thr 


Ala 


Phe 


Phe 


Ser 


Ser 


Pro 


Ser 


Thr 


Ser 


Thr 


Phe 


Thr 


Gly 


Phe 






95 










100 










105 















tagaatttaa aaaatgcata cttttcagaa gtgataagga teatattett gaaattttta 395 

taaatatgta tggaaattct taggattttt ttaecagctt tgtttacaga cccaaatgta 455 

aatattaaaa ataaatattt geaattttet acagaattga atacctgtta aagaaaaatt 515 

acagaataaa cttgtgactg gtaaaaaaaa aaaaaaat 553 

<210> 144 
<211> 631 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 156.. 464 



<220> 

<221> polyA_site 
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<222> 610.. 631 

<220> 

<221> polyA_signal 
<222> 559.. 564 

<400> 144 

aattcatatc tgagtgggcg gtggcgattg gtgttggcgg tctggctcag ctgggcaggg 60 

ggtaacttta ctgatttggg ggtggttttt agtttaattt ttcttttcta gcttcccatc 120 

gacggtcagt gcgcacgttg taatcagctg aggcc atg tea gga gac gga gcc 173 

Met Ser Gly Asp Gly Ala 

1 5 



acg 


gag 


cag 


gca 


get 


gag 


tat 


gtc 


cca 


gag 


aag 


gtg 


aag 


aaa 


geg 


gaa 


221 


Thr 


Glu 


Gin 


Ala 

10 


Ala 


Glu 


Tyr 


Val 


Pro 

15 


Glu 


Lys 


Val 


Lys 


Lys 
20 


Ala 


Glu 




aag 


aaa 


tta 


gaa 


gag 


aat 


cca 


tat 


gac 


ctt 


gat 


get 


tgg 


age 


att 


etc 


269 


Lys 


Lys 


Leu 
25 


Glu 


Glu 


Asn 


Pro 


Tyr 
30 


Asp 


Leu 


Asp 


Ala 


Trp 
35 


Ser 


He 


Leu 




att 


eg a 


gag 


gca 


cag 


aat 


caa 


cct 


ata 


gac 


aaa 


gca 


egg 


aag 


act 


tat 


317 


lie Arg 


Glu 


Ala 


Gin 


Asn 


Gin 


Pro 


He 


Asp 


Lys 


Ala 


Arg 


Lys 


Thr 


Tyr 






40 










45 










50 












gaa 


cgc 


ctt 


gtt 


gcc 


cag 


ttc 


CGC 


agt 


tct 


ggc 


aga 


ttc 


tgg 


aaa 


ctg 


365 


Glu Arg 


Leu 


Val 


Ala 


Gin 


Phe 


Pro 


Ser 


Ser 


Gly 


Arg 


Phe 


Trp 


Lys 


Leu 




55 










60 










65 










70 




tac 


att 


gaa 


gca 


gag 


gtt 


act 


att 


tta 


ttt 


tat 


ttt 


ttc 


tta 


tat 


cag 


413 


Tyr 


He 


Glu 


Ala 


Glu 
75 


Val 


Thr 


He 


Leu 


Phe 
80 


Tyr 


Phe 


Phe 


Leu 


Tyr 
85 


Gin 




tat 


tgc 


age 


att 


cac 


tgt 


agt 


gat 


aga 


aaa 


caa 


gtt 


asg 


aac 


ata 


gcc 


461 


Tyr 


Cys 


Ser 


He 


His 


Cys 


Ser 


Asp 


Arg 


Lys 


Gin 


Val 


Xaa 


Asn 


He 


Ala 





90 95 100 



mat taggacaagg aggatttaaa tgtgtcttae ctttattttg taaaataggt 514 
Xaa 

ataaaggagt aattaaaatg aatttttgaa atttgggtct tttacaaget gatgattgtt 574 

gcattttgga gttgcaacaa cattaaaaea gtttcaatga taaaaaaaaa aaaaaaa 631 

<210> 145 
<211> 635 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 160. .375 

<220> 

<221> polyA_site 
<222> 620.. 635 

<220> 

<221> polyA_signal 
<222> 569.. 574 

<400> 145 

ttttaattca tatetgagtg ggcggtggcg attggtgttg gcggtctggc tcagctgggc 60 
agggggtaac tttactgatt tgggggtggt ttttagttta atttttcttt tetagcttcc 120 
catcgacggt cagtgegcae gttgtaatca gctgaggcc atg tea gga gac gga 174 

Met Ser Gly Asp Gly 
1 5 

gcc acg gag cag gca get gag tat gtc tea gag aag gtg aag aaa geg 222 
Ala Thr Glu Gin Ala Ala Glu Tyr Val Ser Glu Lys Val Lys Lys Ala 

10 15 20 

gaa aag aaa tta gaa gag aat cca tat gac ctt gat get tgg age att 270 
Glu Lys Lys Leu Glu Glu Asn Pro Tyr Asp Leu Asp Ala Trp Ser He 
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112 

25 30 35 

etc att cga gag gca cag aat caa cct ata gac aaa gca egg aag act 318 
Leu He Arg Glu Ala Gin Asn Gin Pro He Asp Lys Ala Arg Lys Thr 

40 45 50 

tat gaa egc ctt gtt gcc cag ttc ccc agt tct ggc aga ttc tgg aaa 366 
Tyr Glu Arg Leu Val Ala Gin Phe Pro Ser Ser Gly Arg Phe Trp Lys 

55 60 65 

ctg tac att taagcagagg ttaetatttt attttatttt ttcttatatc 415 
Leu Tyr He 
70 

agtattgcag eatteactgt agtgatagaa aamaagttas gaacatagcc aattaggaca 475 
aggaggattt aaatttgtct tacetttatt ttgtaaaata ggtataaagg agtaattaaa 535 
atgaattttt gaaatttggg tcttttacaa gctgatgatt gttgcatttt ggagttgeaa 595 
caacattaaa acagtttcaa tggcaaaaaa aaaaaaaaaa 635 

<210> 146 
<211> 733 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 32. .349 

<220> 

<221> polyA site 
<222> 718.. 733 

<220> 

<221> polyA_signal 
<222> 711.. 716 

<400> 146 

actaactage aaacggggac tagaaatagg g atg ctg aaa age aac ggg gag 52 

Met Leu Lys Ser Asn Gly Glu 

1 5 

aga egc agt cgt aac gca ctt ccg gcg gtc tac gcg agg aag atg get 100 
Arg Arg Ser Arg Asn Ala Leu Pro Ala Val Tyr Ala Arg Lys Met Ala 

10 15 20 

gca tec cag cag caa get tea gcg get tec tea get get ggt gta teg 148 
Ala Ser Gin Gin Gin Ala Ser Ala Ala Ser Ser Ala Ala Gly Val Ser 

25 30 35 

ggt cct agt teg get ggc ggc ccg ggt ccc cag cag cag ccg caa ccg 196 
Gly Pro Ser Ser Ala Gly Gly Pro Gly Pro Gin Gin Gin Pro Gin Pro 
40 45 50 55 

cea gca caa ctg gtg ggc cct gcc cag age ggc etc ctg cag caa cag 24 4 

Pro Ala Gin Leu Val Gly Pro Ala Gin Ser Gly Leu Leu Gin Gin Gin 

60 65 70 

caa cag gac ttc gat cct gtg cag cgt tat aag atg etc ate ccg cag 292 
Gin Gin Asp Phe Asp Pro Val Gin Arg Tyr Lys Met Leu He Pro Gin 

75 80 85 

etg aag gag agt eta cag gtg att ggc ctt aag cag cga gaa gca aac 340 
Leu Lys Glu Ser Leu Gin Val He Gly Leu Lys Gin Arg Glu Ala Asn 

90 95 100 

tgg att tgg tagtctagag gggcaggccg rgggecaggt tagteggaga 389 
Trp He Trp 

105 

gagegctaag cagaaagagg aatagaaect gctgttttgg cctggctggt geacgcctgt 44 9 
gttcceagca ctttgggagg ccgaggcggg cggatcactt gaggtcagga gttcaagacc 509 
agcctggtga aacctegtet ctactaaaaa tacaaaaatt ageegggcat ggtggegcgc 569 
gectgtaatt acagctactc gggaagetga ggeaggagaa tcgcttgaac ccaggaggeg 629 
gaggttgeag tgagcegaca tegtgceatt gcacteeagc etgggcaaea gagcgagcte 689 
cgtcaawaaa ataaataaat aaataaataa aaaaaaaaaa aaaa 733 
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<210> 147 
<211> 521 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 111. .368 

<220> 

<221> polyA_site 
<222> 506.. 521 

<220> 

<221> polyA_signal 
<222> 483.. 488 

<400> 147 

tctcagcttc cggctggtag tagttccgct tcctgtccga ctgtggtgtc tttgctgagg 60 
gtcacattga gctgcaggtt gaatccgggg tgcctttagg attcagcacc atg gcg 116 

Met Ala 
1 



gaa gac 


atg 


gag 


ace 


aaa 


ate 


aag 


aac 


tac 


aag 


acc 


gee 


cet 


ttt 


gac 


164 


Glu Asp 


Met 


Glu 


Thr 


Lys 


He 


Lys 


Asn 


Tyr 


Lys 


Thr 


Ala 


Pro 


Phe 


Asp 






5 










10 










15 










age cgc 


tte 


ccc 


aac 


cag 


aac 


cag 


act 


aga 


aac 


tgc 


tgg 


cag 


aac 


tac 


212 


Ser Arg 


Phe 


Pro 


Asn 


Gin 


Asn 


Gin 


Thr 


Arg 


Asn 


Cys 


Trp 


Gin 


Asn 


Tyr 




20 










25 










30 












ctg gac 


tte 


cac 


cgc 


tgt 


cag 


aag 


gca 


atg 


acc 


get 


aaa 


gga 


ggc 


gat 


260 


Leu Asp 


Phe 


His 


Arg 


Cys 


Gin 


Lys 


Ala 


Met 


Thr 


Ala 


Lys 


Gly 


Gly 


Asp 




35 








40 










45 










50 




ate tct 


gtg 


tgc 


gaa 


tgg 


tac 


cag 


cgt 


gtg 


tac 


cag 


tec 


etc 


tgc 


ccc 


308 


lie Ser 


Val 


Cys 


Glu 


Trp 


Tyr 


Gin 


Arg 


Val 


Tyr 


Gin 


Ser 


Leu 


Cys 


Pro 










55 










60 










65 






aca tec 


tgg 


gtc 


aca 


gac 


tgg 


gat 


gag 


caa 


egg 


get 


gaa 


ggc 


acg 


ttt 


356 


Thr Ser 


Trp 


Val 


Thr 


Asp 


Trp 


Asp 


Glu 


Gin 


Arg 


Ala 


Glu 


Gly 


Thr 


Phe 








70 










75 










80 









ccc ggg aag aty tgaactggct gcatetcect ttcctctgtc ctecatcctt 408 
Pro Gly Lys He 

85 

cteccaggat ggtgaagggg gaeetggtac ceagtgatcc ccaccccagg atcctaaatc 4 68 
atgacttacc tgctaataaa aactcattgg aaaagtcaaa aaaaaaaaaa aaa 521 



<210> 


148 


<211> 


547 


<212> 


DNA 


<213> 


Homo sapiens 


<220> 




<221> 


CDS 


<222> 


95. .409 


<220> 




<221> 


polyA site 


<222> 


533. .547 


<220> 




<221> 


polyA signal 


<222> 


509. .514 


<400> 


148 



ttcgaagtgg ttgtctctgc actatttact gaaaagttca tcctttcccc agtgatttgt 60 
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aacactgcct ctttcataaa ttaaattttt gtgt atg tgt gtg ggt ctt ttg atg 115 

Met Cys Val Gly Leu Leu Met 



gtt 


tct 


att 


ctg 


act 


aac 


ate 


aat 


ttg 


1 

tct 


aat 


ctt 


ata 


5 

aca 


ata 


caq 


163 


Val 


Ser 


He 


Leu 


Thr 


Asp 


He 


Asn 


Leu 


Ser 


Asn 


Leu 


Val 


Ala 


Val 


Gin 








10 










15 










20 










tac 


aat 


cct 


cat 


tat 


tac 


aac 


ttt 


aaa 


aaa 


aaa 


tct 


aat 


aaa 


aac 


caa 


211 


Tyr 


Ser 


Pro 


Asp 


Tvr 


Cys 


Asn 


Phe 


Arg 


Lys 


Ara 


Ser 


Asp 


Lys 


Asn 


Gin 






25 










30 










35 












gat 


qcc 


tec 


aca 


ttt 


tqt 


eat 


aat 


tqt 


aac 


caa 


ttt 


cac 


etc 


tqt 


etc 


259 


Asp 


Ala 


Ser 


Thr 


Phe 


Cys 


His 


Asn 


Cys 


Asn 


Gin 


Phe 


His 


Leu 


Cys 


Leu 




40 










45 










50 










55 




cag 


tat 


cac 


cac 


aaa 


att 


gtt 


ctt 


cct 


tgg 


agt 


gtc 


ttg 


get 


att 


ctt 


307 


Gin 


Tyr 


His 


His 


Lys 


He 


Val 


Leu 


Pro 


Trp 


Ser 


Val 


Leu 


Ala 


He 


Leu 












60 










65 










70 






age 


caa 


ctg 


ttc 


etc 


cat 


att 


act 


tct 


aga 


att 


aga 


cea 


aca 


att 


tat 


355 


Ser 


Gin 


Leu 


Phe 


Leu 


His 


He 


Thr 


Ser 


Arg 


He 


Arg 


Pro 


Thr 


He 


Tyr 










75 










80 










85 








aaa 


tea 


aac 


aaa 


ccc 


aag 


age 


att 


gaa 


att 


ttg 


att 


gga 


ttt 


gta 


ttg 


403 


Lys 


Ser 


Asn 


Lys 


Pro 


Lys 


Ser 


He 


Glu 


He 


Leu 


He 


Gly 


Phe 


Val 


Leu 





90 95 100 



aat tta tagattaatc tggtaaacca tgtcatcttt acaatgttgt cttccaatac 459 
Asn Leu 
105 

atgaatatgg tacagctctt catttactta ggcctttgaa atatetttca ataaagttta 519 
taattttctc eataaaaaaa aaaaaaaa 547 



<210> 149 

<211> 3159 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 103. ,405 



<400> 149 

acttccggtg cgaaccgect cggccgttcc ctcgcggact tactgagcgc ggccgccgag 60 
cccagctccg ccgccgagcg cctgtgccgg cacggctaca cc atg gag cgc ccg 114 

Met Glu Arg Pro 

1 



gat 


aag 


gcg 


gcg 


ctg 


aac 


gca 


ctg 


cag 


cct 


cct 


gag 


ttc 


aga 


aat 


gaa 


162 


Asp Lys 


Ala 


Ala 


Leu 


Asn 


Ala 


Leu 


Gin 


Pro 


Pro 


Glu 


Phe 


Arg 


Asn 


Glu 




5 










10 










15 










20 




age 


tea 


tta 


gea 


tct 


aca 


ctg 


aag 


acg 


etc 


ctg 


ttc 


ttc 


aca 


get 


tta 


210 


Ser 


Ser 


Leu 


Ala 


Ser 
25 


Thr 


Leu 


Lys 


Thr 


Leu 
30 


Leu 


Phe 


Phe 


Thr 


Ala 
35 


Leu 




atg 


ate 


act 


gtt 


cct 


att 


ggg 


tta 


tat 


ttc 


aca 


act 


aaa 


tct 


tac 


ata 


258 


Met 


He 


Thr 


Val 

40 


Pro 


He 


Gly 


Leu 


Tyr 
45 


Phe 


Thr 


Thr 


Lys 


Ser 

50 


Tyr 


He 




ttt 


gaa 


ggc 


gcc 


ctt 


ggg 


atg 


tec 


aat 


agg 


gac 


age 


tat 


ttt 


tac 


get 


306 


Phe 


Glu 


Gly 
55 


Ala 


Leu 


Gly 


Met 


Ser 
60 


Asn 


Arg 


Asp 


Ser 


Tyr 
65 


Phe 


Tyr 


Ala 




get 


att 


gtt 


gca 


gtg 


gtc 


gcc 


gte 


cat 


gtg 


gtg 


ctg 


gcc 


etc 


ttt 


gtg 


354 


Ala 


He 
70 


Val 


Ala 


Val 


Val 


Ala 
75 


Val 


His 


Val 


Val 


Leu 
80 


Ala 


Leu 


Phe 


Val 




tat 


gtg 


gcc 


tgg 


aat 


gaa 


ggc 


tea 


ega 


cag 


tgg 


cgt 


gaa 


ggc 


aaa 


cag 


402 


Tyr 


Val 


Ala 


Trp 


Asn 


Glu 


Gly 


Ser 


Arg 


Gin 


Trp 


Arg 


Glu 


Gly 


Lys 


Gin 




85 










90 










95 










100 





gat taaagtgaae atcaeetttt tatagcatta aattcatttt ttaaaatgat 455 
Asp 

aaatgctgga gggggceatc tgatttgaat aaagttgaaa gaaeatgtta aagtcagtct 515 

taaggagtea cgtttgagta tgtaaatttt gatctttcta atatgttggt ttgtatattc 575 
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agttttaact gtatgaatct gatttgcaaa tgagaatttg gaaaagttag ttacaaagaa 635 

atatgttaat ttaattagac aatactctgg aaggaatttt atcttctttc aacaaaacat 695 

gttttatagt attctgactt acggttgctt ttgagtttta ctcatktgga tatattaaga 755 

tgcacacagt gaagcaaatt aaactccact ttacgctgga atgctttctt tagcatgaaa 815 

ataccaggtc cttggatttg ggattttaat ttcctatgga aagttgctta aattgtggac 875 

actggaatta atctgaatgt cactgaggaa tttcacatga agtgtaatcc ctagtcaata 935 

agaattatcc attacattat tttatgggaa aactaggcta aattacatcc attcaggtaa 995 

aaggacctta gcttactgaa ggatctaaag agcaaagcaa agatctcact actcaaacac 1055 

tcagcctgct tccttcaagt ccccttgcag gccagctttg tgctttgcag accaactttt 1115 

taatgagata ctttgcttcc tcattcaaca ttgaagctag gcttcaatta aaaggttcga 1175 

ggaagctcca tttaaaattg tttttttact attttttaaa attgtagtgt atatgatagg 1235 

aatttgcatt taaatatgtt catttttgca tatgttagga gtggaaacaa tctggaaaac 1295 

attttttttt catccaaaaa gtattctcct tgggcatatc tgatggaaaa aaaccttgat 1355 

tttattttcg tatctttagt ctgtgttctt tctagttatt tggtactaat tatgtgcaat 1415 

ctaaaaacac tcccacaagt atttgttttt taattataaa atcatagtat atgttctttg 1475 

tagaaaactg gaaaaataca tattcaaaca gaaaaaaaaa tcaaaattcc ccataatgtt 1535 

gccatctaaa aataacctct attttagttg atatcccgta ttcatttttg aaagccattc 1595 

cttaatgcta gtttgataca cactaaaagt ttagcttaca agttcaaatt ctgccagctt 1655 

ttcctgacag ctatttgcat ttttttcaga tgagtgatta ttggccattt tctttttctt 1715 

tttctttatt ttatttattt atttttttga gacagagttt tgctctgttg cccaggctgg 1775 

agtgcagtgg tgcaatctcg gctcactgca acctctgcct cctgggttca agtgattctc 1835 

cacctcagcc tcccaagtag ctgggactac ggatgcctgc caccacgcct ggctaatttt 1895 

tttttgtatt tttttgtaga gacggggttt caccatgttg tccaggctaa tcttgaactt 1955 

gtgacctcag gtgatccacc cgcctcggcc tccgaaagtg ctgggattac aggcgtgagc 2015 

taccacgccc ggccttattg accattttct aaataagcac attctatctt tattctctta 2075 

aaattcaaat tttctgttac tgataatcct aatactagga ttcttgctta agtatgtgaa 2135 

accattaccg atttgttgtt cacatttatt ttttatgttg tgaaactgga ctaaaggaat 2195 

agagggatga ttagtcataa aagtcaaata gcatttgtgt ttaactgttg agaaaagtga 2255 

aagatcagta tgattattat ggaactgttt ttaattcttg cttaaagact acaattttag 2315 

tataatgaca tttgagtcta gggtagtatg tggtagattt ctagatggtc cctaattaag 2375 

aagtattgtt gtatttagaa ttgtccacct aatttctttt tatataatgc caaggtattt 2435 

cttgtgcttt tgggatctta tgctgtttgt aaaatgttac tgtccaatgt tggattattg 2495 

ttttggtttc aggcatttgc tgaataggtg atgatacatg ggtatttttc tgcaagtatt 2555 

taaaccaggg gcatatgcaa aggcagttgt aatttcctct tggaaaaagc gccaaatgtt 2615 

tgaaggttaa aatcaaatgc tagggttgat atttaggctt ataacaaaat aggcttgttt 2675 

tcaaagcagt tttttcctag agttttaact gttaactcac tagtttgctg ctgtttttaa 2735 

ctatgttaaa taacatatgg tatttggcaa atagatttat ttttcaaaat gtctcactag 2795 

tttcctttta cacaatgtat atacttcaag atgtatagaa aggaaagcta cagttgagcc 2855 

cttatacatg ttttaaggta gaaatatgtt ccctattgtt tgaaaactga ttgtaagaat 2915 

aacctcagtt aggagatata acttgaagtg tcagtccaaa ctactgattt aaccctattt 2975 

acggtaacac attaccttcc tcacctcctg tttggccctg gagaatgtag tcctttttct 3035 

catttgtgtt gagaaatgaa aagtctgctg tagaatgtat ctgatgtcat tagttcttca 3095 

aatggatacc attgtacata taacagtaga atttggtttg gggttgttag tgaaaaaaaa 3155 

ttta 3159 

<210> 150 
<211> 1033 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 30. ,305 

<400> 150 

agaatgtcgt actctcagga ggctccacc atg ttc agg gat ttc gga cgc cga 53 

Met Phe Arg Asp Phe Gly Arg Arg 
1 5 

ctg cag agg gat ttg aag aga gtg gtg gat get agg ctg agg etc age 101 
Leu Gin Arg Asp Leu Lys Arg Val Val Asp Ala Arg Leu Arg Leu Ser 

10 15 20 

gag gag etc ars sgc ggg agg ate aag eeg aag cct gtg gag gte cag 14 9 
Glu Glu Leu Xaa Xaa Gly Arg lie Lys Pro Lys Pro Val Glu Val Gin 
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25 










30 










35 










40 




gtg 


gtc 


acq 


cat 


cac 


atg 


cag 


cgc 


tac 


gcc 


gtg 


tgg 


ttc 


gga 


ggc 


tec 


197 


Vai 


Val 


Thr 


His 


His 


Met 


Gin Arg Tyr 


Ala 


Val 


Trp 


Phe Gly Gly 


Ser 












45 










50 










55 






atg 


ctg 


gcc 


teg 


act 


ccc 


gag 


ttc 


ttt 


cag 


gtc 


tgc 


cac 


acc 


aag 


aag 


245 


Met 


Leu 


Ala 


Ser 


Thr 


Pro 


Glu 


Phe 


Phe 


Gin 


Val 


Cys 


His 


Thr 


Lys 


Lys 










60 










65 










70 








gac 


tat 


gaa 


gag 


tac 


ggg 


ccc 


age 


ate 


tgc 


cgc 


cac 


aac 


ccc 


gtc 


ttt 


293 


Asp 


Tyr 


Glu 


Glu 


Tyr 


Gly 


Pro 


Ser 


He 


Cys Arg 


His 


Asn 


Pro 


Val 


Phe 








75 










80 










85 











gga gtc atg tec tagtgtctgc ctgaacgcgt cgttcgatgg tgtcacgttg 345 
Gly Val Met Ser 
90 

gggaacaagt gtcettcaga acccagagaa ggccgecgtt ctgtaaatag cgacgtcggt 405 

gttgctgccc agcagcgtgc ttgeattgee ggtgcatgag gcgcggcgcg ggcccttcag 4 65 

taaaagecat ttatccgtgt gecgaccget gtctgccagc ctcctccttc tcccgccctc 525 

ctcaccctcg ctctccctce tectcctcct ccgagctgct agctgacaaa tacaattctg 585 

aaggaatcca aatgtgactt tgaaaattgt tagagaaaac aacattagaa aatggcgcaa 645 

aatcgttagg tcccaggaga gaatgtgggg gcgcaaaccc ttttcctccc agcctatttt 705 

tgtaaataaa atgtttaaae ttgaaataca aatcgatgtt tatatttcet atcattttgt 765 

attttatggt atttggtaca actggctgat actaagcacg aatagatatt gatgttatgg 825 

agtgctgtaa tccaaagttt ttaattgtga ggcatgttet gatatgttta taggcaaaca 885 

aataaaacag caaaettttt tgccacatgt ttgctagaaa atgattatac tttattggag 945 

tgacatgaag tttgaacact aaacagtaat gtatgagaat tactacagat acatgtatct 1005 

tttagttttt tttgtttgaa ctttctgg 1033 

<210> 151 

<211> 1033 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 30. .305 



<400> 151 

agaatgtcgt actctcagga ggctccacc atg ttc agg gat ttc gga cgc cga 53 

Met Phe Arg Asp Phe Gly Arg Arg 
1 5 



ctg 


cag 


agg 


gat 


ttg 


aag 


aga 


gtg 


gtg 


gat get 


agg 


ctg 


agg 


etc 


age 


101 


Leu 


Gin 


Arg 


Asp 


Leu 


Lys 


Arg 


Val 


Val 


Asp Ala Arg 


Leu Arg 


Leu 


Ser 






10 










15 








20 












gag 


gag 


etc 


age 


ggc 


ggg 


agg 


ate 


aag 


ecg aag 


ect 


gtg 


gag 


gtc 


cag 


149 


Glu 


Glu 


Leu 


Ser 


Gly 


Gly 


Arg 


He 


Lys 


Pro Lys 


Pro 


Val 


Glu 


Val 


Gin 




25 










30 








35 










40 




gtg 


gtc 


aeg 


cat 


cac 


atg 


cag 


cgc 


tac 


gcc gtg 


tgg 


ttc 


gga 


ggc 


tec 


197 


Val 


Val 


Thr 


His 


His 


Met 


Gin 


Arg Tyr Ala Val 


Trp 


Phe 


Gly Gly 


Ser 












45 










50 








55 






atg 


ctg 


gcc 


teg 


act 


ccc 


gag 


ttc 


ttt 


cag gtc 


tgc 


cac 


acc 


aag 


aag 


245 


Met 


Leu 


Ala 


Ser 
60 


Thr 


Pro 


Glu 


Phe 


Phe 
65 


Gin Val 


Cys 


His 


Thr 

70 


Lys 


Lys 




gac 


tat 


gaa 


gag 


tac 


ggg 


ccc 


age 


ate 


tgc cgc 


cac 


aac 


ccc 


gtc 


ttt 


293 


Asp 


Tyr 


Glu 
75 


Glu 


Tyr 


Gly 


Pro 


Ser 
80 


He 


Cys Arg 


His 


Asn 
85 


Pro 


Val 


Phe 




gga 


gtc 


atg 


tec 


tagtgtctgc ctgaacgcgt cgttcgatgg tgtcacgttg 




345 


Gly 


Val 


Met 


Ser 



























90 



gggaacaagt gtcettcaga acccagagaa ggccgecgtt ctgtaaatag cgacgtcggt 405 

gttgctgccc agcagcgtgc ttgeattgee ggtgcatgag gcgcggcgcg ggcccttcag 465 

taaaagecat ttatccgtgt gecgaccget gtctgccagc ctcctccttc tcccgccctc 525 

ctcaccctcg ctctccctce tectcctcct ccgagctgct agctgacaaa tacaattctg 585 

aaggaatcca aatgtgactt tgaaaattgt tagagaaaac aacattagaa aatggcgcaa 64 5 

aatcgttagg tcccaggaga gaatgtgggg gcgcaaaccc ttttcctccc agcctatttt 705 
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tgtaaataaa atgtttaaac ttgaaataca aatcgatgtt tatatttcct atcattttgt 765 

attttatggt atttggtaca actggctgat actaagcacg aatagatatt gatgttatgg 825 

agtgctgtaa tccaaagttt ttaattgtga ggcatgttct gatatgttta taggcaaaca 885 

aataaaacag caaacttttt tgccacatgt ttgctagaaa atgattatac tttattggag 945 

tgacatgaag tttgaacact aaacagtaat gtatgagaat tactacagat acatgtatct 1005 

tttagttttt tttgtttgaa ctttctgg 1033 

<210> 152 
<211> 688 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 159. .452 



<220> 

<221> polyA site 
<222> 678.. 688 



<220> 

<221> polyA_signal 
<222> 670. .675 



<400> 152 

atacgtacac caatgcggac aagctcctag aagcagcaga gcagttggct cagacggggg 60 
aatgtgaccc cgaggagatc tacaaggcag ctcgacacct ggaggtgcgc atccaagact 120 
tcgtgcgcag ggtggagcag cggaagcttc tcctggac atg tct gtt tec ttc cac 176 

Met Ser Val Ser Phe His 



aca 


cac 


acc 


aaa 


gag 


ttg 


tgg 


aca 


tgg 


atg 


1 

gaa 


gac 


ctt 


cag 


5 

aag 


gag 


224 


Thr 


His 


Thr 


Lys 


Glu 


Leu 


Trp 


Thr 


Trp 


Met 


Glu 


Asp 


Leu 


Gin 


Lys 


Glu 










10 










15 










20 








atg 


ttg 


gag 


gat 


gtc 


tgt 


gca 


gat 


tct 


gtg 


gat 


gca 


gtc 


cag 


gaa 


ctg 


272 


Met 


Leu 


Glu 


Asp 


Val 


Cys 


Ala 


Asp 


Ser 


Val 


Asp 


Ala 


Val 


Gin 


Glu 


Leu 








25 










30 










35 










ate 


aag 


cag 


ttc 


cag 


cag 


cag 


cag 


acc 


gee 


act 


eta 


gat 


gee 


aca 


etc 


320 


lie 


Lys 


Gin 


Phe 


Gin 


Gin 


Gin 


Gin 


Thr 


Ala 


Thr 


Leu 


Asp 


Ala 


Thr 


Leu 






40 










45 










50 












aat 


gtc 


ate 


aag 


gaa 


ggc 


gaa 


gac 


ctt 


ate 


cag 


cag 


etc 


agg 


tea 


gcg 


368 


Asn 


Val 


He 


Lys 


Glu 


Gly 


Glu 


Asp 


Leu 


He 


Gin 


Gin 


Leu 


Arg 


Ser 


Ala 




55 










60 










65 










70 




ect 


cce 


tec 


etc 


ggg 


gag 


cce 


age 


gag 


gee 


agg 


tea 


gca 


tgg 


gca 


gag 


416 


Pro 


Pro 


Ser 


Leu 


Gly 


Glu 


Pro 


Ser 


Glu 


Ala 


Arg 


Ser 


Ala 


Trp 


Ala 


Glu 












75 










80 










85 






ett 


tec 


agt 


ggg 


aaa 


tgc 


etc 


ggg 


eta 


gac 


gtg 


aga 


taagtcctgt 




462 


Leu 


Ser 


Ser 


Gly 


Lys 


Cys 


Leu 


Gly 


Leu 


Asp 


Val 


Arg 













90 95 



cccagtcttg aetcagcaaa atcttgcttt gtggtcttgt gtgagctteg gtttctttat 522 

ctgtaagatg taaaaatgcc tgcaccatac tcaacttgaa tctagaataa aatagtaaat 582 

ggaaaagctt taatgtttca aagaattatt tatcattcag gatgttattc agecatcagt 642 

aagtaacata aatacaacaa aaattaacat aaaacaaaaa aaaaaa 688 



<210> 153 
<211> 420 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 63. .329 



<220> 
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<221> polyA_site 
<222> 406. .420 



<400> 153 

atgctcgcca ggataccccc tcttcccatg gcgacctcag accgcgggcg tcagcgttgg 60 
eg atg gtc tec gga gac ggc ttc etc gtt tec agg ect gaa geg att 107 
Met Val Ser Gly Asp Gly Phe Leu Val Ser Arg Pro Glu Ala lie 

15 10 15 



cat 


eta 


gga 


ect 


egg 


cag 


geg 


gtg 


cga 


eca 


age 


gtt 


egg 


gee 


gag age 


155 


His 


Leu 


Gly 


Pro 


Arg 


Gin 


Ala 


Val 


Arg 


Pro 


Ser 


Val 


Arg 


Ala 


Glu Ser 












20 










25 










30 




cgt 


cga 


gtg 


gat 


ggt 


ggc 


ggc 


egg 


age 


eca 


agg 


gaa 


eca 


gat 


ggc egg 


203 


Arg 


Arg 


Val 


Asp 


Gly 


Gly 


Gly 


Arg 


Ser 


Pro 


Arg 


Glu 


Pro 


Asp 


Gly Arg 










35 










40 










45 






ggc 


egg 


age 


cgc 


caa 


gcg 


aga 


ttc 


tea 


ect 


tac 


eca 


ate 


ect 


gee gtt 


251 


Gly 


Arg 


Ser 


Arg 


Gin 


Ala 


Arg 


Phe 


Ser 


Pro 


Tyr 


Pro 


He 


Pro 


Ala Val 








50 










55 










60 








gaa 


eee 


gat 


etc 


eta 


aga 


agt 


gtg 


etg 


caa 


eag 


cgt 


ttg 


att 


gea tta 


299 


Glu 


Pro 


Asp 


Leu 


Leu 


Arg 


Ser 


Val 


Leu 


Gin 


Gin 


Arg 


Leu 


He 


Ala Leu 






65 










70 










75 










gga 


ggt 


gtt 


ate 


gea 


get 


cga 


att 


tea 


gtt 


taaacgaaea cctttcctct 


349 


Gly Gly 


Val 


lie 


Ala 


Ala 


Arg 


He 


Ser 


Val 















80 85 

ggccctcact tagcttgtga aeaggcettt ttaaaatcet ttcttggtgt agcagcaaaa 409 
aaaaaaaaaa a 420 



<210> 154 
<211> 640 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 181. .549 



<220> 

<221> polyA_site 
<222> 626.. 640 



<400> 154 

atgctcgcca gaatacecct cgttaaaggc aaggeggett etggetcttc cgcaggctea 60 

gttatgggtt cetgtgtgct tagtcategg gtccgacaea ggctggactg atetggggag 120 

ccgcgaaggg cctgccttca caaagggacg taacgeaagt attgcgggca gtgtttgaat 180 



atg 


gee 


etg 


aac 


aat 


gtg 


tec 


etg 


tec 


tee 


ggt 


gat 


cag 


agg 


age 


agg 


228 


Met 


Ala 


Leu 


Asn 


Asn 


Val 


Ser 


Leu 


Ser 


Ser 


Gly Asp 


Gin 


Arg 


Ser 


Arg 




1 








5 










10 










15 






gtg 


gee 


tac 


cgc 


tet 


tec 


cat 


ggc 


gac 


etc 


aga 


ccg 


egg 


gcg 


tea 


gcg 


276 


Val 


Ala 


Tyr 


Arg 


Ser 


Ser 


His 


Gly 


Asp 


Leu 


Arg 


Pro 


Arg 


Ala 


Ser 


Ala 










20 










25 










30 








ttg 


gcg 


atg 


gtc 


tec 


gga 


gac 


ggc 


ttc 


etc 


gtt 


tec 


agg 


ect 


gaa 


gcg 


324 


Leu 


Ala 


Met 


Val 


Ser 


Gly 


Asp 


Gly 


Phe 


Leu 


Val 


Ser 


Arg 


Pro 


Glu 


Ala 








35 










40 










45 










att 


eat 


eta 


gga 


ect 


egg 


cag 


gcg 


gtg 


cga 


eca 


age 


gtt 


egg 


gee 


gag 


372 


He 


His 


Leu 


Gly 


Pro 


Arg 


Gin 


Ala 


Val 


Arg 


Pro 


Ser 


Val 


Arg 


Ala 


Glu 






50 










55 










60 












age 


cgt 


cga 


gtg 


gat 


ggt 


ggc 


ggc 


egg 


age 


eca 


agg 


gaa 


eca 


gat 


ggc 


420 


Ser 


Arg 


Arg 


Val 


Asp 


Gly 


Gly 


Gly 


Arg 


Ser 


Pro Arg 


Glu 


Pro 


Asp 


Gly 




65 










70 










75 










80 




egg 


ggc 


egg 


age 


cgc 


caa 


gee 


aga 


ttc 


tea 


ect 


tac 


eca 


ate 


ect 


gee 


4 68 


Arg 


Gly 


Arg 


Ser 


Arg 


Gin 


Ala 


Arg 


Phe 


Ser 


Pro 


Tyr 


Pro 


He 


Pro 


Ala 












85 










90 










95 






gtt 


gaa 


eee 


gat 


etc 


eta 


aga 


agt 


gtg 


etg 


caa 


cag 


cgt 


ttg 


att 


gea 


516 


Val 


Glu 


Pro 


Asp 


Leu 


Leu 


Arg 


Ser 


Val 


Leu 


Gin 


Gin 


Arg 


Leu 


He 


Ala 





wo 02/083898 PCT/IBO 1/009 14 

119 

100 105 110 

tta gga ggt gtt ate gca get cga att tea gtt taaacgaaca cctttcctet 569 
Leu Gly Gly Val He Ala Ala Arg He Ser Val 

115 120 
ggccctcact tagcttgtga acaggccttt ttaaaatcct ttcttggtgt agcagcaaaa 629 
aaaaaaaaaa a 640 

<210> 155 

<211> 1582 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 8.. 523 

<220> 

<221> polyA_site 
<222> 1567. .1582 

<400> 155 



agtcaag 


atg 


qcQ 


gga 


gca 


get 


acc cag get tec 


ctg 


gag 


teg 


gee 


cea 


49 






Met 


Ala 


Gly Ala 


Ala 


Thr Gin Ala Ser 


Leu 


Glu 


Ser 


Ala 


Pro 








1 








5 




10 












egg 


ate 


atg 


egg 


ctg 


gtg 


gee 


gaa tge age cgc 


tee 


agg 


gee 


egg 


gca 


97 


Arg 


He 


Met 


Arq 


Leu 


Val 


Ala 


Glu Cys Ser Arg Ser Arg Ala Arg Ala 




15 










20 




25 










30 




ggc 


gag 


ctg 


tgg 


ctg 


ccg 


eat 


ggg aca gtg gee 


act 


ect 


gtg 


ttc 


atg 


145 


Gly Glu 


Leu 


Trp 


Leu 


Pro 


His 


Gly Thr Val Ala 


Thr 


Pro 


Val 


Phe 


Met 












35 






40 








45 






oca 


gtg 


ggc 


acg 


cag 


gee 


ace 


atg aag ggc ate 


acg 


ace 


gaa 


cag 


ctg 


193 


Pro 


Val 


Gly 


Thr 


Gin 


Ala 


Thr 


Met Lys Gly He 


Thr 


Thr 


Glu 


Gin 


Leu 










50 








55 






60 








gac 


get 


ctg 


ggt 


tge 


cgc 


ate 


tge ctg ggc aat 


acc 


tac 


eat 


ctg 


ggt 


241 


Asp 


Ala 


Leu 


Gly 


Cys 


Arg 


He 


Cys Leu Gly Asn 


Thr 


Tyr 


His 


Leu 


Gly 








65 










70 




75 










eta 


agg 


ccg 


gga 


ccc 


gag 


ctg 


ate cag aaa gee 


aac 


ggt 


etc 


cae 


ggc 


289 


Leu 


Arg 


Pro 


Gly 


Pro 


Glu 


Leu 


He Gin Lys Ala 


Asn 


Gly Leu 


His 


Gly 






80 










85 




90 












ttc 


atg 


aat 


tgg 


ect 


cat 


aat 


ctg eta acg gac 


age 


ggc 


ggt 


ttc 


cag 


337 


Phe 


Met 


Asn 


Trp 


Pro His Asn Leu Leu Thr Asp Ser Gly Gly Phe Gin 




95 










100 




105 










110 




atg 


gtg 


teg 


ctg 


gtg 


tct 


ctg 


tee gag gtg acg 


gag 


gag 


ggc 


gtc 


cgc 


385 


Met 


Val 


Ser 


Leu 


Val 


Ser 


Leu 


Ser Glu Val Thr 


Glu 


Glu 


Gly Val 


Arg 












115 






120 








125 






ttc 


ege 


tec 


ccc 


tac 


gac 


ggc 


aat gag ace ctg 


ctg 


age 


ccg 


gag 


aaa 


433 


Phe 


Arg 


Ser 


Pro 


Tyr Asp Gly Asn Glu Thr Leu 


Leu 


Ser 


Pro 


Glu 


Lys 










130 








135 






140 








tec 


gtg 


cag 


ate 


cag 


aat 


gcg 


ctg ggc teg gac 


ate 


ate 


atg 


cag 


ctg 


481 


Ser 


Val 


Gin 


He 


Gin 


Asn 


Ala 


Leu Gly Ser Asp He 


He 


Met 


Gin 


Leu 








145 










150 




155 










gac 


gac 


gtg 


gtt 


age 


agt 


act 


gtg act ggg cea 


cgt 


gtg 


gag 






523 


Asp Asp 


Val 


Val 


Ser 


Ser 


Thr 


Val Thr Gly Pro Arg 


Val 


Glu 










160 










165 




170 













taggccatgt acaggtcaat ccgctggctg gaccggtgca ttgeagecea tcagcggccg 583 

gacaagcaga acctettcgc eattatceag ggtgggctgg acgcagatct ccgggccaec 643 

tgccttgaag agatgaccaa gegagacgtg cctggcttcg ceateggggg cetgageggg 703 

ggtgagagca agtcgcagtt ctggcggatg gtggegetga geacctcteg gctgccgaag 763 

gacaagcecc gatatctgat gggggttggg tatgttgtgg atagggaage cagageceta 823 

cetgtgggaa gtggattcet ggggaececc taccctgett ggggaggtgg catttggggg 883 

aaacggacac aggtctgate tgaggagact aggaagacat ggctgtccet tgggggceat 94 3 

tetgagggaa tatggeccag tctggggcag tgtgagtgtt gggaggggcc ctgggaagcc 1003 

cctgaggttc tctgccccct cccgtcatgg ctgcaacccc agctatgcca ctgatctggt 1063 
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120 

agtctgcgtg gctcttggat gtgacatgtt cgactgcgtc ttccccacac ggacagcgcg 1123 

ctttggctct gccctggtgc ccactgggaa cctgcagttg aggaagaagg tgtttgagaa 1183 

ggacttcggc cccatagacc cggagtgcac ctgccccacg tgccaaaagc acagccgcgc 1243 

cttcctgcac gcactgctgc acagtgacaa cacggccgcg ctgcaccacc tcacggtcca 1303 

caacatcgcc taccagctgc agctcatgag cgccgtccgc accagcatcg tggagaagcg 1363 

cttcccggac ttcgtgcggg acttcatggg cgccatgtac ggggatccca ccctctgtcc 1423 

cacctgggcc actgacgctc tggcctctgt gggaatcaca ctgggctgac ctggcattgg 1483 

gagagggagg gaggaaggaa gggagggagg ggctggaaga tactgaagga ttcctttttg 1543 

aaaggttttt tttattgtaa cttaaaaaaa aaaaaaaaa 1582 

<210> 156 
<211> 104 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 45.. 104 



<400> 156 

agagctgagc tgaagcggga cccggagccc gagcagccgc cgcc atg gca ate aaa 56 

Met Ala lie Lys 
1 

ttt ctg gaa gtc ate aag ccc ttc tgt gtc ate ctg caa eta aea tet 104 
Phe Leu Glu Val lie Lys Pro Phe Cys Val lie Leu Gin Leu Thr Ser 
5 10 15 20 



<210> 157 
<211> 150 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 2.. 70 



<220> 

<221> polyA_site 
<222> 135. .150 



<220> 

<221> polyA_signal 
<222> 115. .120 



<400> 157 

t ttt etc aac aat tec tea ceg cag gag eet etg aag etc eca eca ggc 4 9 
Phe Leu Asn Asn Ser Ser Pro Gin Glu Pro Leu Lys Leu Pro Pro Gly 
15 10 15 

cag etc tec tec acc cac cgc taactctcag ccccagtcac ectcttggag 100 

Gin Leu Ser Ser Thr His Arg 
20 

cttccctget ttgaattaaa gaccactcat gcgcaaaaaa aaaaaaaaaa 150 



<210> 158 
<211> 510 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 150. .284 



<220> 
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121 

<22l> polyA_site 
<222> 495. .510 



<220> 

<221> polyA^signal 
<222> 475. .480 



<400> 158 

aaccaaagca tgggctcttt gtttcctctt tgccaggcct gggtccttcc cagtgacccc 60 
tcagccttca cttcctcctc actcccgggg agctgtgaca ggaagaggag atcccagaac 120 
tggagattga cgtggatgag ctcctggac atg gag agt gac gat gcc egg get 173 

Met Glu Ser Asp Asp Ala Arg Ala 

1 5 

gcc agg gtc aag gag ctg ctg gtt gac tgt tac aaa ccc aca gag gcc 221 
Ala Arg Val Lys Glu Leu Leu Val Asp Cys Tyr Lys Pro Thr Glu Ala 

10 15 20 

ttc att tyt ggc ctg ctg gac aag ate egg ggc atg cag aag ctg age 269 
Phe lie Xaa Gly Leu Leu Asp Lys He Arg Gly Met Gin Lys Leu Ser 
25 30 35 40 

aca ccc cag aag aag tgagggtcce egaeccagga gaacggtggc tcceacagga 324 
Thr Pro Gin Lys Lys 
45 

caatcgctgc cccccaacct cgtagcaaca gcaataccgg gggaccctgc ggceaggect 384 
ggtgccatga geagggctcc tcgtgcccct ggcccagggg tctcttcccc tgeeecetca 44 4 
gttttccact tttggggttt ttttattgtt attaaactga tgggacttet aaaaaaaaaa 504 
aaaaaa 510 



<210> 159 
<211> 390 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 10. .336 



<220> 

<221> polyA_site 
<222> 373. .390 



<220> 

<221> polyA_signal 
<222> 354. .359 



<400> 159 

agtctgaag atg gcg gcc tea gca gcg ega ggt get gcg geg ctg egt aga 51 

Met Ala Ala Ser Ala Ala Arg Gly Ala Ala Ala Leu Arg Arg 
1 5 10 



agt 


ate 


aat 


cag 


ceg 


gtt 


get 


ttt 


gtg 


aga 


aga 


att 


cct 


tgg 


act 


geg 


99 


Ser 


He 


Asn 


Gin 


Pro 


Val 


Ala 


Phe 


Val 


Arg 


Arg 


He 


Pro 


Trp 


Thr 


Ala 




15 










20 










25 










30 




gcg 


teg 


agt 


cag 


ctg 


aaa 


gaa 


cae 


ttt 


gca 


cag 


ttc 


ggc 


cat 


gtc 


aga 


147 


Ala 


Ser 


Ser 


Gin 


Leu 


Lys 


Glu 


His 


Phe 


Ala 


Gin 


Phe 


Gly 


His 


Val 


Arg 












35 










40 










45 






agg 


tge 


att 


tta 


cat 


ttt 


gac 


aag 


gag 


act 


ggc 


ttt 


cae 


aga 


ggt 


ttg 


195 


Arg 


Cys 


He 


Leu 


Pro 


Phe 


Asp 


Lys 


Glu 


Thr 


Gly 


Phe 


His 


Arg 


Gly 


Leu 










50 










55 










60 








ggt 


tgg 


gtt 


cag 


ttt 


tct 


tea 


gaa 


gaa 


gga 


ctt 


egg 


aat 


gca 


eta 


caa 


243 


Gly Trp 


Val 


Gin 


Phe 


Ser 


Ser 


Glu 


Glu 


Gly 


Leu 


Arg 


Asn 


Ala 


Leu 


Gin 








65 










70 










75 










cag 


gaa 


aat 


cat 


att 


ata 


gat 


gga 


gta 


aag 


gtc 


cag 


gtt 


cac 


act 


aga 


291 


Gin 


Glu 


Asn 


His 


He 


He 


Asp 


Gly 


Val 


Lys 


Val 


Gin 


Val 


His 


Thr 


Arg 





80 85 90 
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agg cca aaa ctt ccg caa aca tct gat gat gaa aag aaa gat ttt 336 

Arg Pro Lys Leu Pro Gin Thr Ser Asp Asp Glu Lys Lys Asp Phe 
95 100 105 

tgagactgca gcctattaat aaagttaaca taactgacaa aaaaaaaaaa aaaa 390 

<210> 160 
<211> 398 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 23.. 175 

<220> 

<221> polyA_site 
<222> 383.. 398 

<220> 

<221> polyA_signal 
<222> 365,. 370 

<400> 160 

gagaaggtgc tttagtctga ag atg gcg gcc tea gca gcg cga ggt get gcg 52 

Met Ala Ala Ser Ala Ala Arg Gly Ala Ala 
15 10 
gcg ctg cat aag tat caa tea gcc ggt tgc ttt tgt gag aag aat tec 100 
Ala Leu His Lys Tyr Gin Ser Ala Gly Cys Phe Cys Glu Lys Asn Ser 

15 20 25 

ttg gac tgc ggc gtc gag tea get gaa aga aca ctt tgc aca gtt egg 148 
Leu Asp Cys Gly Val Glu Ser Ala Glu Arg Thr Leu Cys Thr Val Arg 

30 35 40 

cca tgt cag aag gtg cat ttt acc ttt tgacaaggag actggcttte 195 
Pro Cys Gin Lys Val His Phe Thr Phe 

45 50 
acagaggttt gggttgggtt cagttttctt cagaagaagg acttcggaat gcaetacaac 255 
aggaaaatca tattatagat ggagtaaagg tccaggttea cactagaagg ecaaaactte 315 
cgcaaacatc tgatgatgaa aagaaagatt tttgagactg cagcctatta ataaagttaa 375 
eataaecaaa aaaaaaaaaa aaa 398 



<210> 


161 


<211> 


425 


<212> 


DNA 


<213> 


Homo sapiens 


<220> 




<221> 


CDS 


<222> 


127. .423 


<220> 




<221> 


polyA site 


<222> 


394. .425 


<400> 


161 


c cca 


gtc aga ggg ctg 


Pro 


Val Arg Gly Leu 


1 


5 



c atg tec gag ttt gtc tgt aac ctg tea gca 4 9 
n Met Ser Glu Phe Val Cys Asn Leu Ser Ala 
10 15 
agg ect tat gtg tac gac etc att gee gtg tec aat cat tat gga gee 97 
Arg Pro Tyr Val Tyr Asp Leu lie Ala Val Ser Asn His Tyr Gly Ala 

20 25 30 

atg ggg gtt ggc cac tac taaagcagct tatgtgctat tttaccaacg 14 5 

Met Gly Val Gly His Tyr 
35 
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tcgagatgat gaattttata agacaccttc acttagcagt tctggttcct ctgatggagg 205 

gacacgacca agcagctctc agcagggctt tggggatgat gaggcttgca gcatggacac 265 

caactaatgc tgactccacg atcctgccac cctgtagcgc cagtgtaatc ccccaggaga 325 

acatctttga cactctgcag actgctagtg ttctgtctaa aaaccagaca aggaaatacc 385 

cttcttttat gagcagaagg aawcaaaaaa aaaaaaaaaa 425 

<210> 162 
<211> 438 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 3.. 41 

<220> 

<221> polyA_site 
<222> 419. .438 

<400> 162 

ag atg tta ggc ttt gcc aaa aac tgg ate ttt agt ate act tgageactea 51 

Met Leu Gly Phe Ala Lys Asn Trp He Phe Ser He Thr 

15 10 

ttggacacaa gaggtcaact gttttgtcat ttgtgtagta agaacaaaat aaagtgtgtg 111 

gcattaaaga aataacactg tctggccggg ateggtggct cacgectgta atcccagcac 171 

tttgggagge cgaggcgggc agatcaegag gtcaggagtt cgagaecage ctgaccaaca 231 

tgatgaaacc ctgtctctac taaaaataca aaaattagcc aggcatggtg gtgcacaect 291 

gtagtcccag ctgctcagga ggctgaggca ggagaatcgc tcgaacccgg gaggtggagg 351 

ttgcagtgag ctgagatggc gccactgcat tccaggcctg gggggtgaca gagcaagact 411 

cagtctcaga aaaaaaaaaa aaaaaag 438 

<210> 163 
<211> 436 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 3. .41 

<220> 

<221> polyA_site 
<222> 419.. 436 

<400> 163 

ag atg tta ggc ttt gcc aaa aac tgg ate ttt agt ate act tgageactea 51 

Met Leu Gly Phe Ala Lys Asn Trp He Phe Ser He Thr 

15 10 

ttggacacaa gaggtcaact gttttgtcat ttgtgtagta agaacaaaat aaagtgtgtg 111 

gcattaaaga aataacactg tctggccggg ateggtggct cacgectgta atcccagcac 171 

tttgggagge cgaggcgggc agatcaegag gtcaggagtt cgagaecage ctgaccaaca 231 

tgatgaaacc etgtetctac taaaaataca aaaattagcc aggcatggtg gtgcacaect 291 

gtagtcccag ctgctcagga ggctgaggca ggagaatcgc tcgaacccgg gaggtggagg 351 

ttgcagtgag ctgagatggc gccactgcat tccaggcctg gggggtgaca gagcaagact 411 

cagtctcaga aaaaaaaaaa aaaaa 4 36 

<210> 166 
<211> 436 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 



wo 02/083898 
<222> 3. .41 
<220> 

<221> polyA_site 
<222> 421.. 436 

<400> 166 

ag atg tta ggc ttt gcc aaa aac tgg ate ttt agt ate act tgagcactca 

Met Leu Gly Phe Ala Lys Asn Trp lie Phe Ser lie Thr 

15 10 
ttggacacaa gaggtcaact gttttgtcat ttgtgtagta agaacaaaat aaagtgtgtg 
gcattaaaga aataacactg tctggccggg atcggtggct cacgcctgta atcccagcac 
tttgggaggc cgaggcgggc agatcacgag gtcaggagtt cgagaccagc ctgaccaaca 
tgatgaaacc ctgtctctac taaaaataca aaaattagcc aggcatggtg gtgcacacct 
gtagtcccag ctgctcagga ggctgaggca ggagaatcgc tcgaacccgg gaggtggagg 
ttgcagtgag ctgagatggc gccactgcat tccaggcctg gggggtgaca gagcaagact 
cagtctcaga aaaaaaaaaa aaaaa 

<210> 167 
<211> 436 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 3.. 41 

<220> 

<221> polyA_site 
<222> 419. .436 

<400> 167 

ag atg tta ggc ttt gcc aaa aac tgg ate ttt agt ate act tgagcactca 

Met Leu Gly Phe Ala Lys Asn Trp lie Phe Ser lie Thr 

15 10 

ttggacacaa gaggtcaact gttttgtcat ttgtgtagta agaacaaaat taagtgtgtg 

gcattaaaga aataacactg tctggccggg atcggtggct cacgcctgta atcccagcac 

tttgggaggc cgaggcgggc agatcacgag gtcaggagtt cgagaccagc ctgaccaaca 

tgatgaaacc ctgtctctac taaaaataca aaaattagcc aggcatggtg gtgcacacct 

gtagtcccag ctgctcagga ggctgaggca ggagaatcgc tcgaacccgg gaggtggagg 

ttgcagtgag ctgagatggc gccactgcat tccaggcctg gggggtgaca gagcaagact 

cagtctcaga aaaaaaaaaa aaaaa 

<210> 168 
<211> 394 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 10. .336 

<220> 

<221> polyA_site 
<222> 373.-394 

<220> 

<221> polyA_signal 
<222> 354. .359 

<400> 168 

agtctgaag atg gcg gcc tea gca gcg cga ggt get gcg gcg ctg cgt aga 51 
Met Ala Ala Ser Ala Ala Arg Gly Ala Ala Ala Leu Arg Arg 
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124 



51 



111 
171 
231 
291 
351 
411 
436 



51 



111 
171 

231 
291 
351 
411 
436 
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15 10 



agt 


ate 


aat 


cag 


ccg 


gtt 


get 


ttt 


gtg 


aga 


aga 


atw 


cct 


tgg 


act 


gcg 


99 


Ser 


He 


Asn 


Gin 


Pro 


Val 


Ala 


Phe 


Val 


Arg 


Arg 


He 


Pro 


Trp 


Thr 


Ala 




15 










20 










25 










30 




gcg 


teg 


agt 


cag 


ctg 


aaa 


gaa 


cac 


ttt 


gca 


cag 


ttc 


gge 


cat 


gtc 


aga 


147 


Ala 


Ser 


Ser 


Gin 


Leu 


Lys 


Glu 


His 


Phe 


Ala 


Gin 


Phe 


Gly 


His 


Val 


Arg 












35 










40 










45 






agg 


tgc 


att 


tta 


cct 


ttt 


gac 


aag 


gag 


act 


gge 


ttt 


cac 


aga 


ggt 


ttg 


195 


Arg 


Cys 


He 


Leu 


Pro 


Phe 


Asp 


Lys 


Glu 


Thr 


Gly 


Phe 


His 


Arg 


Gly 


Leu 










50 










55 










60 








ggt 


tgg 


gtt 


cag 


ttt 


tct 


tea 


gaa 


gaa 


gga 


ctt 


egg 


aat 


gca 


eta 


caa 


243 


Gly Trp 


Val 


Gin 


Phe 


Ser 


Ser 


Glu 


Glu 


Gly 


Leu 


Arg 


Asn 


Ala 


Leu 


Gin 








65 










70 










75 










cag 


gaa 


aat 


cat 


att 


ata 


gat 


gga 


gta 


aag 


gtc 


cag 


gtt 


cac 


act 


aga 


291 


Gin 


Glu 


Asn 


His 


He 


He 


Asp 


Gly 


Val 


Lys 


Val 


Gin 


Val 


His 


Thr 


Arg 






80 










85 










90 












agg 


cca 


aaa 


ctt 


ccg 


caa 


aca 


tct 


gat 


gat 


gaa 


aag 


aaa 


gat 


ttt 




336 


Arg 


Pro 


Lys 


Leu 


Pro 


Gin 


Thr 


Ser 


Asp 


Asp 


Glu 


Lys 


Lys 


Asp 


Phe 







95 100 105 

tgagactgca gcctattaat aaagttaaca taactgaaaa aaaaaaaaaa aaggtcaa 394 



<210> 169 
<211> 772 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 595-. 771 



<400> 169 

atataggcgt ggctgaatac gggcctcctg cgtacgtgcg tggtgtacgt acgtgcgtga 60 
ttacgcgcac acgtacgttc etcatgaaag ggacgacggg agctgcatga aagccgaagt 120 
tatggaccgc tagcatctgt cactggccac cggtttccgg gagtaagcgg cagctacctt 180 
acagccctga cacgagccgg gtgctctctc ttctcaccgc ggcccacgtc tcctcgctgg 240 
ctccggtggc ctcgctgggt cgcgaggagg eggaggactg tactctgagg ccaaaagcca 300 
gagtcggccc tgaaegceca egactctcag ggtccagagg ccgtgagacc ggccgcggct 360 
gaaaggtaaa gaaaccaagt ggaagagtgt ttcctcctct ggecgtaaag caggtactet 420 
ctgcagcacc agctgtcccc gccctactcc ggaccgcccc aaagactcca tgggatggac 4 80 
ctgagtcagc cgaatcctag ccccttcect tgggcctgct gtggtgctcg acatcagtga 540 
cagacggaag cagcagacca tcaaggctac gggaggcccg gggcgcttgc gaag atg 597 

Met 
1 

get gge ttt gtc tta 645 
Ala Gly Phe Val Leu 
15 

cct ctg ytg age age 693 
Pro Leu Leu Ser Ser 
30 

cac agg gac tgg gaa 741 



aag ttt gge tgc 


etc 


tec 


ttc 


egg 


cag 


cct 


tat 


Lys Phe Gly Cys 


Leu 


Ser 


Phe 


Arg 


Gin 


Pro 


Tyr 


5 










10 






aat gga ate aag 


act 


gtg 


gag 


acg 


cgc 


tgg 


cgt 


Asn Gly He Lys 


Thr 


Val 


Glu 


Thr 


Arg 


Trp Arg 


20 








25 








cag egg aac tgt 


ace 


ate 


gee 


gtc 


cac 


att 


get 


Gin Arg Asn Cys 


Thr 


He 


Ala 


Val 


His 


He 


Ala 


35 






40 










gge gat gee tgt 


egg 


gag 


ctg 


ctg 


gtg 


gag 


a 


Gly Asp Ala Cys 


Arg Glu 


Leu 


Leu 


Val 


Glu 





45 



50 55 

<210> 170 
<211> 128 
<212> PRT 

<213> Homo sapiens 



772 



<220> 

<221> SIGNAL 



wo 02/083898 



PCT/TBOl/00914 



126 

<222> -34.. -1 

<220> 

<221> UNSURE 
<222> 7 

<223> Xaa = Ile,Thr 



<400> 170 



Met 


Ser 


Leu Arg 


Leu 


Asp Thr Thr Pro Ser Cys Asn Ser Ala 


Arg 


Pro 










-30 


-25 


-20 




Leu 


His 


Ala 


Leu 


Gin 


Val Leu Leu Leu Leu Ser Leu Leu Leu 


Thr 


Ala 








-15 




-10 -5 






Leu 


Ala 


Ser 


Ser 


Thr 


Lys Gly Gin Xaa Lys Arg Asn Leu Ala 


Lys 


Gly 






1 






5 10 






Lys Asp 


Glu 


Ser 


Leu 


Asp Ser Asp Leu Tyr Ala Glu Leu Arg 


Cys 


Met 


15 










20 25 




30 


Cys 


He 


Lys 


Thr 


Thr 


Ser Gly He His Pro Lys Asn He Gin 


Ser 


Leu 










35 


40 


45 




Glu 


Val 


lie Gly Lys 


Gly Thr His Cys Asn Gin Val Glu Val 


He 


Ala 








50 




55 60 






Thr 


Leu 


Lys 


Asp Gly 


Arg Lys He Cys Leu Asp Pro Asp Ala 


Pro 


Arg 






65 






70 75 






He 


Lys 


Lys 


He 


Val 


Gin Lys Lys Leu Ala Gly Asp Glu Ser 


Ala 


Asp 




80 








85 90 







<210> 171 
<211> 160 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -18. .-1 

<400> 171 



Met Lys 


Ala 


Ala 


Val 


Leu 


Thr 


Leu Ala Val Leu Phe Leu Thr 


Gly Ser 






-15 








-10 -5 






Gin Ala 


Arg 


His 


Phe 


Trp Gin Gin Asp Glu Pro Pro Gin Ser 


Pro 


Trp 




1 








5 


10 






Asp Arg 


Val 


Lys 


Asp 


Leu 


Ala 


Thr Val Tyr Val Asp Val Leu 


Lys 


Asp 


15 








20 




25 




30 


Ser Gly 


Arg 


Asp 


Tyr 


Val 


Ser 


Gin Phe Glu Gly Ser Ala Leu 


Gly 


Lys 








35 






40 


45 




Gin Leu 


Asn 


Leu 


Lys 


Leu 


Leu 


Asp Asn Trp Asp Ser Val Thr 


Ser 


Ala 






50 








55 60 






Phe Ser 


Asn 


Leu 


Arg 


Glu 


Gin 


Leu Gly Pro Val Thr Gin Glu 


Phe 


Trp 




65 










70 75 






Asp Asn 


Leu 


Glu 


Lys 


Glu 


Thr 


Glu Gly Leu Arg Gin Glu Met 


Ser 


Lys 


80 










85 


90 






Asp Leu 


Glu 


Glu 


Val 


Lys 


Ala 


Lys Val Gin Pro Tyr Leu Asp 


Asp 


Phe 


95 








100 




105 




110 


Gin Lys 


Lys 


Trp 


Gin 


Glu 


Glu 


Met Glu Leu Tyr Arg Gin Lys 


Ala 


Ala 








115 






120 


125 




Phe Leu 


Thr 


He 


Leu 


Thr Ser Leu Gly Pro Asn Gly Asn Lys 


Ala 


Phe 






130 








135 140 







<210> 172 

<211> 80 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 



wo 02/083898 
<222> -26. .-1 

<220> 

<221> UNSURE 
<222> 52 

<223> Xaa = Leu, Pro 
<400> 172 

Met Gly Arg Ala Met Val Ala Arg Leu Gly Leu Gly Leu Leu Leu Leu 

-25 -20 -15 

Ala Leu Leu Leu Pro Thr Gin He Tyr Ser Ser Glu Thr Thr Thr Gly 
-10 -5 15 

Thr Ser Ser Asn Ser Ser Gin Ser Thr Ser Asn Ser Gly Leu Ala Pro 

10 15 20 

Asn Pro Thr Asn Ala Thr Thr Lys Val Ala Gly Gly Ala Leu Gin Ser 

25 30 35 

Thr Ala Ser Leu Phe Val Val Ser Leu Ser Leu Leu His Xaa Tyr Ser 

40 45 50 

<210> 173 
<211> 101 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -27. .-1 

<223> Von Heijne matrix 

score 11.8486806626293 

seq LFLGLLLLPLVVA/FA 

<400> 173 



Met 


Ser 


Ser 


Ala 


Ala 


Gly 


Phe 


Cys 


Ala 


Ser Arg 


Pro Gly Leu 


Leu 


Phe 






-25 










-20 








-15 






Leu Gly 


Leu 


Leu 


Leu 


Leu 


Pro 


Leu 


Val 


Val Ala 


Phe 


Ala Ser 


Ala 


Glu 




-10 










-5 








1 






5 


Ala 


Glu 


Glu 


Asp 


Gly 


Asp 


Leu 


Gin 


Cys 


Leu Cys 


Val 


Lys Thr 


Thr 


Ser 










10 










15 






20 




Gin 


Val 


Arg 


Pro 


Arg 


His 


lie 


Thr 


Ser 


Leu Glu 


Val 


He Lys 


Ala 


Gly 




His 




25 










30 






35 






Pro 


Cys 


Pro 


Thr 


Ala 


Gin 


Leu 


He 


Ala Thr 


Leu 


Lys Asn Gly Arg 






40 










45 








50 






Lys 


He 


Cys 


Leu 


Asp 


Leu 


Gin 


Ala 


Pro Leu Tyr Lys 


Lys He 


He 


Lys 




55 










60 








65 








Lys 


Leu 


Leu 


Glu 


Ser 





















70 

<210> 174 
<211> 101 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -27. .-1 

<223> Von Heijne matrix 

score 11.8486806626293 

seq LFLGLLLLPLVVA/FA 

<400> 174 

Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro Gly Leu Leu Phe 

-25 -20 -15 

Leu Gly Leu Leu Leu Leu Pro Leu Val Val Ala Phe Ala Ser Ala Glu 



PCT/lBOl/00914 

127 



wo 02/083898 





-10 










-5 




Ala 


Glu 


Glu 


Asp 


Gly 


Asp 


Leu 


Gin Cys 










10 








Gin 


Val 


Arg 


Pro 


Arg 


His 


He 


Thr Ser 








25 








30 


Pro 


His 


Cys 


Pro 


Thr 


Ala 


Gin 


Leu He 






40 










45 


Lys 


He 


Cys 


Leu 


Asp 


Leu 


Gin 


Ala Pro 




55 










60 




Lys 


Leu 


Leu 


Glu 


Ser 









70 



PCT/IBOl/00914 

128 

1 5 
Leu Cys Val Lys Thr Thr Ser 
15 20 
Leu Glu Val He Lys Ala Gly 
35 

Ala Thr Leu Lys Asn Gly Arg 
50 

Leu Tyr Lys Lys He He Lys 
65 



<210> 175 
<211> 86 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -18. .-1 



<400> 175 



Met 


He Pro 


Ala 


Val 


Val 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Val Glu 


Gin 


Ala 






-15 










-10 








-5 






Ala 


Ala Leu 


Gly 


Glu 


Pro 


Gin 


Leu Cys 


Tyr 


He 


Leu Asp Ala 


He 


Leu 




1 








5 










10 








Phe 


Leu Tyr 


Gly 


He 


Val 


Leu 


Thr 


Leu 


Leu Tyr 


Cys 


Arg Leu 


Lys 


He 


15 








20 










25 








30 


Gin 


Val Arg 


Lys 


Ala 


Ala 


He 


Thr 


Ser 


Tyr 


Glu 


Lys 


Ser Asp 


Gly Val 








35 










40 








45 




Tyr 


Thr Gly 


Leu 


Ser 


Thr 


Arg 


Asn 


Gin 


Glu 


Thr 


Tyr 


Glu Thr 


Leu 


Lys 






50 










55 








60 






His 


Glu Lys 


Pro 


Pro 


Gin 





















65 



<210> 176 
<211> 92 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -24. .-1 



<220> 

<221> UNSURE 
<222> 52 

<223> Xaa = Arg, Thr 
<400> 176 



Met 


Val 


Met 


Gly 


Leu 


Gly 


Val 


Leu Leu Leu Val Phe Val 


Leu 


Gly Leu 










-20 






-15 




-10 


Gly 


Leu 


Thr 


Pro 
-5 


Pro 


Thr 


Leu 


Ala Gin Asp Asn Ser Arg 
1 5 


Tyr 


Thr His 


Phe 


Leu 


Thr 


Gin 


His 


Tyr 


Asp 


Ala Lys Pro Gin Gly Arg 


Asp 


Asp Arg 




10 










15 


20 






Tyr 


Cys 


Glu 


Ser 


He 


Met 


Arg 


Arg Arg Gly Leu Thr Ser 


Pro 


Cys Lys 


25 










30 




35 




40 


Asp 


He 


Asn 


Thr 


Phe 


He 


His 


Gly Asn Lys Arg Xaa Ser Arg 


Pro Ser 










45 






50 




55 


Val 


Lys 


Thr 


Arg 


Met 


Glu 


Thr 


Leu Thr Glu Lys Thr 







60 65 



wo 02/083898 PCT/IB01/009I4 

129 

<210> 177 
<211> 111 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -16. .-1 

<400> 177 



Met Leu Leu lie 


Leu Leu 


Ser 


Val Ala 


Leu Leu 


Ala 


Phe Ser Ser Ala 


-15 




-10 






-5 




Gin Asp Leu Asp 


Glu Asp 


Val 


Ser Gin Glu Asp Val 


rro i>eu vaj. xxe 


1 


5 






10 




15 


Ser Asp Gly Gly 


Asp Ser 


Glu 


Gin Phe 


lie Asp Glu 


Glu Arg Gin Gly 


20 






25 






30 


Pro Pro Leu Gly 


Gly Gin 


Gin 


Ser Gin 


Pro Ser 


Ala 


Gly Asp Gly Asn 


35 






40 






45 


Gin Asn Asp Gly 


Pro Gin 


Gin Gly Pro Pro Gin Gin 


Gly Gly Gin Gin 


50 




55 






60 




Gin Gin Gly Pro 


Pro Pro 


Pro 


Gin Gly Lys Pro 


Gin 


Gly Pro Pro Pro 


65 


70 






75 




80 


Gin Gly Gly Arg 


Pro Gin 


Gly 


Pro Pro Gin Gly Gin 


Ser Pro Gin 




85 






90 




95 


<210> 178 














<211> 106 














<212> PRT 














<213> Homo sapiens 












<220> 














<221> SIGNAL 














<222> -16.. -1 














<400> 178 














Met Leu Leu lie 


Leu Leu 


Ser 


Val Ala 


Leu Leu 


Ala 


Phe Ser Ser Ala 


-15 




-10 






-5 




Gin Asp Leu Asp 


Glu Asp 


Val 


Ser Gin Glu Asp Val 


fro jjcU Vax ±xe 


1 


5 






10 




15 


Ser Asp Gly Gly 


Asp Ser 


Glu 


Gin Phe 


lie Asp Glu 


Glu Arg Gin Gly 


20 






25 






30 


Pro Pro Leu Gly 


Gly Gin 


Gin 


Ser Gin 


Pro Ser 


Ala 


uxy Msp o±y Msn 


35 






40 






45 


Gin Asp Asp Gly 


Pro Gin 


Gin Gly Pro 


Pro Gin 


Gin 


Gly Gly Gin Gin 


50 




55 






60 




Gin Gin Gly Pro 


Pro Pro 


Pro 


Gin Gly 


Lys Pro 


Gin 


Gly Pro Pro Gin 


65 


70 






75 




80 


Gin Gly Gly Gin 


Ser Cys 


Cys 


Cys Asp 


Lys 








85 






90 






<210> 179 














<211> 141 














<212> PRT 














<213> Homo sapiens 












<220> 














<221> SIGNAL 














<222> -24. .-1 














<400> 179 














Met Pro Ser Ser 


Val Ser 


Trp Gly lie 


Leu Leu 


Leu 


Ala Gly Leu Cys 




-20 






-15 




-10 


Cys Leu Val Pro 


Val Ser 


Leu 


Ala Glu 


Asp Pro 


Gin 


Gly Asp Ala Ala 



wo 02/083898 



PCT/IBOl/00914 



-5 



Gin Lys 


Thr 


Asp 


Thr 


Ser 


His 


His 


10 










15 




Lys lie 


Thr 


Pro 


Asn 


Leu 


Ala 


Glu 


25 








30 






Leu Ala 


His 


Gin 


Ser 


Asn 


Ser 


Thr 








45 








lie Ala 


1 nr 


Ala 


Pne 


Ala 


Met 


Leu 






60 










His Asp 


Glu 


He 


Leu 


Glu 


Gly Leu 




75 










80 


Glu Ala 


Gin 


He 


His 


Glu 


Gly 


Phe 


90 










95 




Gin Pro 


Asp 


Ser 


Gin 


Leu 


Gin 


Leu 


105 








110 






<210> 180 












<211> 124 












<212> PRT 












<213> Homo sapiens 








<220> 














<221> SIGNAL 










<222> -30..- 


-1 










<400> 180 












Met Pro 


Ala 


Cys 


Arg 


Leu 


Gly 


Pro 


-30 








-25 






Leu Leu 


Leu 


Phe Gly 


Phe 


Thr 


Leu 








-10 








Thr Gly 


Val 


Cys 


Pro 


Glu 


Leu 


Gin 




5 










10 


Cys Val 


Ser 


Asp 


Ser 


Glu 


Cys Ala 


20 










25 




Gly Cys 


Ala 


Thr 


Phe 


Cys 


Ser 


Leu 


35 








40 






Pro Gin 


Val 


Asn 


He 


Asn 


Phe 


Pro 








55 








Cys Gin 


Val 


Asp 


Ser 


Gin 


Cys 


Pro 






70 










Gly Cys 


Gly Lys 


Val 


Ser 


Cys 


Val 



85 90 



<210> 181 
<211> 124 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -30. ,-1 



<400> 181 



Met 


Pro 


Ala 


Cys 


Arg 


Leu 


Gly Pro 


-30 










-25 






Leu 


Leu 


Leu 


Phe 


Gly 
-10 


Phe 


Thr 


Leu 


Thr 


Gly Val 


Cys 


Pro 


Glu 


Leu 


Gin 






5 










10 


Cys 


Val 
20 


Ser 


Asp 


Ser 


Glu 


Cys 
25 


Ala 


Gly 


Cys 


Ala 


Thr 


Phe 


Cys 


Ser 


Leu 



35 40 



1 








5 








Asp 


Gin 


Asp 


His 


Pro 


Thr 


Phe 


Asn 








20 










Phe 


Ala 


Phe 


Ser 


Leu 


Tyr 


Arg 


Gin 






35 










40 


Asn 


He 


Phe 


Phe 


Ser 


Pro 


Val 


Ser 




50 










55 




Ser 


Leu 


Gly 


Thr 


Lys 


Ala 


Asp 


Thr 


65 










70 






Asn 


Phe 


Asn 


Leu 


Thr 


Glu 


He 


Pro 










85 








Gin 


Glu 


Leu 


Leu 


Arg 


Thr 


Leu 


Asn 








100 










Thr 


Thr 


Gly 


Lys 


Asn 












115 












Leu 


Ala 


Ala 


Ala 


Leu 


Leu 


Leu 


Ser 






-20 










-15 


Val 


Ser 


Gly 


Thr 


Gly 


Ala 


Glu 


Lys 




-5 










1 




Ala 


Asp 


Gin 


Asn 


Cys 


Thr 


Gin 


Glu 










15 








Asp 


Asn 


Leu 


Lys 


Cys 


Cys 


Ser 


Ala 








30 










Pro 


Asn 


Asp 


Lys 


Glu 


Gly 


Ser 


Cys 






45 










50 


Gin 


Leu 


Gly 


Leu 


Cys 


Arg 


Asp 


Gin 




60 










65 




Gly 


Gin 


Met 


Lys 


Cys 


Cys 


Arg 


Asn 


75 










80 






Thr 


Pro 


Asn 


Phe 










Leu 


Ala 


Ala 


Ala 


Leu 


Leu 


Leu 


Ser 






-20 










-15 


Val 


Ser 


Gly 


Thr 


Gly 


Ala 


Glu 


Lys 




-5 










1 




Ala 


Asp 


Gin 


Asn 


Cys 


Thr 


Gin 


Glu 










15 








Asp 


Asn 


He 


Lys 


Cys 


Cys 


Ser 


Ala 








30 










Pro 


Asn 


Asp 


Lys 


Glu 


Gly 


Ser 


Cys 



45 50 



wo 02/083898 

Pro Gin Val Asn He 
55 

Cys Gin Val Asp Ser 
70 

Gly Cys Gly Lys Val 
85 



Asn Phe Pro Gin 

Gin Cys Pro Gly 
75 

Ser Cys Val Thr 
90 



131 

Leu Gly Leu Cys Arg 
60 

Gin Met Lys Cys Cys 
80 

Pro Asn Phe 



PCT/IBOl/00914 

Asp Gin 
65 

Arg Asn 



<210> 182 
<211> 69 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -17. .-1 

<223> Von Heijne matrix 

score 9.5148977432302 
seq FVFLLFVISLAAA/AH 



<400> 182 

Met Phe Pro Gly Phe Val Phe Leu Leu Phe Val lie Ser Leu Ala Ala 

-15 -10 -5 

Ala Ala His Leu Trp Val Leu Ala Ala Phe Met Gly Arg He Thr Val 

15 10 15 

Lys Val Cys Ser Phe Thr Pro Glu Ala Ser Lys Thr Val Ser Pro Pro 

20 25 30 

Glu Gly Ala Asn 'Asn Ser Arg Arg Thr Ala Phe Lys Ser Cys Asn Thr 

35 40 45 

His His Lys Gly Leu 
50 

<210> 183 
<211> 77 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -33.. -1 



<220> 

<221> UNSURE 
<222> 6 

<223> Xaa = Lys, Met 



<400> 183 
Met Gly Pro Val 
-30 

He Ala Thr He 
-15 

Ala Phe Trp Trp 
1 

Gin Ser Leu Ala 

Arg Asp Ala Val 
35 



Lys Gin Leu Lys 

Met Val Leu Leu 
-10 

His Asn Xaa Gly 

5 

Leu Thr Trp Tyr 
20 

Lys Lys Cys Phe 



Arg Met Phe Glu 
-25 

Ser Phe Ala Leu 

Leu Ala Leu He 
10 

Ser Leu Ser Phe 
25 

Ala Val Cys Leu 
40 



Pro Thr Arg Leu 
-20 

Thr Leu Cys Ser 
-5 

Phe Cys He Leu 

15 

He Pro Phe Ala 
30 

Ala 



<210> 184 
<211> 146 
<212> PRT 

<213> Homo sapiens 



<220> 



wo 02/083898 



<221> SIGNAL 
<222> -26. .-1 



<400> 184 



Met 


Thr 


Met 


Arg 


Ser 


Leu 


Leu Arg Thr 




-25 










-20 


Trp 


Ala 


Phe 


Cys 


Ala 


Pro 


Gly Ala Arg 


-10 










-5 




Phe 


Ser 


Gin 


Pro 


Gly 


Ser 


Met Gly Leu 








10 






15 


Gin 


Glu 


His 


He 


Met 


Glu 


His Leu Glu 






25 








30 


Ala 


Glu 


Met 


Ser 


Pro 


Gin 


Glu Leu Gin 




40 










45 


Asp 


Tyr 


Asp 


Gly 


Asn 


Asn 


Leu Leu Asp 


55 










60 




He 


Thr 


His 


Val 


His 


Lys 


Glu Glu Gly 










75 






Ser 


Glu 


Asp 


Glu 


Leu 


He 


Asn He He 








90 






95 


Asp 


Lys 


Asn 


Asn 


Asp 


Gly 


Tyr He Asp 






105 








110 



Leu Gin 



120 

<210> 185 
<2H> 62 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 

<222> -46. ,-1 

<223> Von Heijne matrix 

score 8.64329745298384 

seq AVLLLILFAIVFG/LL 

<400> 185 

Met Asp Gin Leu Val Phe Lys Glu Thr 

-45 -40 
Gin Asn Pro Trp Asp Gin Gly Gly Leu 
-30 -25 
Thr Ala Val Leu Leu Leu He Leu Phe 
-10 

Thr Ser Thr Glu Asn Thr Gin Cys Glu 
5 10 

<210> 186 
<2H> 62 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -46. .-1 

<223> Von Heijne matrix 

score 8.64329745298384 

seq AVLLLILFAIVFG/LL 

<400> 186 

Met Asp Gin Leu Val Phe Lys Glu Thr 

-45 -40 
Gin Asn Pro Trp Asp Gin Gly Gly Leu 
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Pro 


Phe 


Leu Cys 


Gly Leu Leu 






-15 










Ala 


Glu 


Glu 


Pro 


Ala 


Ala 


Ser 




1 








5 




Asp 


Lys 


Asn 


Thr 


Val 


His 


Asp 










20 






Gly 


Val 


He 


Asn 


Lys 


Pro 


Glu 








35 








Leu 


His 


Tyr 


Phe 


Lys 


Met 


His 






50 










Gly 


Leu 


Glu 


Leu 


Ser 


Thr 


Ala 




65 










70 


Ser 


Glu 


Gin 


Ala 


Pro 


Leu 


Met 


80 










85 




Asp 


Gly Val 


Leu 


Arg Asp Asp 










100 






Tyr 


Ala 


Glu 


Phe 


Ala 


Lys 


Ser 








115 








He 


Trp 


Asn 


Asp 


Ala 


Phe 


Trp 






-35 










Ala 


Val 


He 


He 


Leu 


Phe 


He 




-20 










-15 


Ala 


He 


Val 


Phe 


Gly Leu 


Leu 


-5 










1 




Ala 


Gly Glu Glu 


Glu 












15 








He 


Trp Asn 


Asp 


Ala 


Phe 


Trp 






-35 










Ala 


Val 


He 


He 


Leu 


Phe 


He 
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-30 ^25 -20 -15 

Thr Ala Val Leu Leu Leu lie Leu Phe Ala lie Val Phe Gly Leu Leu 

-10 -5 1 

Thr Ser Thr Glu Asn Thr Gin Cys Glu Ala Gly Glu Glu Glu 
5 10 15 



<210> 187 
<211> 162 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -47.. -1 



<400> 187 
























Met Arg 


He 


Ala 


Asn Arg 


Thr 


Arg 


Phe Ser Ser 


Pro 


Phe 


Leu 


Ala 


Arg 




-45 










-40 






-35 








Gly Ala 


Gly 


Trp 


Thr 


His 


Gly Arg Gly Met Met 


Val 


Val 


Gly Thr Gly 


-30 










-25 






-20 










Thr Ser 


Leu 


Ala 


Leu 


Ser 


Ser 


Leu Arg Ser Leu 


Leu 


Leu 


Phe 


Ala 


Gly 


-15 








-10 






-5 










1 


Met Gin 


Met 


Tyr 


Ser 


Arg 


Gin 


Leu 


Ala Ser Thr 


Glu 


Trp 


Leu 


Thr 


He 






5 










10 






15 






Gin Gly 


Gly 


Leu 


Leu 


Gly 


Ser 


Gly 


Leu Phe Val 


Phe 


Ser 


Leu 


Thr 


Ala 




20 










25 






30 








Phe Asn 


Asn 


Leu 


Glu 


Asn 


Leu 


Val 


Phe Gly Lys 


Gly 


Phe 


Gin 


Ala 


Lys 


35 










40 






45 










He Phe 


Pro 


Glu 


He 


Leu 


Leu 


Cys 


Leu Leu Leu 


Ala 


Leu 


Phe 


Ala 


Ser 


50 








55 






60 










65 


Gly Leu 


He 


His 


Arg 


Val 


Cys 


Val 


Thr Thr Cys 


Phe 


He 


Phe 


Ser 


Met 








70 








75 








80 




Val Gly 


Leu 


Tyr 


Tyr 


He 


Asn 


Lys 


He Ser Ser 


Thr 


Leu 


Tyr 


Gin 


Ala 






85 










90 






95 






Ala Ala 


Pro 


Val 


Leu 


Thr 


Pro 


Ala 


Lys Val Thr Gly Lys 


Ser 


Lys 


Lys 




100 










105 






110 








Arg Asn 


























115 



























<210> 188 

<211> 90 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -35.. -1 



<400> 188 

Met Glu Thr Gly Ala Ser Ala Ser 
-35 -30 
Met Arg Arg He Trp Ser Leu Gly 
-15 

Ser Trp Ala Ala Leu Phe His Asp 

1 5 
He Val Ser Asp Leu Ser Thr Val 

15 20 
Phe Gin Leu Leu Asp Val Trp Thr 
30 35 
Ser Met Cys Phe Val Phe Leu Leu 
50 



He Pro Glu Leu He Cys Glu Ala 
-25 -20 
Leu Gly Leu Val Thr Leu Thr Ala 

-10 -5 
Gly Phe Ala Val Leu Gly Gly Asn 
10 

Arg Phe Val Ala Gin Gin Gin His 
25 

Arg Asn Phe Arg Lys Pro Leu Gly 
40 45 

Leu Pro 
55 



<210> 189 
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<211> 146 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -25. .-1 

<220> 

<221> UNSURE 
<222> 60 

<223> Xaa = Lys.Arg 
<400> 189 



Met 


Arg 


Leu Leu 


Gin 


Leu 


Leu 


Phe Arg 


-25 








-20 






Leu 


Val 


Leu Cys 


Leu 
-5 


Gin Leu Gly Ala 


Arg 


Lys 


He He 


He 


Lys 


Asn 


Phe Asp 






10 








15 


Asn 


Asp 


Glu Val 


Thr 


Ala 


Val 


Leu Ala 




25 








30 




Cys 


Met 


Val Val 


Lys 


Thr 


Tyr 


Leu He 


40 








45 






Ala 


Phe 


Asn Tyr 


Xaa 


Tyr 


Thr 


Ala Cys 








60 








Thr 


Phe 


Tyr Trp Asp 


Phe 


Tyr 


Thr Asn 






75 








80 


Val 


Val 


Asp Val 


He 


Arg 


Glu 


Leu Gly 






90 








95 


Val 


He 


Pro He 


Lys 


Asn 


Asn 


Arg Phe 




105 








110 




Val 


Glu 













120 

<210> 190 
<211> 120 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 

<222> -32.. -1 

<223> Von Heijne matrix 

score 7.64815044221151 
seq VTLGIGFFALASA/LW 



<400> 190 



Met 


Ala 


Ser 


Pro 


Lys 


Gly 


Phe 


Phe 


Asn 






-30 










-25 




Ala 


Gly 


Ala 


Val 


Thr 


Leu 


Gly 


He Gly 




-15 










-10 






Leu 
1 


Trp 


Phe 


Leu 


He 
5 


Cys 


Lys 


Arg 


Arg 


Phe 


Lys 


Ala 


He 


Asp 


Glu 


Arg 


Cys 


Arg 








20 










25 


He 


Lys 


Ser 


His 


Ser 


Gin 


Cys 


Val 


Phe 






35 










40 




Gly 


Arg 


Phe 


Gin 


Leu 


Gin 


Gin 


Leu 


Lys 




50 










55 






Asn 


Leu 


Gin 


Gin 


Lys 


He 


Ser 


Phe 


Val 


65 










70 








Gin 


Gin 


He 


Ala 


Ala 


Val 


Leu 


His 
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Ala 


Ser 


Pro 


Ala 


Thr 


Leu 


Leu 




-15 










-10 


Asn 


Lys 


Ala 


Gin Asp Asn Thr 


1 








5 






He 


Pro 


Lys 


Ser 


Val 


Arg 


Pro 








20 








Val 


Gin 


Thr 


Glu 


Leu 


Lys 


Glu 






35 










Ser 


Ser 


He 


Pro 


Leu 


Gin 


Gly 




50 










55 


Leu 


Cys 


Asp 


Asp Asn 


Pro 


Lys 


65 










70 




Arg 


Thr 


Val 


Gin 


He 


Ala 


Ala 










85 






He 


Cys 


Pro 


Asp Asp Ala Ala 








100 








Tyr 


Thr 


He 


Glu 


He 


Leu 


Lys 






115 










Tyr 


Leu 


Thr 


Tyr 


Phe 


Leu 


Ala 








-20 








Phe 


Phe 


Ala 


Leu 


Ala 


Ser 


Ala 






-5 










Glu 


He 


Phe 


Gin 


Asn 


Ser 


Lys 


10 










15 




Gin 


Arg 


Pro 


Ser 


Met 


Ala 


Lys 










30 






He 


Ser Arg 


Asn 


Phe 


His 


Thr 








45 








He 


He 


Leu 


Lys 


Met 


Asn 


Pro 






60 










He 


Pro 


Gin 


Arg 


Pro 


Ala 


Pro 




75 










80 



wo 02/083898 PGT/IBO 1/009 14 

135 

85 

<210> 191 
<211> 78 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -19.. -1 

<400> 191 

Met Lys Phe Phe Val Phe Ala Leu Val Leu Ala 

-15 -10 
lie Ser Ala Asp Ser His Glu Lys Arg His His 

1 5 
Phe His Glu Lys His His Ser Tyr His He Thr 

15 20 
Glu Glu Ser Ser Lys Ser Asn Ala Asn Glu Lys 
30 35 40 

Tyr Thr Leu Cys Phe Arg He Leu Ala Phe Ser 

50 55 

<210> 192 
<211> 87 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -20. .-1 
<223> Von Heijne matrix 

score 7.42708462116258 

seq LLVFLAGFPVLDA/ND 



<400> 192 



Met Gin 


Lys 


Val 


Thr 


Leu 


Gly 


Leu 


Leu Val 


Phe 


Leu 


Ala 


Gly 


Phe 


Pro 


-20 








-15 








-10 










-5 


Val Leu 


Asp 


Ala 


Asn 
1 


Asp 


Leu 


Glu 


Asp Lys 
5 


Asn 


Ser 


Pro 


Phe 
10 


Tyr 


Tyr 


Asp Trp 


His 


Ser 


Leu 


Gin 


Val 


Gly Gly Leu 


He 


Cys 


Ala Gly Val 


Leu 




15 










20 








25 








Cys Ala 


Met 


Gly 


He 


He 


He 


Val 


Met Ser 


Ala 


Lys 


Cys 


Lys 


Cys 


Lys 


30 










35 








40 










Phe Gly 


Gin 


Lys 


Ser 


Gly 


His 


His 


Pro Gly Glu 


Thr 


Pro 


Pro 


Leu 


He 


45 








50 








55 










60 


Thr Pro 


Gly 


Ser 


Ala 


Gin 


Ser 



















65 



<210> 193 
<211> 105 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -19.. -1 

<223> Von Heijne matrix 

score 7.20796835452081 
seq VSIMLLLVTVSDC/AV 



Leu Met He Ser Met 
-5 

Gly Tyr Arg Arg Lys 

10 

Leu Leu Pro Leu Phe 
25 

His Tyr Asn Leu Leu 
45 

He Val Thr 



<220> 

<221> unsure 
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<222> 37 

<223> Xaa = * ,Glu 
<400> 193 



Met Arg 


Gly Ala 


Thr Arg Val Ser 


He 


Met 


Leu Leu Leu Val 


Thr Val 








-15 






-10 




-5 


Ser Asp 


Cys 


n±a 


Val 


He Thr Gly 


Ala 


Cys 


Glu Arg Asp Val 


Gin Cys 






1 




5 






1 n 




Gly Ala 


Gly 


Thr 


Cys 


Cys Ala He 


Ser 


Leu 


Trp Leu Arg Gly 


Leu Arg 


15 








20 






25 




Met Cys 


Thr 


Pro 


Leu 


Gly Arg Xaa 


Gly Glu 


Glu Cys His Pro 


Gly Ser 


30 








35 






40 


45 


His Lys 


He 


Pro 


Phe 


Phe Arg Lys 


Arg 


Lys 


nis HIS inr v^ys 


Pro Cys 








50 






55 




60 


Leu Pro 


Asn 


Leu 


Leu Cys Ser Arg 


Phe 


Pro 


Asp Gly Arg Tyr 


Arg Cys 






65 






70 




75 




Ser Met 


Asp 


Leu 


Lys 


Asn He Asn 


Phe 










80 






85 










<210> 194 
















<211> 105 
















<212> PRT 
















<213> Homo i 


sapiens 












<220> 


















<221> SIGNAL 














<222> -19. 


-1 














<223> Von Heijne matrix 










score 


7.20796835452081 










seq VSIMLLLVTVSDC/AV 










<400> 194 
















Met Arg 


Gly Ala Thr 


Arg Val Ser 


He 


Met 


Leu Leu Leu Val 


Thr Val 








-15 






-10 




-5 


Ser Asp 


Cys Ala Val 


He Thr Gly 


Ala 


Cys 


Glu Arg Asp Val 


Gin Cys 






1 




5 






10 




Gly Ala 


Gly Thr Cys 


Cys Ala He 


Ser 


Leu 


Trp Leu Arg Gly 


Leu Arg 


15 








20 






25 




Met Cys 


Thr 


Pro 


Leu 


Gly Arg Glu 


Gly Glu 


Glu Cys His Pro 


Gly Ser 


30 








35 






40 


45 


His Lys 


He 


Pro 


Phe 


Phe Arg Lys 


Arg 


Lys 


His His Thr Cys 


Pro Cys 








50 






55 




60 


Leu Pro 


Asn 


Leu 


Leu 


Cys Ser Arg 


Phe 


Pro 


Asp Gly Arg Tyr 


Arg Cys 






65 






70 




75 




Ser Met 


Asp Leu Lys 


Asn He Asn 


Phe 










80 






85 











<210> 


195 


<211> 


102 


<212> 


PRT 


<213> 


Homo sapiens 


<220> 




<221> 


SIGNAL 


<222> 


-36. .-1 


<220> 




<221> 


UNSURE 


<222> 


21 


<223> 


Xaa = He, Met 



<220> 

<221> UNSURE 
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<222> 52 

<223> Xaa = Ile,Ser 



<400> 195 



Met Ser Asn 


Thr Thr 


Val 


Pro 


Asn 


Ala 


Pro 


Gin 


Ala Asn Ser Asp 


Ser 


-35 






-30 










-25 




Met Val Gly 


Tyr Val 


Leu 


Gly 


Pro 


Phe 


Phe 


Leu 


He Thr Leu Val 


Gly 






-Id 










- iU 




c 
-D 


Val Val Val 


Ala Val 


Val 


Met 


Tyr 


Val 


Gin 


Lys 


Lys Lys Arg Val 


Asp 




1 








5 






10 




Arg Leu Arg 


His His 


Leu 


Leu 


Pro 


Xaa 


Tyr 


Ser 


Tyr Asp Pro Ala 


Glu 


15 








20 








25 




Glu Leu His 


Glu Ala 


Glu 


Gin 


Gin 


Leu 


Leu 


ber 


Asp Met Gly Asp 


Pro 


30 






35 










40 




Lys Val Val 


His Gly 


Trp 


Gin 


Xaa 


Gly 


Tyr 


Gin 


His Lys Arg Met 


Pro 


45 




50 










55 




60 


Leu Leu Asp 


Val Lys 


Thr 


















65 


















<210> 196 




















<211> 137 




















<212> PRT 




















<213> Homo sapiens 


















<220> 




















<221> SIGNAL 


















<222> -18. 


-1 


















<400> 196 




















Met Lys Ala 


Leu lie 


Val 


Leu 


Gly 


Leu 


Val 


Leu 


Leu Ser Val Thr 


Val 




-15 








-10 






-5 




Gin Gly Lys 


Val Phe 


Glu 


Arg 


Cys 


Glu 


Leu 


Ala 


Arg Thr Leu Lys 


Arg 


1 






5 










10 




Leu Gly Met 


Asp Gly 


Tyr 


Arg 


Gly 


He 


Ser 


Leu 


Ala Asn Trp Met 


Cys 


15 




20 










25 




30 


Leu Ala Lys 


Trp Glu 


Ser 


Gly 


Tyr 


Asn 


Thr 


Arg 


Ala Thr Asn Tyr 


Asn 




35 










40 




45 




Ala Gly Asp 


Arg Ser 


Thr 


Asp 


Tyr 


Gly 


He 


Phe 


Gin He Asn Ser Arg 




50 








55 






60 




Tyr Trp Cys 


Asn Asp 


Gly 


Lys 


Thr 


Pro 


Gly 


Ala 


Val Asn Ala Cys 


His 


65 








70 








75 




Leu Ser Cys 


Ser Gly 


Trp 


nlS 


uiy 


ulU 


lie 


vai 


Val Lys Thr Glu 


Met 


80 






85 










90 




Ser Val Ser 


Met Phe 


Lys 


Val 


Val 


Glu 


Cys 


Asn 


Ser Arg He Phe 


Leu 


95 




100 










105 




110 


Leu Gin Leu 


lie Leu 


Ser 


Leu 


Ser 


His 












115 


















<210> 197 




















<211> 101 




















<212> PRT 




















<213> Homo sapiens 


















<220> 




















<221> SIGNAL 


















<222> -29..- 


-1 


















<400> 197 




















Met Arg Glu Glu Lys 


Lys 


Pro 


Phe 


Glu 


Arg 


Glu 


Arg Glu Ser Val 


Cys 




-25 










-20 




-15 




Val Cys Met 


Cys Val 


Phe 


Ser 


Thr 


Gin 


Gly 


Ala 


Leu Gly Glu Met 


Ala 




-10 








-5 






1 




Ala His Phe 


lie Asp 


Glu 


Lys 


Leu 


Arg 


Pro 


Ser 


Glu Gly Asn Gly His 



wo 02/083898 

5 10 



Arg 


Gly 


Thr 


Leu 


Asp Ser 


Leu 


Ser 


Ser 


20 








25 








Ser 


Thr 


Ala 


Asp 


Pro Thr 


Gin 


Ala 


Gly 










40 








Leu 


Pro 


val 


Thr 


Ser Arg 


Ser 


Gin 


Pro 








55 








60 


Pro 


Ala 


Ala 


Ala 


Leu 









70 
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15 

Asp Gin Glu Ser Tyr lie Pro 

30 35 
Pro Glu Leu Leu His Lys Asn 
45 50 
Leu Pro Ser Asp Leu Ala lie 
65 



<210> 198 
<211> 128 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -26. .-1 



<220> 

<221> UNSURE 
<222> 49 

<223> Xaa = Glu, Lys 



<400> 198 












Met 


Cys 


Trp 


Leu Arg Ala 


Trp Gly 




-25 










-20 




Ser 


Leu 


Phe 


Leu 


He 


Gin 


Leu 


Leu 


-10 










-5 






He 


His 


Ser 


Pro 


Arg 


Asn 


Asn 


Gin 








10 










Glu 


Cys 


Ala 


Val 


Lys 


Lys 


Ser 


Cys 






25 










30 


Cys 


Val 


Trp 


Cys 


Ser 


Glu 


Glu 


Lys 




40 










45 




Tyr 


Phe 


Gly 


Cys Arg 


Phe 


Ser 


Ser 


55 










60 






Asp Met 


Phe 


Gly 


He 


Met 


Met 


Leu 










75 








Gly 


Phe 


Val 


Trp 


Tyr Cys 


Cys 


Ala 



90 



Gin 


He 


Leu 


Leu 
-15 


Pro 


Val 


Phe 


Leu 


He 


Ser 


Phe 

1 


Ser 


Glu 


Asn 


Gly 

5 


Phe 


Lys 


Pro 


Arg Asp Gly Asn 


Glu 


Glu 


15 










20 






Gin 


Leu 


Cys 


Thr 


Glu 
35 


Asp 


Lys 


Lys 


Ala Cys 


Xaa 


Lys 


Tyr 


Cys 


Phe 


Pro 








50 










He Tyr 


Trp 


Leu Asn Cys 


Lys 


Val 






65 










70 


Leu 


Leu 
80 


He 


Ala 


Val 


Leu 


He 
85 


Thr 


Tyr 


His 


Phe 


Tyr 


Leu 


Gin 


Asp 


He 


95 










100 







<210> 199 
<211> 142 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -24.. -1 



<220> 

<221> UNSURE 
<222> 90 

<223> Xaa = Asp, Glu 

<220> 

<221> UNSURE 
<222> 80 

<223> Xaa = Leu, Arg, Trp 



<220> 

<221> UNSURE 
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<222> 79 

<223> Xaa = Lys,Asn 
<400> 199 



Met 


Arg 


Leu Ser 


Trp Phe Arg Val 


Leu 








-20 










Ser 


Ala 


Val Ala 


Thr 


Ala 


Thr Gly Ala 






-5 










1 


lie 


Gly 


Val Lys 


Lys 


Arg 


Val 


Asp 


His 




10 








15 






Gly 


Asp 


Val Leu 


His 


Met 


His 


Tyr 


Thr 


25 








30 








Glu 


Phe 


Asp Ser 


Ser 
45 


Leu 


Pro 


Gin 


Asn 


Gly 


Thr 


Gly Gin 


Val 


lie 


Lys 


Gly Trp 






60 










65 


Cys 


Glu 


Gly Glu 


Lys Arg Xaa 


Xaa 


Val 






75 








80 




Gly 


Xaa 


Arg Gly 


Ala 


Pro 


Pro 


Lys 


lie 




90 








95 






Phe 


Glu 


Val Glu 


Leu 


Leu 


Lys 


He 


Glu 



105 110 
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Thr Val Leu Ser He Cys Leu 
-15 -10 
Glu Gly Lys Arg Lys Leu Gin 

5 

Cys Pro He Lys Ser Arg Lys 
20 

Gly Lys Leu Glu Asp Gly Thr 

35 40 
Gin Pro Phe Val Phe Ser Leu 
50 55 
Asp Gin Gly Leu Leu Gly Met 
70 

He Pro Ser Glu Leu Gly Tyr 
85 

Pro Gly Gly Ala Thr Leu Val 
100 

Arg Arg Thr Glu Leu 
115 



<210> 200 
<211> 70 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -24. .-1 

<223> Von Heijne matrix 

score 6.54728084640774 
seq ILLTRPACLGSWA/EI 



<400> 200 
















Met Ala Ser Ser 


Thr 


Ser 


Val Ser 


Thr Gly Gin 


He Leu Leu 


Thr 


Arg 




-20 






-15 




-10 




Pro Ala Cys Leu 


Gly 


Ser 


Trp Ala 


Glu He Arg 


Ser Pro Val 


Arg 


Thr 


-5 








1 


5 






He Ser He Ala 


Ser 


Asp 


Phe Pro Thr Ala Arg 


Val Ser Leu 


Trp 


Val 


10 






15 




20 






Pro Pro Ala Pro 


Gly 


Met 


Val Pro 


He Lys He 


Ser Gly Cys Ala 


Asn 


25 




30 




35 






40 


Trp Ala Phe Ser 


Pro 


Ala 












45 















<210> 201 
<211> 113 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -23. .-1 



<220> 

<221> UNSURE 
<222> 8 

<223> Xaa = His, Lys, Asn, Gin 



<400> 201 

Met Ala Gly Pro Ala Ala Ala Phe Arg Arg Leu Gly Ala Leu Ser Gly 
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-20 -15 -10 

Ala Ala Ala Leu Gly Phe Ala Ser Tyr Gly Ala His Gly Ala Xaa Phe 

-5 15 
Pro Asp Ala Tyr Gly Lys Glu Leu Phe Asp Lys Ala Asn Lys His His 
10 15 20 25 

Phe Leu His Ser Leu Ala Leu Leu Gly Val Pro His Cys Arg Lys Pro 

30 35 40 

Leu Trp Ala Gly Leu Leu Leu Ala Ser Gly Thr Thr Leu Phe Cys Thr 

45 50 55 

Ser Phe Tyr Tyr Gin Ala Leu Ser Gly Asp Pro Ser He Gin Thr Leu 

60 65 70 

Ala Pro Ala Gly Gly Thr Leu Leu Leu Leu Gly Trp Leu Ala Leu Ala 
75 80 85 

Leu 
90 

<210> 202 
<211> 113 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -23.. -1 

<220> 

<221> UNSURE 
<222> 8 

<223> Xaa = Lys, Gin 

<400> 202 



Met 


Ala 


Gly 


Pro Ala 


Ala 


Ala 


Phe Arg Arg Leu Gly Ala Leu 


Ser 


Gly 








-20 






-15 -10 






Ala 


Ala 


Ala 


Leu Gly 


Phe 


Ala 


Ser Tyr Gly Ala His Gly Ala Xaa 


Phe 






-5 








1 5 






Pro 


Asp 


Ala 


Tyr Gly 


Lys 


Glu 


Leu Phe Asp Lys Ala Asn Lys 


His 


His 


10 








15 




20 




25 


Phe 


Leu 


His 


Ser Leu 


Ala 


Leu 


Leu Gly Val Pro His Cys Arg 


Lys 


Pro 








30 






35 


40 




Leu 


Trp 


Ala 


Gly Leu 


Leu Leu Ala Ser Gly Thr Thr Leu Phe 


Cys 


Thr 








45 






50 55 






Ser 


Phe 


Tyr Tyr Gin 


Ala 


Leu 


Ser Gly Asp Pro Ser He Gin 


Thr 


Leu 






60 








65 70 






Ala 


Pro 


Ala 


Gly Gly 


Thr 


Leu 


Leu Leu Leu Gly Trp Leu Ala 


Leu 


Ala 




75 








80 


85 







Leu 
90 

<210> 203 
<211> 54 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -17. .-1 

<223> Von Heijne matrix 

score 6.48133858229514 
seq LTFAQLLFATVLG/IA 



<400> 203 

Met Phe Arg Arg Leu Thr Phe Ala Gin Leu Leu Phe Ala Thr Val Leu 
-15 -10 -5 
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Gly He Ala Gly Gly 


Val 


Tyr He Phe Gin 


Pro 


Val 


Phe Glu Gin 


Tyr 


1 


5 




10 






15 


Ala Lys Asp Gin Lys 


Glu Leu Lys Glu Lys 


Met 


Gin 


Leu Val Gin 


Glu 


20 




25 






30 




Ser Glu Glu Lys Lys 


Ser 












35 














<210> 204 














<211> 83 














<212> PRT 














<213> Homo sapiens 














<220> 














<221> SIGNAL 














<222> -15.. -1 














<400> 204 














Met Arg Leu Phe Leu 


Ser 


Leu Pro Val Leu 


Val 


Val 


Val Leu Ser 


He 


-15 


-10 




-5 






1 


Val Leu Glu Gly Pro 


Ala 


Pro Ala Gin Gly 


Thr 


Pro 


Asp Val Ser 


Ser 


5 




10 






15 




Ala Leu AsD Lvs Leu 


Lys 


Glu Phe Gly Asn Thr Leu 


Glu Asp Lys Ala 


20 




25 






30 




Arg Glu Leu He Ser 


Arg 


He Lys Gin Ser 


Glu 


Leu 


Ser Ala Lys 


Met 


35 




40 




45 






Arg Glu Trp Phe Ser 


Glu Thr Phe Gin Lys 


Val 


Lys 


Asp Lys Leu 


Lys 




55 




60 






65 


He Asp Ser 














<210> 205 














T 1 Qft 

X i. 














<212> PRT 














<213> Homo sapiens 














<220> 














<221> SIGNAL 














<222> -43. .-1 














<400> 205 














Met Phe Arg He Glu 


Gly 


Leu Ala Pro Lys 


Leu Asp 


Pro Glu Glu 


Met 


-40 




-35 






-30 




Lys Arg Lys Met Arg 


Glu Asp Val He Ser 


Ser 


He 


Arg Asn Phe 


Leu 


-25 




-20 






-15 




He Tyr Val Ala Leu 


Leu 


Arg Val Ser Glu 


Cys 


Leu 


Pro Gly Cys 


Asp 


-10 




-5 




1 




5 


Cys Asp Thr Ser Gly 


Glu 


Leu Thr Asp Gly His 


Pro 


Leu Thr Leu 


Arg 


10 




15 






20 




Gly His Arg Gly Leu 


Arg 


Thr Glu Leu Asn 


Gly 


Ser 


Gly Glu Gin 


Gly 


25 




30 






35 




Gly Ser He Tyr Leu 


Lys 


Glu He Gly Gin 


His 


Met 


Lys Thr Gly 


His 


40 




45 






50 




His He 















55 



<210> 206 
<211> 82 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 

<222> -43. .-1 

<223> Von Heijne matrix 
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score 5.65876793443964 
seq VVSFALIATLVYA/LF 

<400> 206 



Mat 2X1 a T tie 

net MXo i^ys 


Tyr Gin Gly Glu Val Gin 


Ser Leu 


Lys 


ueu Asp 


Asp Asp 




-40 


-35 






-30 






Ser Val lie 


Glu Gly Val 


Ser Asp Gin 


Val Leu 


Val 


Ala Val 


Val 


Val 


-25 




-20 






-15 






Ser Phe Ala 


Leu He Ala 


Thr Leu Val 


Tyr Ala 


Leu 


Phe Arg 


2X e n 


V Cl X 


-10 




-5 




J. 






5 


His Gin Asn 


He His Pro 


Glu Asn Gin 


Glu Leu 


Val 


Arg Val 


Leu 


Arg 




10 




15 






20 




Glu Gin Leu 


Gin Thr Glu Gin Asp Ala 


Pro Ala 


Asp 


Ser Thr 


Aia 


vax 




25 


30 












Leu His 
















<210> 207 
















<211> 82 
















<212> PRT 
















<213> Homo sapiens 














<220> 
















<221> SIGNAL 














<222> -43..- 


-1 














<400> 207 
















Met Ala Lys 


Tyr Gin Gly Glu Val Gin 


Ser Leu 


Lys 


Leu Asp 


Asp Asp 




-40 


-35 






-30 






Ser Val He 


Glu Gly Val 


Ser Asp Gin 


Val Leu 


Val 


Ala Val 


Val 


Val 


-25 




-20 






-15 






Ser Phe Ala 


Leu He Ala 


Thr Leu Val 


Tyr Ala 


Leu 


Phe Arg 


Asn 


Val 


-10 




-5 




1 






5 


His Gin Asn 


He His Pro 


Glu Asn Gin 


Glu Leu Val Arg Val 


Leu 


Arg 




10 




15 






20 




Glu Gin Leu 


Gin Thr Glu 


Gin Asp Ala 


Pro Ala 


Asp 


Ser Thr 


Ala 


Val 




25 


30 






35 






Leu His 
















<210> 208 
















<211> 82 
















<212> PRT 
















<213> Homo sapiens 














<220> 
















<221> SIGNAL 














<222> -43..- 


-1 














<220> 
















<221> UNSURE 














<222> 32 
















<223> Xaa ^ 


Ala, Gly 














<400> 208 
















Met Ala Lys 


Tyr Gin Gly Glu Val Gin 


Ser Leu Lys 


Leu Asp 


Asp Asp 




-40 


-35 






-30 






Ser Val He 


Glu Gly Val 


Ser Asp Gin 


Val Leu 


Val 


Ala Val 


Val 


Val 


-25 




-20 






-15 






Ser Phe Ala 


Leu He Ala 


Thr Leu Val 


Tyr Ala 


Leu 


Phe Arg 


Asn 


Val 


-10 




-5 




1 






5 


His Gin Asn 


He His Pro 


Glu Asn Gin 


Glu Leu 


Val 


Arg Val 


Leu 


Arg 




10 




15 






20 




Glu Gin Leu 


Gin Thr Glu 


Gin Asp Ala 


Pro Xaa 


Asp 


Ser Thr 


Ala 


Val 




25 


30 






35 
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Leu His 



<210> 209 
<211> 82 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -43. .-1 



<400> 209 

Met Ala Lys Tyr Gin Gly Glu Val 
-40 

Ser Val lie Glu Gly Val Ser Asp 
-25 -20 
Ser Phe Ala Leu lie Ala Thr Leu 

-10 -5 
His Gin Asn lie His Pro Glu Asn 
10 

Glu Gin Leu Gin Thr Glu Gin Asp 
25 

Leu His 



Gin Ser Leu Lys Leu Asp Asp Asp 
-35 -30 
Gin Val Leu Val Ala Val Val Val 
-15 

Val Tyr Ala Leu Phe Arg Asn Val 

1 5 
Gin Glu Leu Val Arg Val Leu Arg 

15 20 
Ala Pro Ala Asp Ser Thr Ala Val 
30 35 



<210> 210 
<211> 99 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -28. .-1 



<220> 

<221> UNSURE 

<222> 2,16 

<223> Xaa = Phe, Ser 



<400> 210 








Met Thr 


Ser 


Ala 


Leu Thr Gin Gly 






-25 






Gly Tyr 


Leu 


Val 


Leu 


Ser Glu Gly 




-10 






-5 


Leu Glu 


Asn 


Asp 


Glu 


Gin Ala Ala 


5 








10 


Thr Ala 


Cys 


Gly 


Phe 


Arg Leu His 








25 




Arg Leu 


Ser 


Val 


Val 


Phe Gly Glu 






40 






Gly Gin 


Arg 


Val 


Phe 


Val Val Lys 




55 






60 


He Asp 


Val 









70 



Leu Glu Arg lie Pro Asp Gin Leu 
-20 -15 
Ala Val Leu Ala Ser Xaa Gly Asp 
1 

Ser Ala He Xaa Glu Leu Val Ser 

15 20 
Arg Gly Met Asn Val Pro Phe Lys 

30 35 
His Thr Leu Leu Val Thr Val Ser 
45 50 
Arg Gin Asn Arg Gly Arg Glu Pro 
65 



<210> 211 
<211> 99 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -28. .-1 



wo 02/083898 

<220> 

<221> UNSURE 
<222> 16 

<223> Xaa = Phe,Ser 



<400> 211 



Met Thr 


Ser 


Ala 


Leu 


Thr Gin Gly Leu 






-25 




-20 


Gly Tyr 


Leu 


Val 


Leu 


Ser Glu Gly Ala 




-10 






-5 


Leu Glu 


Asn 


Asp 


Glu 


Gin Ala Ala Ser 


5 








10 


Thr Ala 


Cys 


Gly 


Phe 


Arg Leu His Arg 








25 




Arg Leu 


Ser 


Val 


Val 


Phe Gly Glu His 






40 




45 


Gly Gin 


Arg 


Val 


Phe 


Val Val Lys Arg 




55 






60 


lie Asp 


Val 









70 



<210> 212 
<211> 186 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -15. .-1 

<220> 

<221> UNSURE 
<222> 37 

<223> Xaa = Cys, Gly 
<220> 

<221> UNSURE 
<222> 11 

<223> Xaa = Cys,Trp 
<220> 

<221> UNSURE 
<222> 149 

<223> Xaa = Gly, Arg 



<400> 212 



Met 


His 


Val 


Met 


Ala 


Ala Ser Met Ala 


-15 










-10 


Val 


Leu 


Leu 


Gin 


Ala 


Ala Arg Gly Thr 








5 




10 


Thr 


Ser 


Gly 


Ser 


Gly 


Glu Gly Val Ala 






20 






25 


Gin 


Ala 


Thr 


Xaa 


Ser 


Arg Pro Ala Gly 




35 








40 


Glu 


Arg 


Pro 


Gly 


Asp 


Val Val Asn Val 


50 










55 


Gin 


Arg 


lie 


Pro 


Val 


Ser Gly Arg Val 










70 




Ala 


Gin 


Arg 


His 


Gly 


Val Asp Leu Glu 








85 




90 


Ala 


Cys 


Ser 


Thr 


Cys 


His Val Tyr Val 






100 






105 


Leu 


Pro 


Pro 


Pro 


Glu 


Glu Arg Glu Asp 
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Glu Arg lie Pro Asp Gin Leu 
-15 

Val Leu Ala Ser Ser Gly Asp 
1 

Ala lie Xaa Glu Leu Val Ser 

15 20 
Gly Met Asn Val Pro Phe Lys 
30 35 
Thr Leu Leu Val Thr Val Ser 
50 

Gin Asn Arg Gly Arg Glu Pro 
65 



Arg Gly Gly Val Ser Ala Arg 

-5 1 
Xaa Trp Asn Arg Pro Gly Gly 

15 

Leu Gly Thr Thr Arg Lys Phe 

30 

Glu Glu Asp Ala Gly Gly Pro 
45 

Val Phe Val Asp Arg Ser Gly 

60 65 
Gly Asp Asn Val Leu His Leu 
75 80 
Gly Ala Cys Glu Ala Ser Leu 
95 

Ser Glu Asp His Leu Asp Leu 
110 

Asp Met Leu Asp Met Ala Pro 



wo 02/083898 

115 120 
Leu Leu Gin Glu Asn Ser Arg Leu Gly 
130 135 
Glu Leu Glu Xaa Ala Glu Phe Thr Leu 
150 

Tyr Val Asp Gly His Val Pro Lys Pro 
165 170 

<210> 213 
<211> 123 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -48. .-1 

<220> 

<221> UNSURE 
<222> 75 

<223> Xaa = Ala, Asp 
<400> 213 



Met 


Glu 


Ala Leu Gly Lys Leu Lys Gin 


Phe Asp 


Ala 


Tyr Pro 


Lys 


Thr 






-45 


-40 






-35 






Leu 


Glu 


Asp Phe Arg Val Lys Thr Cys 


Gly Gly 


Ala 


Thr Val 


Thr 


lie 






-30 


-25 






-20 






Val 


Ser 


Gly Leu 


Leu Met Leu Leu Leu 


Phe Leu 


Ser 


Glu Leu 


Gin 


Tyr 




-15 




-10 




-5 








Tyr 


Leu 


Thr Thr 


Glu Val His Pro Glu 


Leu Tyr 


Val 


Asp Lys 


Ser 


Arg 


1 






5 


10 






15 




Gly 


Asp 


Lys Leu 


Lys lie Asn lie Asp 


Val Leu 


Phe 


Pro His 


Met 


Pro 






20 


25 






30 






Cys 


Ala 


Tyr Leu 


Ser lie Asp Ala Met 


Asp Val 


Ala 


Gly Glu 


Gin 


Gin 






35 


40 






45 






Leu 


Asp 


Val Glu 


His Asn Leu Phe Lys 


Gin Arg 


Leu Asp Lys Asp Gly 




50 




55 




60 








lie 


Pro 


Val Ser 


Ser Glu Ala Glu Arg 


His Xaa 










65 






70 


75 











<210> 214 

<211> 77 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -29. .-1 

<400> 214 



Met 


Ala Ala 


Thr 


Asp 


Phe Val Gin Glu 


Met 


Arg 


Ala 


Val 


Gly 


Glu 


Arg 








-25 




-20 










-15 




Leu 


Leu Leu 


Lys 


Leu Gin Arg Leu Pro 


Gin 


Ala 


Glu 


Pro 


Val 


Glu 


He 






-10 




-5 










1 






Val 


Ala Phe 


Ser 


Val 


He He Leu Phe 


Thr 


Ala 


Thr 


Val 


Leu 


Leu 


Leu 




5 






10 






15 










Leu 


Leu He 


Ala 


Cys 


Ser Cys Cys Cys 


Thr 


His 


Cys 


Cys 


Cys 


Pro 


Glu 


20 








25 




30 










35 


Arg Arg Gly 


Arg 


Lys 


Val Gin Val Gin 


Pro 


Thr 


Pro 


Pro 














40 




45 
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145 

125 

Cys Gin He Val Leu Thr Pro 
140 145 
Pro Lys He Thr Arg Asn Phe 
155 160 
His 



<210> 215 
<211> 107 
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<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -27. .-1 



<400> 215 












Met Leu 


Phe 


Glu Glu Ala 


Leu 


Pro Leu Ser Cys 


Ser Asp Pro 


Val Leu 




-25 






-20 


-15 




Ser Thr 


Leu 


Ser Leu Val 


Gin 


Phe Ser Pro Ser 


Gly Arg Thr 


Gin Asp 


-10 






-5 




1 


5 


Leu Leu 


Ser 


Pro Gly Val Glu Asn Leu Ser Val 


Leu Asp Val 


Ser Pro 






10 




15 




20 


Leu Gly 


Leu 


Ala Cys Cys 


Leu 


Leu Thr Leu Thr 


Met Ser Cys 


Pro Gly 






25 




30 


35 




Pro Asp 


Pro 


Pro Glu Gly 


Pro 


Gly Thr Gin Arg 


Val Trp Gin Gly Ala 




40 






45 


50 




Leu Arg 


He 


Leu Gin Leu 


Pro 


Gly Ala Pro Asp 


Gly Val Ser 


Pro Tyr 


55 






60 




65 




Gin Pro 


Val 


Trp Ser Arg 


Thr 


Pro Asp Leu Lys 







70 75 80 



<210> 216 
<211> 112 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -39.. -1 



<400> 216 




Met 


Leu 


Glu 


His 


Gin 


Lys 


Leu 


Ala 








-20 


He 


Val 


Gly 


Phe 






-5 




Val 


Tyr 


He 


Val 


10 








Pro 


Pro 


Trp 


Pro 


Gin 


Ala 


Gin 


Thr 








45 


Leu 


Lys 


He 


He 






60 





Leu Ser Ser Leu 
-35 

Glu Gin Met Phe 

He Tyr Gly Tyr 
1 

Met Ala Gly Phe 
15 

He Tyr Arg Arg 
30 

Thr Arg Asn Gin 

Glu Val Phe Met 
65 



Pro Thr Gin Met 
-30 

Gin Gly He He 
-15 

Val Ala Glu Gin 
5 

Ala Phe Ser Cys 
20 

His Pro Leu Lys 
35 

Gly Lys Glu Lys 
50 

He Gin His Leu 



Asp Tyr Lys Gly 
-25 

Leu Phe Ser Ala 
-10 

Phe Gly Trp Thr 

Leu Leu Thr Leu 
25 

Trp Leu Pro Val 
40 

Leu Arg Gly Met 
55 

Leu Leu Phe Leu 
70 



<210> 217 

<211> 71 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -23.. -1 

<223> Von Heijne matrix 

score 5.918755252201 
seq SALMLLPCRPVLT/SV 



<400> 217 

Met Met Asn Asn Gly Leu Leu Gin Gin Pro Ser Ala Leu Met Leu Leu 
-20 -15 -10 
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Pro Cys Arg Pro Val Leu Thr Ser Val Ala Leu Asn Ala Asn Phe Val 

-5 15 
Ser Trp Lys Ser Arg Thr Lys Tyr Thr lie Thr Pro Val Lys Met Arg 
10 15 20 25 

Lys Ser Gly Gly Arg Asp His Thr Gly Ala Gly Asn Val Arg Ser Asn 

30 35 40 

Ser Arg Pro Ser lie Gin Arg 
45 



<210> 218 
<211> 71 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -23. .-1 

<223> Von Heijne matrix 

score 5.918755252201 
seq SALMLLPCRPVLT/SV 



<400> 218 










Met Met Asn 


Asn Gly 


Leu 


Leu 


Gin 




-20 








Pro Cys Arg 


Pro Val 


Leu 


Thr 


Ser 


-5 








1 


Ser Trp Lys 


Ser Arg 


Thr 


Lys 


Tyr 


10 




15 






Lys Ser Gly 


Gly Arg 


Asp 


His 


Thr 




30 








Ser Arg Pro 


Ser He 


Gin 


Arg 






45 









Gin Pro Ser Ala Leu Met Leu Leu 
-15 -10 
Val Ala Leu Asn Ala Asn Phe Val 
5 

Thr He Thr Pro Val Lys Met Arg 

20 25 
Gly Ala Gly Asn Val Arg Ser Asn 
35 40 



<210> 219 
<211> 92 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -15. .-1 

<223> Von Heijne matrix 

score 4.68058603039206 
seq VLAGSLLGPTSRS/AA 



<400> 219 

Met Ala Val Leu Ala Gly Ser Leu 
-15 -10 
Ala Leu Leu Gly Gly Arg Trp Leu 
5 

Pro Asp Ala Trp Gly Leu Pro Thr 

20 25 
Arg Gly Asn Glu Tyr Gin Pro Ser 

35 40 
Gly Trp Val Arg Arg Leu Ser Thr 
50 55 
Arg Arg Met Leu Lys Gly Arg Lys 
70 



Leu Gly Pro Thr Ser Arg Ser Ala 

-5 1 
Gin Pro Arg Ala Trp Leu Gly Phe 

10 15 
Pro Gin Gin Ala Arg Gly Lys Ala 
30 

Asn He Lys Arg Lys Asn Lys His 
45 

Pro Ala Gly Xaa Gin Val He Leu 

60 65 
Ser Leu Ser His 
75 



<210> 220 
<211> 92 
<212> PRT 

<213> Homo sapiens 



wo 02/083898 



148 



PCT/IBOl/00914 



<220> 

<221> SIGNAL 
<222> -15. .-1 



<220> 

<221> UNSURE 
<222> 56 

<223> Xaa = Asn,Ser 



<400> 220 








Met Ala 


Val 


Leu Ala Gly 


Ser 


Leu 


-15 




-10 






Ala Leu 


Leu 


Gly Gly Arg 

5 


Trp Leu 


Pro Asp 


Ala 


Trp Gly Leu 


Pro 


Thr 




20 






25 


Arg Gly 


Asn 


Glu Tyr Gin 


Pro 


Ser 


35 






40 




Gly Trp 


Val 


Arg Arg Leu 


Xaa 


Thr 


50 




55 






Arg Arg 


Met 


Leu Lys Gly 


Arg 


Lys 



70 



Leu Gly Pro Thr Ser Arg Ser Ala 

-5 1 
Gin Pro Arg Ala Trp Leu Gly Phe 

10 15 
Pro Gin Gin Ala Arg Gly Lys Ala 
30 

Asn He Lys Arg Lys Asn Lys His 
45 

Pro Ala Gly Val Gin Val lie Leu 

60 65 
Ser Leu Ser His 
75 



<210> 221 
<211> 92 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -15, .-1 



<220> 

<221> UNSURE 
<222> 56 

<223> Xaa « Asn, Ser 

<220> 

<221> UNSURE 
<222> 62 

<223> Xaa = Lys, Gin, Arg 



<400> 221 








Met Ala 


Val 


Leu Ala Gly 


Ser 


Leu 


-15 




-10 






Ala Leu 


Leu 


Gly Gly Arg 
5 


Trp 


Leu 


Pro Asp 


Ala 


Trp Gly Leu 


Pro 


Thr 




20 






25 


Arg Gly 


Asn 


Glu Tyr Gin 


Pro 


Ser 


35 






40 




Gly Trp 


Val 


Arg Arg Leu 


Xaa 


Thr 


50 




55 






Arg Arg 


Met 


Leu Lys Gly 


Arg 


Lys 



70 



Leu Gly Pro Thr Ser Arg Ser Ala 

-5 1 
Gin Pro Arg Ala Trp Leu Gly Phe 
10 15 
Pro Gin Gin Ala Arg Gly Lys Ala 
30 

Asn He Lys Arg Lys Asn Lys His 
45 

Pro Ala Gly Val Xaa Val He Leu 

60 65 
Ser Leu Ser His 
75 



<210> 222 
<211> 92 
<212> PRT 

<213> Homo sapiens 



<220> 
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<221> SIGNAL 
<222> -15. .-1 



<220> 

<221> UNSURE 
<222> 61 

<223> Xaa = * ,Glu,Leu,Val 
<220> 

<221> UNSURE 
<222> 62 

<223> Xaa = Ala, Glu, Pro, Gin 



<220> 

<221> UNSURE 
<222> 56 

<223> Xaa « Asn,Ser 
<220> 

<221> UNSURE 
<222> -6 

<223> Xaa = Asp, Gly 
<220> 

<221> UNSURE 
<222> 6 

<223> Xaa = Gly, Ser 

<220> 

<221> UNSURE 
<222> 63 

<223> Xaa = Phe,Val 



<400> 222 










Met Ala 


Val 


Leu Ala 


Gly 


Ser 


Leu 


-15 






-10 






Ala Leu 


Leu 


Gly Xaa 
5 


Arg 


Trp 


Leu 


Pro Asp 


Ala 


Trp Gly 


Leu 


Pro 


Thr 




20 








25 


Arg Gly 


Asn 


Glu Tyr 


Gin 


Pro 


Ser 


35 








40 




Gly Trp 


Val 


Arg Arg 


Leu 


Xaa 


Thr 


50 






55 






Arg Arg 


Met 


Leu Lys 


Gly Arg Lys 



70 



Leu 


Xaa 


Pro Thr Ser Arg 


Ser 


Ala 






-5 




1 


Gin 


Pro 


Arg Ala Trp Leu 


Gly 


Phe 


10 




15 






Pro 


Gin 


Gin Ala Arg Gly 


Lys 


Ala 






30 






Asn 


lie 


Lys Arg Lys Asn 


Lys 


His 






45 






Pro 


Ala 


Gly Xaa Xaa Xaa 


He 


Leu 






60 




65 


Ser 


Leu 


Ser His 







75 



<210> 223 

<211> 87 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -39. .-1 

<400> 223 

Met Lys Val Leu Leu Leu Thr Gly 
-35 

Tyr Trp Asp Asp Asn Phe Asp Pro 
-20 

Leu Leu Thr Gly Ala Asn Ala Gly 
-5 1 



Leu Gly Ala Leu Phe Phe Ala Tyr 

-30 -25 
Ala Ser Leu Gin Gly Ala Arg Val 
-15 -10 
Val Gly Glu Glu Leu Ala Tyr His 
5 



1 
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Tyr Ala Arg Leu Gly Ser His Leu Val Leu Thr Ala His Thr Glu Ala 
10 15 20 25 

Leu Leu Gin Lys Ala Arg Trp Leu Thr Leu Val Val Ser Thr Leu Gly 

30 35 40 

Gly Arg Gly Lys Trp He Thr 
45 



<210> 224 
<211> 125 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -15.. -1 



<220> 

<221> UNSURE 
<222> 100 

<223> Xaa = Asn, Thr 

<220> 

<221> UNSURE 
<222> 30 

<223> Xaa = He, Met 



<400> 224 


























Met 


Lys 


Leu 


Leu 


Thr 


His 


Asn 


Leu 


Leu 


Ser 


Ser 


His 


Val 


Arg 


Gly Val 


-15 










-10 










-5 








1 


Gly 


Ser 


Arg 


Gly 
5 


Phe 


Pro 


Leu 


Arg 


Leu 
10 


Gin 


Ala 


Thr 


Glu 


Val 
15 


Arg He 


Cys 


Pro 


Val 
20 


Glu 


Phe 


Asn 


Pro 


Asn 
25 


Phe 


Val 


Ala 


Arg 


Xaa 
30 


He 


Pro Lys 


Val 


Glu 


Trp 


Ser 


Ala 


Phe 


Leu 


Glu 


Ala 


Ala 


Asp 


Asn 


Leu Arg 


Leu He 




35 










40 










45 








Gin 


Val 


Pro 


Lys 


Gly 


Pro 


Val 


Glu Gly 


Tyr 


Glu 


Glu 


Asn 


Glu 


Glu Phe 


50 










55 










60 








65 


Leu Arg 


Thr 


Met 


His 


His 


Leu 


Leu 


Leu 


Glu 


Val 


Glu 


Val 


He 


Glu Gly 










70 










75 










80 


Thr 


Leu 


Gin 


Cys 


Pro 


Glu 


Ser 


Gly Arg 


Met 


Phe 


Pro 


He 


Ser Arg Gly 








85 










90 










95 




He 


Pro 


Xaa 
100 


Met 


Leu 


Leu 


Ser 


Glu 
105 


Glu 


Glu 


Thr 


Glu 


Ser 
110 







<210> 225 
<211> 93 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -22. .-1 



<400> 225 



Met 


Lys 


He 
-20 


Ser 


Val 


Ala 


Ala 


He 
-15 


Pro 


Phe 


Phe 


Leu 


Leu 
-10 


He Thr He 


Ala 


Leu Gly 


Thr 


Lys 


Thr 


Glu 


Ser 


Ser 


Ser 


Arg 


Gly 


Pro 


Tyr His Pro 




-5 










1 








5 






10 


Ser 


Glu 


Cys 


Cys 


Phe 


Thr 


Tyr 


Thr 


Thr 


Tyr 


Lys 


He 


Pro Arg Gin Arg 










15 










20 








25 


He 


Met 


Asp 


Tyr 


Tyr Glu Thr Asn Ser 


Gin 


Cys 


Ser Lys 


Pro Gly He 








30 










35 










40 


Val 


Phe 


He 


Thr 


Lys 


Arg 


Gly 


His 


Ser 


Val 


Cys 


Thr 


Asn 


Pro Ser Asp 
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45 50 55 

Lys Trp Val Gin Asp Tyr He Lys Asp Met Lys Glu Asn 
60 65 70 



<210> 226 
<211> 72 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -42. .-1 



<400> 226 

Met Asn Arg Leu Ala 
-40 

Ser Ser Leu Leu Gin 
-25 

Glu Val Ala Ala Ala 
-10 

Leu Ser Ala Asp Lys 
10 

Gin Thr Leu Met Ser 
25 



Gly Val Gly Trp Arg Val 
-35 

Ser Val Glu Glu Pro Met 
-20 

Pro Gly Thr Pro Ala Gin 
-5 1 
Phe Gin Val Leu Leu Ala 
15 

Ser Leu Gly 
30 



Asp Tyr Thr Leu Ser 
-30 

Val His Leu Arg Leu 
-15 

Pro Val Ala Met Ser 
5 

Glu Leu Lys Gin Ala 
20 



<210> 227 
<211> 84 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -20.. -1 

<223> Von Heijne matrix 

score 4.37180298395146 
seq SLLLSLPPHQGLT/FS 



<400> 227 

Met Ala Ala Ala Ala 
-20 

Gin Gly Leu Thr Phe 
1 

Val Leu His Pro Glu 
15 

Ser Arg Gin Leu Arg 
30 

Leu Cys Gin Thr Gin 
45 

Thr Val Arg Thr 



Val Pro Ser Leu Leu Leu 
-15 -10 
Ser Asn Lys He Gin Pro 
5 

Pro Gly Leu Arg Asp Trp 
20 

Val Ala Leu Pro Glu Lys 
35 

Leu Pro Ala Thr Pro Cys 
50 55 



Ser Leu Pro Pro His 
-5 

Phe Gly Ala Gin Gly 
10 

Leu Leu Pro Thr Cys 
25 

Gly Ser Glu Gly Ser 
40 

Phe Leu Pro Ser Asn 
60 



<210> 228 
<211> 54 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -19. .-1 



<400> 228 

Met Lys Phe Asn Phe Val Leu Leu He Leu Ser Lys Val Leu Asp Asp 

-15 -10 -5 

Thr Phe Gin Ser Val Lys Lys Trp Leu Asn Tyr Phe Gin Phe Thr Leu 
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15 10 
Arg Asn Gly Leu Met Trp Pro Gly Ala Val Ala His Ala Cys Asn Pro 

15 20 25 

Ser Thr Gly Ser Arg Leu 
30 35 

<210> 229 
<211> 54 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -19. .-1 

<223> Von Heijne matrix 

score 4.24406450728052 

seq LLILSKVLDDTFQ/SV 

<400> 229 

Met Lys Phe Asn Phe Val Leu Leu lie Leu Ser Lys Val Leu Asp Asp 

-15 -10 -5 

Thr Phe Gin Ser Val Lys Lys Trp Leu Asn Tyr Phe Gin Phe Thr Leu 

15 10 
Arg Asn Gly Leu Met, Trp Pro Gly Ala Val Ala His Ala Cys Asn Pro 

15 20 25 

Ser Thr Gly Ser Arg Leu 
30 35 



<210> 230 
<211> 54 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -19. .-1 

<223> Von Heijne matrix 

score 4 .24406450728052 
seq LLILSKVLDDTFQ/SV 



<400> 230 

Met Lys Phe Asn Phe Val Leu Leu 

-15 

Thr Phe Gin Ser Val Lys Lys Trp 

1 5 
Arg Asn Gly Leu Met Trp Pro Gly 

15 20 
Ser Thr Gly Ser Arg Leu 
30 35 



lie Leu Ser Lys Val Leu Asp Asp 

-10 -5 

Leu Asn Tyr Phe Gin Phe Thr Leu 
10 

Ala Val Ala His Ala Cys Asn Pro 
25 



<210> 231 

<211> 91 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -19. .-1 

<400> 231 

Met Tyr Gly Lys He He Phe Val 
-15 

He Ser Ala Leu Ser Thr Thr Glu 



Leu Leu Leu Ser Glu He Val Ser 

-10 -5 
Val Ala Met His Thr Ser Thr Ser 



wo 02/083898 









1 






5 




Ser 


Ser 
15 


Val 


Thr 


Lvs 


Ser Tyr 
20 


He 


Ser 


Gly Gin Leu 


Val 


His 


Arg Phe 


Thr 


Val 


30 










35 






Leu 


He 


He 


Leu 


Cys 
50 


Val Met 


Ala 


Gly 


lie 


Ser 


Tyr 


Ser 
65 


He 


Arg Arg 


Leu 


He 
70 
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10 

Ser Gin Thr Asn Gly Glu Met 
25 

Pro Ala Pro Val Val He He 

40 45 
He He Gly Thr He Leu Leu 
55 60 
Lys Ala 



<210> 232 
<211> 114 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -32, .-1 



<400> 232 








Met Gin 


Met 


Tyr 


Ser 


Arg Gin Leu 




-30 






-25 


Gin Gly 


Gly 


Leu 


Leu 


Gly Ser Gly 


-15 








-10 


Phe Asn 


Asn 


Leu 


Glu 


Asn Leu Val 


1 






5 




He Phe 


Pro 


Glu 


He 


Leu Leu Cys 






20 






Gly Leu 


He 


His 


Arg 


Val Cys Val 




35 






40 


Val Gly 


Leu Tyr 


Tyr 


He Asn Lys 


50 








55 


Ala Ala 


Pro 


Val 


Leu 


Thr Pro Ala 


65 








70 


Arg Asn 











Ala 


Ser Thr 


Glu 


Trp Leu 


Thr 


He 








-20 






Leu 


Phe Val 


Phe 
-5 


Ser Leu 


Thr 


Ala 


Phe 


Gly Lys 


Gly 


Phe Gin 


Ala 


Lys 




10 






15 




Leu 


Leu Leu 


Ala 


Leu Phe 


Ala 


Ser 


25 






30 






Thr 


Thr Cys 


Phe 


He Phe 


Ser 


Met 








45 






He 


Ser Ser 


Thr 


Leu Tyr 


Gin 


Ala 






60 








Lys 


Val Thr 


Gly Lys Ser 


Lys 


Lys 




75 








80 



<210> 233 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -35. .-1 



<400> 233 



Met 


Ala 


Pro 


Glu 


Val 


Leu 


Pro 


Lys 


Pro 


Arg Met 


Arg Gly Leu Leu 


Ala 


-35 










-30 










-25 










-20 


Arg Arg Leu Arg Asn 


His 


Met 


Ala 


Val 


Ala 


Phe 


Val 


Leu 


Ser 


Leu Gly 










-15 










-10 










-5 




Val 


Ala 


Ala 


Leu 
1 


Tyr 


Lys 


Phe 


Arg 
5 


Val 


Ala 


Asp 


Gin 


Arg 
10 


Lys 


Asn 


Ala 


Tyr 


Ala 
15 


Asp 


Phe 


Tyr 


Arg 


Asn 
20 


Tyr 


Asp 


Val 


Met 


Lys 
25 


Asp 


Phe 


Glu 


Glu 


Met 


Arg 


Lys 


Ala 


Gly 


He 


Phe 


Gin 


Ser 


Val 


Lys 













30 35 40 

<210> 234 
<211> 108 
<212> PRT 

<2l3> Homo sapiens 



<220> 
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<221> SIGNAL 
<222> -40. .-1 



<400> 234 



Met Gin Lys 


Phe 


Arg 


Lys 


Met 


Ser 


Glu 


Thr 


His His 


Ser 


Val 


He 


Ser 


-40 






-35 










-30 








-25 


Val Lys Ala 


Ser 


Ser 


Pro 


Trp Leu 


Ser 


Ser 


Ser Val 


Thr 


Ala 


Pro 


Ser 






-20 










-15 








-10 




Met Val Ala 


Pro 


Val 


Thr 


Phe 


Ala 


Ser 


He 


Val Glu 


Glu 


Glu 


Leu 


Gin 




-5 










1 






5 








Gin Glu Ala 


Ala 


Leu 


He Arg Ser Arg Glu Lys Pro 


Leu 


Ala 


Leu 


He 


10 








15 








20 










Gin lie Glu 


Glu 


His 


Ala 


He Gin Asp Leu Leu Val 


Phe 


Tyr 


Glu 


Ala 


25 






30 










35 








40 


Phe Gly Asn 


Pro 


Glu 


Glu 


Phe 


Val 


He 


Val 


Glu Arg Thr 


Pro 


Gin Gly 






45 










50 








55 




Pro Leu Ala 


Val 


Pro 


Met 


Trp Asn Lys 


His 


Gly Cys 












60 










65 














<210> 235 


























<211> 127 


























<212> PRT 


























<213> Homo J 


sapiens 






















<220> 


























<221> SIGNAL 
























<222> -47..- 


-1 
























<400> 235 


























Met Phe Thr 


Pro 


Leu 


Thr 


Val 


Lys 


Tyr 


Ala 


Tyr Tyr Asp 


Thr 


Glu Arg 


-45 










-40 








-35 








He Gly Val 


Asp 


Leu 


He 


Met 


Lys 


Thr Cys 


Phe Ser 


Pro 


Asn 


Arg 


Val 


-30 








-25 








-20 










He Gly Leu 


Ser 


Ser 


Asp 


Leu 


Gin 


Gin 


Val 


Gly Gly Ala Ser 


Ala Arg 


-15 






-10 










-5 








1 


He Gin Asp Ala 


Leu 


Ser 


Thr 


Val 


Leu 


Gin 


Tyr Ala 


Glu Asp 


Val 


Leu 




5 










10 








15 






Ser Gly Lys 


Val 


Ser 


Ala 


Asp 


Asn 


Thr 


Val 


Gly Arg 


Phe 


Leu 


Met 


Ser 


20 










25 








30 








Leu Val Asn 


Gin 


Val 


Pro Lys 


He 


Val 


Pro Asp Asp 


Phe 


Glu 


Thr 


Met 


35 








40 








45 










Leu Asn Ser 


Asn 


He 


Asn Asp Leu Leu Met 


Val Thr 


Tyr 


Leu 


Ala 


Asn 


50 






55 










60 








65 


Leu Thr Gin 


Ser 


Gin 


He 


Ala 


Leu 


Asn 


Glu 


Lys Leu 


Val 


Asn 


Leu 








70 










75 








80 




<210> 236 


























<211> 85 


























<212> PRT 


























<213> Homo sapiens 






















<220> 


























<221> SIGNAL 
























<222> -32. 


■1 
























<400> 236 


























Met Lys Ala 


Ala 


Leu 


Glu 


Asp 


Thr 


Leu 


Ala 


Glu Thr 


Glu 


Ala 


Arg 


Phe 


-30 










-25 








-20 








Gly Ala Gin 


Leu 


Ala 


His 


He 


Gin 


Ala 


Leu 


He Ser Gly 


He 


Glu 


Ala 


-15 








-10 








-5 










Gin Leu Gly Asp 


Val 


Arg 


Ala 


Asp Ser 


Glu 


Arg Gin 


Asn 


Gin 


Glu 


Tyr 


1 




5 










10 








15 




Gin Arg Leu 


Met 


Asp 


He 


Lys 


Ser Arg 


Leu 


Glu Gin 


Glu 


He 


Ala 


Thr 
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20 25 30 

Tyr Arg Ser Leu Leu Glu Gly Gin Glu Asp His Tyr Asn Asn Leu Ser 

35 40 45 

Ala Ser Lys Val Leu 
50 



<210> 237 
<211> 101 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -25.. -1 



<400> 237 








Met 


Val 


Asp 


Arg 


Glu 


Leu Ala Asp 


-25 










-20 


Leu 


Val 


Leu 


Cys 


Ser Arg Val Leu 


Lys 


Thr 


Gin 


Leu 


Ser 


Leu Gly Arg 






10 






15 


Asp 


Tyr 


lie 


Pro 


Ser 


Thr Arg Tyr 




25 








30 


Leu 


Cys 


Arg 


Leu 


Thr 


He Arg Asn 


40 










45 


Pro Asp 


Gly 


He 


Phe 


Ser Leu Leu 










60 




Leu 


Asn 


Leu 


Glu 


He 










75 







He His Glu Asp Ala Lys Thr Cys 
-15 -10 

Ser Val He Ser Val Lys Glu He 

1 5 

His Pro He He Ser Asn Trp Phe 
20 

Lys Asp Pro Cys Glu Leu Leu His 
35 

Gin Leu Leu Thr Asn Asn Met Leu 

50 55 

He Pro Ala Arg Leu Gin Asn Tyr 
65 70 



<210> 238 
<211> 68 
<212> PRT 

<213> Homo sapiens 



<220> 












<221> SIGNAL 










<222> -47..- 


■1 










<400> 238 












Met Val Lys 


Ser 


Ser 


Leu 


Gin Arg 


-45 










-40 


Arg Glu Lys 


Glu 


Gly Asp 


Lys 


Pro 


-30 








-25 




Thr Met Pro 


Leu 


Leu 


Ser 


Leu 


His 


-15 






-10 






Ser Ser Arg 


Val 
5 


Ser 


Leu 


His 


Cys 


Arg Trp Cys 


Ser 











20 



He Leu Asn Ser His Cys Phe Ala 
-35 

Ser Ala Thr He His Ala Ser Arg 
-20 

Ser Arg Gly Gly Ser Ser Ser Glu 

-5 1 
Cys Ser Asn Pro Gly Pro Gly Pro 
10 15 



<210> 239 

<211> 68 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -47. .-1 



<400> 239 

Met Val Lys Ser Ser Leu Gin Arg He Leu Asn Ser His Cys Phe Ala 



1 
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-45 

Arg Glu Lys Glu Gly Asp 
-30 

Thr Met Pro Leu Leu Ser 
-15 -10 
Ser Ser Arg Val Ser Leu 
5 

Arg Trp Cys Ser 
20 



156 

-40 

Lys Pro Ser Ala Thr lie 
-25 -20 
Leu His Ser Arg Gly Gly 
-5 

His Cys Cys Ser Asn Pro 
10 



PCT/IBOl/00914 

-35 

His Ala Ser Arg 

Ser Ser Ser Glu 
1 

Gly Pro Gly Pro 
15 



<210> 240 
<211> 68 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 

<222> -47. .-1 

<223> Von Heijne matrix 

score 3.72491712201476 
seq PLLSLHSRGGSSS/ES 



<400> 240 






Met 


Val Lys 


Ser 


Ser 




-45 






Arg 


Glu Lys 


Glu 


Gly 




-30 






Thr 


Met Pro 


Leu 


Leu 


-15 








Ser 


Ser Arg 


Val 


Ser 






5 




Arg 


Trp Cys 


Ser 





Leu Gin Arg lie Leu Asn 
-40 

Asp Lys Pro Ser Ala Thr 
-25 

Ser Leu His Ser Arg Gly 
-10 -5 
Leu His Cys Cys Ser Asn 
10 



Ser His Cys Phe Ala 
-35 

lie His Ala Ser Arg 
-20 

Gly Ser Ser Ser Glu 

1 

Pro Gly Pro Gly Pro 
15 



<210> 241 
<211> 68 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -47. .-1 

<223> Von Heijne matrix 

score 3.72491712201476 
seq PLLSLHSRGGSSS/ES 



<220> 

<221> unsure 
<222> 12 

<223> Xaa = Lys, Asn 



<400> 241 




















Met 


Val Lys 


Ser 


Ser 


Leu 


Gin 


Arg 


lie 


Leu 


Asn 


Ser His Cys Phe Ala 




-45 










-40 








-35 


Arg 


Glu Lys 


Glu Gly 


Asp 


Lys 


Pro 


Ser 


Ala 


Thr 


lie His Ala Ser Arg 




-30 








-25 










-20 


Thr 


Met Pro 


Leu 


Leu 


Ser 


Leu 


His 


Ser Arg Gly Gly Ser Ser Ser Glu 


-15 








-10 










-5 


1 


Ser 


Ser Arg 


Val 


Ser 


Leu 


Xaa 


Xaa 


•Cys 


Ser 


Xaa 


Pro Gly Pro Gly Pro 






5 










10 






15 


Arg 


Trp Cys 


Ser 



















20 
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<210> 242 

<211> 129 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> "16. .-1 

<223> Von Heijne matrix 

score 3.61791418904325 
seq LLLLLE%ALRRGPG/PG 

<220> 

<221> unsure 

<222> 92 

<223> Xaa = His, Gin 
<220> 

<221> unsure 

<222> 84 

<223> Xaa = Phe,Tyr 

<400> 242 



Met 


Ala 


Val 


Leu 


Leu 


Leu 


Leu 


Leu 


Arg Ala 


Leu Arg Arg Gly 


Pro Gly 




-15 










-10 






-5 






Pro 


Gly 


Pro 


Arg 


Pro 


Leu 


Trp 


Gly 


Pro Gly 


Pro Ala Trp Ser 


Pro Gly 


1 








5 








10 




15 




Phe 


Pro 


Ala 


Arg 


Pro Gly Arg 


Gly 


Arg Pro 


Tyr Met Ala Ser 


Arg 


Pro 








20 










25 


30 






Pro 


Gly 


Asp 


Leu 


Ala 


Glu 


Ala 


Gly 


Gly Arg 


Ala Leu Gin Ser 


Leu 


Gin 






35 










40 




45 






Leu 


Arg 


Leu 


Leu 


Thr 


Pro 


Thr 


Phe 


Glu Gly lie Asn Gly Leu 


Leu 


Leu 




50 










55 






60 






Lys 


Gin 


His 


Leu 


Val 


Gin 


Asn 


Pro 


Val Arg Leu Trp Gin Leu 


Leu 


Gly 


65 










70 








75 




80 


Gly 


Thr 


Phe 


Xaa 


Phe 


Asn 


Thr 


Ser 


Arg Leu 


Lys Xaa Lys Asn 


Lys 


Glu 










85 








90 




95 




Lys 


Asp 


Lys 


Ser 


Lys Gly Lys Ala 


Pro Glu 


Glu Asp Glu Gly 


He 


Phe 








100 










105 


110 







He 



<210> 243 
<211> 129 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -16. .-1 

<223> Von Heijne matrix 

score 3.61791418904325 
seq LLLLLRALRRGPG/PG 

<400> 243 



Met Ala Val Leu 


Leu Leu Leu 


Leu 


Arg Ala 


Leu Arg Arg 


Gly 


Pro 


Gly 


-15 


-10 








-5 










Pro Gly Pro Arg 


Pro Leu Trp 


Gly 


Pro Gly 


Pro 


Ala 


Trp 


Ser 


Pro 


Gly 


1 


5 




10 










15 




Phe Pro Ala Arg 


Pro Gly Arg 


Gly 


Arg Pro 


Tyr 


Met 


Ala 


Ser 


Arg 


Pro 


20 






25 








30 






Pro Gly Asp Leu Ala Glu Ala 


Gly 


Gly Arg Ala 


Leu 


Gin 


Ser 


Leu 


Gin 


35 




40 








45 








Leu Arg Leu Leu 


Thr Pro Thr 


Phe 


Glu Gly 


He 


Asn 


Gly 


Leu 


Leu 


Leu 
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157 
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50 




55 




60 






Lys Gin His Leu Val 


Gin 


Asn 


Pro Val Arg Leu 


Trp Gin Leu 


Leu Gly 


65 


70 




75 






80 


Gly Thr Phe Tyr Phe 


Asn 


Thr 


Ser Arg Leu Lys 


Gin Lys Asn 


Lys 


Glu 


85 






90 




95 




Lys Asp Lys Ser Lys Gly Lys 


Ala Pro Glu Glu Asp Glu Gly 


He 


Phe 


100 






105 


110 






lie 














<210> 244 














<211> 109 














<212> PRT 














<213> Homo sapiens 














<220> 














<221> SIGNAL 














<222> -33. .-1 














<220> 














<221> UNSURE 














<222> 43 














<223> Xaa = Asp,Gly,Asn, 


Ser 










<400> 244 














Met Ala Ala Ser Gly Ala 


Pro 


Arg He Leu Val 


Asp Leu Leu 


Lys 


Leu 


-30 






-25 


-20 






Asn Val Ala Pro Leu 


Ala 


Val 


Phe Gin Met Leu 


Lys Ser Met 


Cys 


Ala 


-15 






-10 


-5 






Gly Gin Arg Leu Ala 


Ser 


Glu 


Pro Gin Asp Pro 


Ala Ala Val 


Ser 


Leu 


1 


5 




10 






15 


Pro Thr Ser Ser Val 


Pro 


Glu 


Thr Arg Gly Arg Asn Lys Gly Ser Ala 


20 






25 




30 




Ala Leu Gly Gly Ala 


Leu 


Ala 


Leu Ala Glu Arg Xaa Ser Arg Glu Gly 


35 






40 


45 






Ser Ser Gin Arg Met 


Pro 


Arg 


Gin Pro Ser Ala 


Thr Arg Leu 


Pro 


Lys 


50 






55 


60 






Gly Gly Gly Pro Gly 


Lys 


Ser 


Pro Thr Arg Gly 


Ser Thr 






65 




70 




75 






<210> 245 














<211> 103 














<212> PRT 














<213> Homo sapiens 














<220> 














<221> SIGNAL 














<222> -36.. -1 














<400> 245 














Met Ala Asn Leu Phe 


He 


Arg 


Lys Met Val Asn 


Pro Leu Leu 


Tyr 


Leu 


-35 




-30 




-25 






Arg Arg His Thr Val 


Lys 


Pro 


Arg Ala Leu Ser 


Thr Phe Leu 


Phe 


Gly 


-20 


-15 




-10 






-5 


Ser lie Arg Gly Ala 


Ala 


Pro 


Val Ala Val Glu 


Pro Gly Ala 


Ala 


Val 


1 






5 


10 






Arg Ser Leu Leu Ser 


Pro 


Gly 


Leu Leu Pro His 


Leu Leu Pro 


Ala 


Leu 


15 






20 


25 






Gly Phe Lys Asn Lys 


Thr 


Val 


Leu Lys Lys Arg 


Cys Lys Asp 


Cys 


Tyr 


30 




35 




40 






Leu Val Lys Arg Arg Gly 


Arg 


Trp Tyr Val Tyr Cys Lys Thr 


His 


Pro 


45 


50 




55 






60 


Arg His Lys Gin Arg 


Gin 


Met 











65 
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<210> 246 
<211> 86 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -36.. -1 

<400> 246 

Met Gly Pro Ala Leu Arg Ser Leu Gin Val Lys Lys Gly Thr Glu His 

-35 -30 -25 

Ala Asp Pro Leu Pro Phe Pro Ser Val Ser Leu Ser Gly Phe Thr Val 
-20 -15 -10 -5 

Gly Thr Leu Ser Glu Thr Ser Thr Gly Gly Pro Ala Thr Pro Thr Trp 

15 10 

Lys Glu Cys Pro lie Cys Lys Glu Arg Phe Pro Ala Glu Ser Asp Lys 

15 20 25 

Asp Ala Leu Glu Asp His Met Asp Gly His Phe Phe Phe Ser Thr Gin 

30 35 40 

Asp Pro Phe Thr Phe Glu 
45 50 

<210> 247 
<211> 52 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -30.. -1 

<400> 247 

Met Ser Ser Cys Gly lie Val Gly Ser Ser Val Ser Phe Gin Leu Asp 
-30 -25 -20 -15 

Ala Val Lys Leu Leu Leu Lys Met Val Ser Ser Ala Thr Thr Glu Arg 

-10 -5 1 

Cys Cys Asn Gly Ser Ala Asn Phe His Lys Asn Leu Cys Ala Thr Gly 

5 10 15 

He Lys Asn Phe 
20 

<210> 248 
<211> 219 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -40. .-1 

<223> Von Heijne matrix 

score 6.4143157541603 

seq ITMSATWDAVNA/AP 

<220> 

<221> unsure 
<222> 13 

<223> Xaa = Glu, Gin 



<220> 

<221> unsure 
<222> -39 



wo 02/083898 
<223> Xaa = Ile,Asn 



160 
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<400> 248 






Pro 


Xaa 


Arg 


Lys He 


Thr Arg Leu 


-40 








-35 


Gly Ala 


Ser 


Cys Gly Gly Ala Gly 








-20 




Thr 


Val 


Val 


Asp Ala Val Asn Ala 

_& 


Met 


Ser 


Leu 


Glu Xaa 


Pro Lys Lys 




10 






15 


Lys 


Lys 


Glu 


Leu Glu Glu Gin Arg 


25 








30 


Val 


Asp Glu 


Glu Gly 


Lys Asp He 








45 




Ser 


Ser 


Val 


Pro Trp 


Tyr He Asp 








60 




His 


Gin Arg 


Pro Gin 


Pro Glu Lys 






75 




80 


Glu 


Trp Tyr 


Lys Arg 


Gly Val Lys 




90 






95 


Arg 


Lys 


Gly 


Ala Cys 


Glu Asn Cys 


105 








110 


Asp Cys 


Phe 


Glu Arg 


Pro Arg Arg 








125 




Asn 


He 


Ala 


Pro Asp 


Glu His Val 








140 




Asp Gly Lys 


Arg Asp Arg Trp Asn 






155 




160 


Lys 


He 


Val 


Glu Glu 


Tyr Ala Lys 




170 






175 



Ala Trp 


He 
-30 


Leu 


Ser 


Asp 


Gly 


Leu 
-25 


Gly Gly 


Gin 


He 


Thr 


Met 


Ser 


Ala 


-15 










-10 




Ala Pro 


Leu 


Ser 


Gly 


Ser 


Lys 


Glu 


1 






5 








Met Thr 


Arg 


Glu 
20 


Asp 


Trp 


Arg 


Lys 


Lys Leu 


Gly 
35 


Asn 


Ala 


Pro 


Ala 


Glu 
40 


Asn Pro 


His 


He 


Pro 


Gin 


Tyr 


He 


50 










55 




Pro Ser 


Lys 


Arg 


Pro 


Thr 


Leu 


Lys 


65 








70 






Gin Lys 


Gin 


Phe 


Ser 
85 


Ser 


Ser 


Gly 


Glu Asn 


Ser 


He 
100 


He 


Thr 


Lys 


Tyr 


Gly Ala 


Met 
115 


Thr 


His 


Lys 


Lys 


Lys 
120 


Val Gly 


Ala 


Lys 


Phe 


Thr 


Gly 


Thr 


130 










135 




Gin Pro 


Gin 


Leu 


Met 


Phe 


Asp 


Tyr 


145 








150 






Gly Tyr 


Asn 


Pro 


Glu 


Glu 


His 


Met 



165 



Val Asp Leu 



<210> 249 
<211> 215 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 

<222> -36. .-1 

<223> Von Heijne matrix 

score 5.52542399647426 
seq ITMSATVXDAVNA/AP 



<400> 249 



He 


Thr 


Arg 


Leu 


Ala 


Trp 


He 


Leu 




-35 










-30 




Gly 


Gly 


Ala 


Gly 


Gly 


Gly 


Gin 


He 


-20 










-15 






Ala 


Val 


Asn 


Ala 


Ala 
1 


Pro 


Leu 


Ser 


Glu 


Pro 


Lys 


Lys 


Met 


Thr 


Arg 


Glu 






15 










20 


Glu 


Glu 


Gin 


Arg 


Lys 


Leu 


Gly 


Asn 




30 










35 




Gly 


Lys 


Asp 


He 


Asn 


Pro 


His 


He 


45 










50 






Trp 


Tyr 


He 


Asp 


Pro 


Ser 


Lys 


Arg 










65 








Gin 


Pro 


Glu 


Lys 


Gin 


Lys 


Gin 


Phe 








80 










Arg 


Gly 


Val 


Lys 


Glu 


Asn 


Ser 


He 






95 










100 



Xaa 


Asp 


Gly 


Leu 


Gly 


Ala 


Ser 


Cys 








-25 










Thr 


Met 


Ser 


Ala 


Thr 


Val 


Xaa 


Asp 






-10 










-5 


Gly Ser 


Lys 


Glu 


Met 


Ser 


Leu 


Glu 


5 










10 






Asp Trp 


Arg 


Lys 


Lys 


Lys 


Glu 


Leu 










25 








Ala 


Pro 


Ala 


Glu 


Val 


Asp 


Glu 


Glu 








40 










Pro 


Gin 


Tyr 


He 


Ser 


Ser 


Val 


Pro 






55 










60 


Pro 


Thr 


Leu 


Lys 


His 


Gin 


Arg 


Pro 




70 










75 




Ser 


Ser 


Ser 


Gly 


Glu 


Trp 


Tyr 


Lys 


85 










90 






He 


Thr 


Lys 


Tyr 


Arg 


Lys 


Gly 


Ala 



105 
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Cys Glu Asn 


Cvs 


Glv 


Ala 


Met 


Thr 


His 


Lvs 


Lvs 


Lvs 


ASD 


Cvs 


Phe 


Glu 


110 








115 










120 










Arg Pro Arg 


Arg 


Val 


Gly 


Ala 


Lys 


Phe 


Thr 


Gly 


Thr 


Asn 


He 


Ala 


Pro 


125 






130 










135 










140 


Asp Glu His 


Val 


Gin 


Pro 


Gin 


Leu 


Met 


Phe 


Asp 


Tvr 


Asp 


Gly 


Lys 


Arg 






145 










150 










155 




Asp Arg Trp 


Asn 


Glv 


Tvr 


Asn 


Pro 


Glu 


Glu 


His 


Met 


Lvs 


He 


Val 


Glu 




160 










165 










170 






Glu Tyr Ala 


Lys 


Val 


Asp 


Leu 




















175 




























<210> 250 




























<211> 214 




























<212> PRT 




























<213> Homo sapiens 
























<220> 




























<221> SIGNAL 


























<222> -36. 


-1 


























<220> 




























<221> UNSURE 


























<222> 120 




























<223> Xaa = 


Glu, Lys 
























<400> 250 




























lie Thr Arg 


Leu 


Ala 


TrD 


He 


Leu 


Ser 


Asp 


Glv 


Leu 


Glv 


Ala 


Ser 


Cvs 


-35 








-30 










-25 










Gly Gly Ala 


Gly 


Gly 


Glv 


Gin 


He 


Thr 


Met 


Ser 


Ala 


Thr 


Val 


Val 


Asp 


-20 






-15 










-10 










-5 


Ala Val Asn 


Ala 


Ala 


Pro 


Leu 


Ser 


Glv 


Ser 


Lys 


Glu 


Met 


Ser 


Leu 


Glu 






1 








5 










10 






Glu Pro Lys 


Lys 


Met 


Thr 


Arg 


Glu 


Asp 


TrD 


Arg 


Lys 


Lys 


Lys 


Glu 


Leu 


15 










20 










25 








Glu Glu Gin 


Arg 


Lys 


Leu 


Glv 


Asn 


Ala 


Pro 


Ala 


Glu 


Val 


ASD 


Glu 


Glu 


30 








35 










40 










Gly Lys Asp 


He 


Asn 


Pro 


His 


He 


Pro 


Gin 


Tvr 


He 


Ser 


Ser 


Val 


Pro 


45 






50 










55 










60 


Trp Tyr lie 


Asp 


Pro 


Ser 


Lvs 


Ara 


Pro 


Thr 


Leu 


Lvs 


His 


Gin 


Ara 


Pro 






65 










70 










75 




Gin Pro Glu 


Lys 


Gin 


Lvs 


Gin 


Phe 


Ser 


Ser 


Ser 


Glv 


Glu 


Trp 


Tvr 


Lvs 




80 










85 










90 






Arg Gly Val 


Lys 


Glu 


Asn 


Ser 


He 


He 


Thr 


Lvs 


Tvr 


Arg 


Lvs 


Gly 


Ala 


95 










100 










105 








Cys Glu Asn 


Cys 


Gly 


Ala 


Met 


Thr 


His 


Lvs 


Lvs 


Xaa 


Asp 


Cvs 


Phe 


Glu 


110 








115 










120 










Arg Pro Arg 


Arg 


Val 


Gly 


Ala 


Lys 


Phe 


Thr 


Gly 


Thr 


Asn 


He 


Ala 


Pro 


125 






130 










135 










140 


Asp Glu His 


Val 


Gin 


Pro 


Gin 


Leu 


Met 


Phe 


Asp 


Tyr 


Asp 


Gly 


Lys 


Arg 






145 










150 










155 




Asp Arg Trp 


Asn 


Gly 


Tyr 


Asn 


Pro 


Glu 


Glu 


His 


Met 


Lys 


He 


Val 


Glu 




160 










165 










170 







Glu Tyr Ala Lys Val Asp 
175 



<210> 251 
<211> 147 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -15.. -1 



wo 02/083898 
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<220> 

<221> UNSURE 
<222> -9 

<223> Xaa = Glu,Lys 



<400> 251 



Met 


Asp 


Val 


Leu 


Val Ser 


Xaa 


Cvs 


-15 








-10 






Glu 


Glu 


lie 


Lys 
5 


Ser Leu 


Thr 


Ala 


Gly Cys 


Leu 


Gly 


Ala Ser 


Pro 


Asn 






20 








25 


Leu 


Lys 


Leu 


Lys 


Tyr Arg 


Leu 


Asn 




35 








40 




Glu 


Arg 


Asn 


Lys 


Pro Thr 


Lys 


Asn 


50 








55 






Gin 


Glu 


Val 


Phe 


Gly His 


Ala 


lie 










70 






Asn 


Pro 


Pro 


Leu 


Leu Val 


Thr 


Pro 








85 








Tyr Gin 


Cys 


Asn 


Ser Ala 


Met 


Gly 






100 








105 


Asp 


Ser 


Trp 


Leu 


Phe Asp 


Phe 


Phe 




115 








120 




Leu 


Gin 


Lys 











130 



<210> 252 
<211> 117 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -43. .-1 

<223> Von Heijne matrix 

score 4.10872570449087 
seq LGQVAAAAAAANA/AS 



<400> 252 



Leu 


Ser 


Pro 


Ala 


Gin 


Ala 


Ser Gly 








-40 








Ser Gly 


Trp 


Ser 


Glu 


Arg 


Gly Ser 






-25 








-20 


Gin 


Val 


Ala 


Ala 


Ala 


Ala 


Ala Ala 




-10 










-5 


Ser 


Ala 


Asn 


Ser 


Leu 


Gly 


Ser Gly 










10 






Thr 


Ser 


Arg 


Pro 


Ser 


Gin 


Ser Arg 








25 








Phe 


Pro 


Pro 


Ser 


Pro 


Leu 


Pro Pro 






40 








45 


Cys 


Tyr 


Cys 


Phe 


Leu 


Leu 


lie Asn 




55 










60 


Lys 


Lys 


Lys 


Lys 


Lys 







70 



<210> 253 

<211> 88 

<212> PRT 

<213> Homo sapiens 



Ser Ala 


Arg Leu 


Leu 


Gin 


Gin 


Glu 




-5 










1 


Glu lie 


Asp Arg Leu Lys 


Asn Cys 


10 








15 






Leu Glu 


Gin 


Leu 


Gin 


Glu 


Glu 


Asn 








30 








lie Leu 


Arg 


Lys 


Ser 


Leu 


Gin 


Ala 






45 










Met lie 


Asn 


T 1 <» 


He 


Ser 


Arg 


Leu 




60 










65 


Lys Ala 


Ala 


Tyr 


Pro Asp 


Leu 


Glu 


75 










80 




Ser Gin 


Gin 


Ala 


Lys 


Phe 


Gly Asp 


90 








95 






lie Ser 


Gin 


Val 


Met 


Tyr 












110 








Lys Tyr 


Tyr 


Tyr 


His 


His 


Cys 


His 






125 
J. 6 -J 










He Trp 


Cys 


Arg 


Asp 


Ser 


Ala 


Phe 


-35 








-30 






Ser Ser 


Arg Arg 


Gly Cys 


Leu 


Gly 








-15 








Ala Asn 


Ala 


Ala 


Ser 


Pro 


Pro 


Pro 






1 








5 


Gly Arg 


Cys 


Lys 


Leu Arg 


Ala 


Pro 


15 










20 




Pro Arg 


Ser 


Leu 


Pro 


Gin 


Ala 


Arg 


30 








35 






Ser Pro 


Ala Gly 


Thr 


Ser 


Cys 


Thr 








50 








Ala Asp 


Leu 


He 


Lys 


Trp 


Ser 


Pro 



65 



wo 02/083898 
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<220> 

<221> SIGNAL 
<222> -36. .-1 

<400> 253 

Met Ala Ser Val Val Pro Val Lys Asp Lys Lys Leu Leu Glu Val Lys 

-35 -30 -25 

Leu Gly Glu Leu Pro Ser Trp He Leu Met Arg Asp Phe Ser Pro Ser 
-20 -15 -10 . -5 

Gly He Phe Gly Ala Phe Gin Arg Gly Tyr Tyr Arg Tyr Tyr Asn Lys 

1 5 10 

Tyr He Asn Val Lys Lys Gly Ser He Ser Gly He Thr Met Val Leu 

15 20 25 

Ala Cys Tyr Val Leu Phe Ser Tyr Ser Phe Ser Tyr Lys His Leu Lys 

30 35 40 

His Glu Arg Leu Arg Lys Tyr His 
45 50 

<210> 254 

<211> 62 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -42. .-1 

<220> 

<221> UNSURE 
<222> -11 

<223> Xaa = Glu, Lys 



<400> 254 



Met Asp Pro Asn Cys 


Ser Cys 


Ala 


Ala 


Ala 


Gly Val 


Ser Cys 


Thr 


Cys 


-40 






-35 








-30 








Ala Ser Ser Cys Lys 


Cys 


Lys 


Glu 


Cys 


Lys 


Cys Thr 


Ser Cys 


Lys 


Xaa 


-25 




-20 








-15 










Ser Cys Cys Ser Cys 


Cys 


Pro 


Val 


Gly Cys 


Ala Lys 


Cys 


Ala 


Gin Gly 


-10 


-5 










1 






5 




Cys He Cys Lys Gly Ala 


Ser 


Glu 


Lys 


Cys 


Ser Cys 


Cys 


Ala 






10 








15 








20 






<210> 255 






















<211> 132 






















<212> PRT 






















<213> Homo sapiens 






















<220> 






















<221> UNSURE 






















<222> 118 






















<223> Xaa = Gly, Val 






















<400> 255 






















Leu Phe Pro Ala Pro 


Ala 


Pro 


Pro 


Pro 


Ala 


Pro Ala 


Phe 


Ala 


Pro 


Pro 


1 5 










10 








15 




Pro Lys Val Pro Ser 


Pro 


Glu 


Arg 


Ser 


Ala 


Pro Arg 


Val 


Pro 


Leu 


Pro 


20 








25 








30 






Ser Pro Gin Pro Ser 


Tyr 


Pro 


Phe 


Arg 


Pro 


Ala Ala 


Ser 


Gly 


Gly 


Thr 


35 






40 








45 








Pro Pro Pro Ala Cys 


Leu 


Pro 


Pro 


Ala 


Gin 


Pro Cys 


Gin Gly 


Ser 


Pro 


50 




55 








60 










Ala Met Asn Leu Phe 


Arg 


Phe 


Leu 


Gly Asp 


Leu Ser 


His 


Leu 


Leu 


Ala 



65 70 75 80 



wo 02/083898 

He lie Leu Leu Leu Leu Lys He Trp 
85 

His Pro Gin Leu Pro Leu Ser Phe Cys 
100 105 
Ser Leu Ser Leu Ser Xaa Ser Leu Ser 

115 120 
Lys Lys Lys Lys 
130 



PCT/IBOl/00914 

164 

Lys Ser Arg Ser Cys Ala Ala 
90 95 
Leu Ser Val Cys Leu Ser Val 
110 

Leu Ser Phe Ser Val Ser Lys 
125 



<210> 256 
<211> 163 
<212> PRT 

<213> Homo sapiens 



<400> 256 


























Pro His 


Arg 


Leu 


Thr 


Leu 


Pro 


Thr 


Leu 


Gin Ser 


Val 


Thr 


Phe 


Arg 


Cys 


1 






5 










10 








15 




Pro Ser 


Arg 


Ser 


Glu 


Lys 


Leu Gly 


Lys 


Asn Met 


Val 


Ser 


Ser 


Phe 


Arg 






20 










25 








30 






Val Ser 


Glu. 


Leu 


Gin 


Val 


Leu 


Leu 


Gly 


Phe Ala 


Gly Arg Asn 


Lys 


Ser 




35 










40 








45 








Gly Arg 


Lys 


His 


Asp 


Leu 


Leu 


Met 


Arg 


Ala Leu 


His 


Leu 


Leu 


Lys 


Ser 


50 










55 








60 










Gly Cys 


Ser 


Pro 


Ala 


Val 


Gin 


He 


Lys 


He Arg 


Glu 


Leu 


Tyr Arg 


Arg 


65 








70 








75 










80 


Arg Tyr 


Pro 


Arg 


Thr 


Leu 


Glu 


Gly 


Leu 


Ser Asp 


Leu 


Ser 


Thr 


He 


Lys 








85 










90 








95 




Ser Ser 


Val 


Phe 


Ser 


Leu 


Asp Gly 


Gly Ser Ser 


Pro 


Val 


Glu 


Pro 


Asp 






100 










105 








110 






Leu Ala 


Val 


Ala 


Gly 


He 


His 


Ser 


Leu 


Pro Ser 


Thr 


Ser 


Val 


Thr 


Pro 




115 










120 








125 








His Ser 


Pro 


Ser 


Ser 


Pro 


Val 


Gly 


Ser 


Val Leu 


Leu Gin Asp 


Thr 


Lys 


130 










135 








140 










Pro Thr 


Phe 


Glu 


Met 


Gin 


Gin 


Pro 


Ser 


Pro Pro 


He 


Pro 


Pro 


Val 


His 


145 








150 








155 










160 


Pro Asp 


Val 



























<210> 257 
<211> 166 
<212> PRT 

<213> Homo sapiens 



<400> 257 



Pro 


His 


Arg 


Leu 


Thr 


Leu 


Pro 


Thr 


Leu 


Gin 


Ser 


Val Thr Phe 


Arg 


Cys 


1 








5 










10 






15 




Pro 


Ser 


Arg 


Ser 


Glu 


Lys 


Leu Gly 


Lys 


Asn 


Met 


Val Ser Ser 


Phe 


Arg 








20 










25 






30 






Val 


Ser 


Glu 


Leu 


Gin 


Val 


Leu 


Leu 


Gly 


Phe 


Ala 


Gly Arg Asn 


Lys 


Ser 






35 










40 








45 






Gly Arg 


Lys 


His 


Asp 


Leu 


Leu 


Met 


Arg 


Ala 


Leu 


His Leu Leu 


Lys 


Ser 




50 










55 










60 






Gly Cys 


Ser 


Pro 


Ala 


Val 


Gin 


He 


Lys 


He Arg 


Glu Leu Tyr Arg 


Arg 


65 










70 










75 






80 


Arg 


Tyr 


Pro 


Arg 


Thr 


Leu 


Glu 


Gly 


Leu 


Ser Asp 


Leu Ser Thr 


He 


Lys 










85 










90 






95 




Ser 


Ser 


Val 


Phe 


Ser 


Leu Asp Gly 


Gly 


Ser 


Ser 


Pro Val Glu 


Pro 


Asp 








100 










105 






110 






Leu 


Ala 


Val 


Ala 


Gly 


He 


His 


Ser 


Leu 


Pro 


Ser 


Thr Ser Val 


Thr 


Pro 






115 










120 








125 






His 


Ser 


Pro 


Ser 


Ser 


Pro 


Val 


Gly 


Ser 


Val 


Leu 


Leu Gin Asp Thr 


Lys 




130 










135 










140 






Pro 


Thr 


Phe 


Glu 


Met 


Gin 


Gin 


Pro 


Ser 


Pro 


Pro 


He Pro Pro 


Val 


His 


145 










150 










155 






160 



wo 02/083898 
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Pro Asp Val 


Gin 


Leu 


Lys 










165 








<210> 258 












<211> 118 












<212> PRT 












<213> Homo 1 


sapiens 








<400> 258 












Met Ser Tyr 


Glu 


Gin 


Leu 


Met 


Gin 


1 




5 








Arg Leu Asn 


Arg 


Gly 


Leu 


Arg 


Arg 




20 










Leu Arg Lys 


Ala 


Lys 


Lys 


Glu 


Ala 


35 










40 


Val Lys Thr 


His 


Leu 


Arg 


Asp 


Met 


50 








55 




Ser Met Val 


Gly 


Val 


Tyr 


Asn 


Gly 


65 






70 






Lys Pro Glu 


Met 


He 


Gly 


His 


Tyr 






85 








Lys Pro Val 


Lys 


His 


Gly 


Arg 


Pro 




100 










Arg Phe lie 


Pro 


Leu 


Lys 






115 












<210> 259 












<211> 101 












<212> PRT 












<213> Homo sapiens 








<220> 












<221> unsure 










<222> 98 












<223> Xaa = 


lie, 


Asn 








<220> 












<221> unsure 










<222> 19 












<223> Xaa = 


Ser, 


Thr 








<400> 259 












Arg Thr Ala 


Gly 


Arg 


Gly 


Arg 


Pro 


1 




5 








Ala Arg Xaa 


Cys 


Ala 


Glu 


Asp 


Pro 




20 










Gly Ser Arg 


Ala 


Met 


Ala 


Glu 


Glu 


35 










40 


Pro Lys Pro 


Ser 


Val 


Leu 


Phe 


Leu 


50 








55 




Ala Glu Arg 


Leu 


Val 


Leu 


Leu 


Pro 


65 






70 






Phe Cys Thr 


Ala 


Pro 


Arg 


Ser 


Leu 






85 








Lys Xaa Xaa 


Lys 


Ser 










100 











<210> 260 
<211> 100 
<212> PRT 

<213> Homo sapiens 



165 



Leu 


Tyr Ser Ala Arg 


Gin Arg Arg 




10 




15 


Lys 


Gin His Ser Leu 


Leu 


Lys Arg 


25 




30 




Pro 


Pro Met Glu Lys 


Pro 


Glu Val 




45 






He 


He Leu Pro Glu 


Met 


Val Gly 




60 






Lys 


Thr Phe Asn Gin 


Val 


Glu He 




75 




80 


Leu 


Gly Glu Phe Ser 


He 


Thr Tyr 




90 




95 


Gly 


He Gly Ala Thr 


His 


Ser Ser 


105 




110 





Ala 


Val Ala Ser Trp 


Glu 


Leu Arg 




10 




15 


His 


Gin Gly Ala Gly 


Cys 


Ser Cys 


25 




30 




Gin 


Gly Arg Glu Arg 


Asp 


Ser Val 




45 






His 


Pro Asp Leu Gly 


Val 


Gly Gly 




60 






Phe 


Pro Thr Glu Arg 


Arg 


Leu Leu 




75 




80 


Asn 


Ser Leu Pro Leu 


Lys 


Lys Lys 




90 




95 



<400> 260 



wo 02/083898 



Val Arg 


Gin 


Gly 


Gly 


Val 


Gly 


Pro 


Gin 


1 






5 










Arg Gly 


Ala 


Cys 


Ala 


Glu Asp 


Pro 


His 






20 










25 


Gly Ser 


Arg 
35 


Ala 


Met 


Ala 


Glu 


Glu 
40 


Gin 


Pro Lys 


Pro 


Ser 


Val 


Leu 


Phe 


Leu 


His 


50 










55 






Ala Glu 


Arg 


Leu 


Val 


Leu 


Leu 


Pro 


Phe 


65 








70 








Phe Cys 


Thr 


Ala 


Pro 
85 


Arg 


Ser 


Leu 


Asn 


Lys Asn 


Ala 


Lys 
100 
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Trp Pro Ala 


Gly Ser Tyr 


Glu 


10 




15 




Gin Gly Ala 


Gly Cys 


Ser 


Cys 




30 






Gly Arg Glu 


Arg Asp 


Ser 


Val 




45 






Pro Asp Leu 


Gly Val 


Gly 


Gly 


60 








Pro Thr Glu 


Arg Arg 


Leu 


Leu 


75 






80 


Ser Leu Pro 


Leu Lys 


Lys 


Lys 


90 




95 





<210> 261 
<211> 147 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> 90 

<223> Xaa = Ala, Gly 



<400> 261 








Met 


Glu 


Tyr 


Val 


Leu 


Glu Val Lys 


1 








5 




Gin 


Phe 


Thr 


Val 


Cys 


Asp Val Pro 








20 






Val 


Ser 


Arg 


Asp 


Arg 


Cys Gin Glu 






35 






40 


Xaa 


Cys 


Tyr 


Lys 


Lys 


Ala Val Pro 




50 








55 


Leu 


lie 


Val 


lie 


Lys 


Ala Gly Ala 


65 










70 


Val 


lie 


Gin 


Glu 


Ser 


Arg Lys Glu 










85 




Leu 


Pro 


Gin 


Lys 


Ser 


Ser Glu Lys 








100 






Lys 


Leu 


Gly 


Leu 


Lys 


Pro Ala Ser 






115 






120 


Ser 


Met 


Lys 


Ser 


Asp 


Glu Asp Lys 




130 








135 


Glu 


Ala 


Glu 








145 













Asn Ser Pro Arg 


His 


Leu 


Leu 


Lys 


10 






15 




Leu Tyr Asp He 


Cys 


Asp 


Tyr 


Asn 


25 




30 






Leu Gly Cys Cys 


Xaa 


Tyr 


Glu 


Gly 




45 








He Tyr He His 


Val 


Phe 


Ser 


Ala 


60 










Phe Val He Thr 


He 


He 


Tyr 


Arg 


75 








80 


Lys Xaa He Pro 


Val 


Tyr 


Val 


Ala 


90 






95 




Ala Glu Leu Ala 


Ser 


Ser 


Ser 


Ser 


105 




110 






Pro Gly Pro Pro 


Ser 


Ala 


Gly 


Pro 




125 








Asp Asp Val Thr 


Gly Thr 


He 


Thr 



140 



<210> 262 
<211> 147 
<212> PRT 

<213> Homo sapiens 



<400> 262 



Met 


Glu 


Tyr 


Val 


Leu 


Glu 


Val 


Lys Asn Ser Pro Arg 


His 


Leu 


Leu 


Lys 


1 








5 






10 






15 




Gin 


Phe 


Thr 


Val 


Cys 


Asp 


Val 


Pro Leu Tyr Asp He 


Cys 


Asp 


Tyr 


Asn 








20 








25 




30 






Val 


Ser 


Arg Asp Arg 


Cys 


Gin 


Glu Leu Gly Cys Cys 


Phe 


Tyr 


Glu 


Gly 






35 










40 


45 








Val 


Cys 


Tyr 


Lys 


Lys 


Ala 


Val 


Pro He Tyr He His 


Val 


Phe 


Ser 


Ala 




50 










55 


60 










Leu 


He 


Val 


He 


He 


Ala 


Gly Ala Phe Val He Thr 


He 


He 


Tyr 


Arg 


65 










70 




75 








80 



wo 02/083898 



Val lie Gin 


Glu 


Ser Arg 


Lvs 


Glu 


Lys 






85 










Leu Pro Gin 


Lys 


Ser 


Ser 


Glu 


Lys 


Ala 




100 










105 


Lys Leu Gly 


Leu 


Lys 


Pro 


Ala 


Ser 


Pro 


115 










120 




Ser Met Lys 


Ser 


Asp 


Glu 


Asp 


Lys 


Asp 


130 








135 






Glu Ala Glu 














145 














<210> 263 














<2H> 149 














<212> PRT 














<213> Homo sapiens 










<400> 263 














Met Arg Tyr 


Asn 


Glu Lys 


Glu 


Leu 


Gin 


1 




5 










Glu Met Ala 


Ala 


Glu 


Leu 


Gly 


Met 


Arg 




20 










25 


Leu Lys Arg 


Arg 


Leu 


Val 


Lys 


Leu 


Val 


35 










40 




Arg Thr Asp 


Glu 


Ala 


Glu 


Pro 


Val 


Gly 


50 








55 






Arg Val Val 


Arg 


Glu 


Glu 


Pro 


Gly 


Thr 


65 






70 








Asp Pro Glu 


Arg 


Lys 


Tyr 


His 


Phe 


Glu 






85 










Gin Glu Trp 


Met 


Glu 


Ala 


Leu 


Arg 


Arg 




100 










105 


Arg Ser Leu 


lie 


Phe 


Tyr 


Arg 


Asn 


Glu 


115 










120 




Asp Pro Leu 


Glu 


Gin 


Phe 


Gly 


He 


Ser 


130 








135 






Ser Gly Leu 


Gin 


Ala 










145 














<210> 264 














<211> 103 














<212> PRT 














<213> Homo sapiens 










<400> 264 














Met Pro Val 


Val 


Pro 


Ala 


Leu 


Gly 


Arg 


1 




5 










Arg Ser Gly 


Val 


Arg Asp 


Gin 


Pro 


Gly 




20 










25 


Ser Leu Leu 


Lys 


lie 


Gin 


Lys 


Leu 


Ala 


35 










40 




Trp Ser Gin 


Leu 


Leu 


Gly 


Arg 


Leu 


Arg 


50 








55 






Gly Gly Gly 


Gly 


Cys 


Ser 


Glu 


Pro 


Arg 


65 






70 








Trp Val Thr 


Glu 


Gin Asp 


Ser 


He 


Ser 






85 










Leu Pro Lys 


Lys 


Lys 


Lys 


Lys 








100 













<210> 265 
<211> 78 
<212> PRT 

<213> Homo sapiens 
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Ala 


He 


Pro Val 


Tyr Val 


Ala 


90 








95 




Glu 


Leu 


Ala Ser 


Ser 


Ser 


Ser 








110 






Gly 


Pro 


Pro Ser 


Ala Gly 


Pro 






125 








Asp 


Val 


Thr Gly 


Thr 


He 


Thr 






140 








Ala 


Leu 


Ser Arg 


Gin 


Pro 


Ala 


10 








15 




Gly Pro Lys Lys 


Gly 


Ser 


Val 








30 






Val 


Asn 


Phe Leu 


Phe 


Tyr 


Phe 






45 








Ala 


Leu 


Leu Leu 


Glu Arg 


Cvs 






60 








Phe 


Ser 


He Ser 


Phe 


He 


Glu 




75 








80 


Cys 


Ser 


Ser Glu 


Glu 


Gin 


Cys 


90 








95 




Ala 


Ser 


Tyr Glu 


Phe 


Met 


Arg 








110 






He Arg Lys Val 


Thr Gly 


Lys 






125 








Glu 


Glu 


Ala Arg 


Phe 


Gin 


Leu 






140 








Pro Arg Trp Ala 


Asp 


His 


Leu 


10 








15 




Gin 


Pro 


Gly Glu 


Ala 


Pro 


Pro 








30 






Gly 


Tyr Gly Gly 


Gly 


Cys 


Leu 






45 








Arg 


Glu 


Asn His 


Leu 


Ser 


Pro 






60 








Leu 


Cys 


His Cys 


Thr 


Pro 


Ala 




75 








80 


Lys 


He 


Glu Lys 


He 


Ser 


Gin 


90 








95 





wo 02/083898 
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<400> 265 



Met Asp Pro Asn 


Cys 


Ser Cys 


Ala Ala Gly Val 


Ser 


Cys 


Thr 


Cys 


Ala 


1 


5 








10 










15 




Ser Ser Cys Lys 


Cys 


Lys 


Glu 


Cys 


Lys Cys 


Thr 


Ser 


Cys 


Lys 


Lys 


Ser 


20 










25 








30 






Cys Cys Ser Trp Cys 


Thr Gly 


Pro 


Thr Leu 


Val 


Asp 


Thr 


Cys 


Pro 


Ala 


35 








40 








45 








Leu Pro Pro Val 


Cys 


Leu 


Ser 


Gin Arg Ser 


Ser 


Gly Met Asn 


Leu Arg 


50 






55 








60 










Thr His Val Gin 


Trp 


Glu 


Val 


Arg 


Pro Pro 


Leu 


Asn 


He 


Gin 






65 




70 








75 












<210> 266 
























<211> 92 
























<212> PRT 
























<213> Homo sapiens 






















<400> 266 
























Met Ser Pro Gin 


Thr 


His 


Ser 


Gin Thr Cys 


He 


Arg Asn 


Leu 


Val 


Thr* 


1 


5 








10 










15 




Cys lie Asn Tyr 


Pro 


Arg Thr Ser Thr Gly Cys 


Lys Gly Thr 


1 nr 


1 nr 


20 










25 








30 






Gin Arg He Met 


Glu 


Pro 


Val 


Glu 


Leu Glu 


Val 


Glu 


Gly Thr 


u 




35 








40 








45 








Asp Asn Ala Lys 


Thr 


Cys Gly Ser Leu Gly Arg 


Gly Asn Glu 




Thr 

X tlx 


50 






55 








60 










Met Leu Arg Gly 


Gly 


Phe 


Ser 


Met 


Asn Thr 


Thr 


Val 


Gly Gin 






65 




70 








75 












Ser Lys Gin Thr 


His 


His 


Thr 


Ser 


Thr Thr 


Ser 


Ser 












85 








90 














<210> 267 
























<211> 51 
























<212> PRT 
























<213> Homo sapiens 






















<400> 267 
























Met Leu Ala Lys 


Thr 


Gly Val 


His 


His Tyr 


Ser 


Gly Asn Asn 


lie 


Glu 


1 


5 








10 










15 




Leu Gly Thr Ala 


Cys 


Gly Lys 


Tyr Tyr Arg Val 


Cys 


Thr 


Leu 


Ala 


He 


20 










25 








30 






He Asp Pro Gly Asp 


Ser Asp 


He 


He Arg Ser 


Met 


Pro 


Glu 


Gin 


Thr 


35 








40 








45 








Gly Glu Lys 
























50 
























<210> 268 
























<211> 99 
























<212> PRT 
























<213> Homo sapiens 






















<400> 268 
























Met Lys Lys Lys 


Glu 


Glu 


Thr 


Thr 


Leu Ser 


Glu 


Met 


Glu 


Pro 


Val 


Glu 


1 


5 








10 










15 




Pro Gin Tyr Gin 


Leu 


Val 


Asn 


Ala 


Glu Ser 


Thr 


Ser 


Pro 


Phe 


Leu 


His 


20 










25 








30 






Cys Leu Arg Glu 


Val 


He 


Gly Glu 


Tyr Ser 


Val 


His 


Glu 


Phe 


Ser 


Leu 


35 








40 








45 








Leu Gly Lys Thr 


Glu 


Ser 


Gin 


Gly 


He Gly Leu 


Trp 


He 


Ala 


Leu 


Val 


50 






55 








60 










Val Phe Leu Ser 


Phe 


Leu 


He 


Phe 


Ser Thr 


Ser 


Phe 


Tyr 


He 


Ser 


Asn 


65 




70 








75 








80 
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Ala Glu Gin Pro Phe Phe Lys Glu Pro Pro Thr Glu Ala Ala Lys Glu 
85 90 95 

Leu Ser Leu 



<210> 269 
<211> 128 
<212> PRT 

<213> Homo sapiens 



<400> 269 



Met 


Ala 


Gly 


Asn 


Leu Leu Ser Gly Ala Gly Arg Arg Leu Trp 


Asp 


Trp 


1 








5 10 






15 




Val 


Pro 


Leu 


Ala 


Cys Arg Ser Phe Ser Leu Gly Val 


Pro 


Arg 


Leu 


He 








20 


25 




30 






Gly 


He 


Arg 


Leu 


Thr Leu Pro Pro Pro Lys Val Val Asp Arg 


Trp Asn 






35 




40 


45 








Glu 


Lys 


Arg 


Ala 


Met Phe Gly Val Tyr Asp Asn He 


Gly 


He 


Leu 


Gly 




50 






55 60 










Asn 


Phe 


Glu 


Lys 


His Pro Lys Glu Leu He Arg Gly 


Pro 


He 


Trp 


Leu 


65 








70 75 








80 


Arg 


Gly 


Trp 


Lys 


Gly Asn Glu Leu Gin Arg Cys He 


Arg 


Lys 


Arg 


Lys 










85 90 






95 




Met 


Val 


Gly 


Ser 


Arg Met Phe Ala Asp Asp Leu His 


Asn 


Leu 


Asn 


Lys 








100 


105 




110 






Arg 


He 


Arg 


Tyr 


Leu Tyr Lys His Phe Asn Arg His 


Gly 


Lys 


Phe 


Arg 






115 




120 


125 









<210> 270 
<211> 52 
<212> PRT 

<213> Homo sapiens 



<400> 270 

Met Phe Leu Thr Leu Ala Asp Thr 
1 5 
Leu Leu Asn Val Tyr Glu Gly Glu 
20 

Leu Leu Glu Ser Trp He He Val 

35 40 
He Ser Ser His 
50 



Cys Lys Leu Arg Gly Met Ser Phe 

10 15 
Ala Thr Val Ser Ser Val Leu Glu 
25 30 
Gly Asn Glu Arg Tyr Phe Asp Gly 
45 



<210> 271 
<2n> 84 
<212> PRT 

<213> Homo sapiens 



<400> 271 












Met Lys 


Val 


Lys 


He 


Lys 


Cys 


Trp 


1 






5 








Val Ala 


Asn 


Asp 


Glu 


Asn 


Cys 


Gly 






20 










Cys Cys 


Pro 


Asp 


Cys 


Lys 


Val 


Pro 




35 










40 


Gly Gin 


Cys 


Ser 


His 


Cys 


Phe 


His 


50 










55 




His Ala 


Gin 


Gin 


Val 


Gin 


Gin 


His 


65 








70 






Lys Phe 


Lys 


Glu 











Asn Gly Val Ala Thr Trp Leu Trp 

10 15 
He Cys Arg Met Ala Phe Asn Gly 
25 30 
Gly Asp Asp Cys Pro Leu Val Trp 
45 

Met His Cys He Leu Lys Trp Leu 

60 

Cys Pro Met Cys Arg Gin Glu Trp 
75 80 



<210> 272 
<211> 84 
<212> PRT 
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<213> Homo sapiens 
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<400> 272 



Met Lys 


Val 


Lys lie 


Lys Cys Trp Asn Gly 


Val Ala 


Thr 


Trp Leu Trp 


1 




5 


10 






15 


Val Ala 


Asn Asp Glu Asn Cys Gly lie Cys 


Arg Met 


Ala 


Phe Asn Gly 






20 


25 






30 


Cys Cys 


Pro 


Asp Cys 


Lys Val Pro Gly Asp 


Asp Cys 


Pro 


Leu Val Trp 




35 




40 




45 




Gly Gin 


Cys 


Ser His 


Cys Phe His Met His 


Cys lie 


Leu 


Lys Trp Leu 


50 






55 


60 






His Ala 


Gin 


Gin Val 


Gin Gin His Cys Pro 


Met Cys 


Arg Gin Glu Trp 


65 






70 


75 




80 


Lys Phe 


Lys 


Glu 











<210> 273 
<211> 84 
<212> PRT 

<213> Homo sapiens 



<4Q0> 273 


























Met Lys Val 


Lys 


He 


Lys 


Cys 


Trp Asn 


Gly 


Val Ala 


Thr 


Trp 


Leu 


Trp 


1 




5 










10 








15 




Val Ala Asn 


Asp 


Glu Asn Cys Gly 


He Cys 


Arg Met 


Ala 


Phe Asn Gly 




20 










25 








30 






Cys Cys Pro 


Asp 


Cys 


Lys 


Val 


Pro 


Gly Asp 


Asp Cys 


Pro 


Leu 


Val 


Trp 


35 










40 








45 








Gly Gin Cys 


Ser 


His 


Cys 


Phe 


His 


Met 


His 


Cys He 


Leu 


Lys 


Trp 


Leu 


50 








55 








60 










His Ala Gin 


Gin 


Val 


Gin 


Gin 


His 


Cys 


Pro 


Met Cys 


Arg 


Gin 


Glu 


Trp 


65 






70 










75 








80 


Lys Phe Lys 


Glu 
























<210> 274 


























<211> 84 


























<212> PRT 


























<213> Homo sapiens 






















<400> 274 


























Met Lys Val 


Lys 


He 


Lys 


Cys 


Trp Asn 


Gly 


Val Ala 


Thr 


Trp 


Leu 


Trp 


1 




5 










10 








15 




Val Ala Asn 


Asp 


Glu 


Asn 


Cys Gly 


He 


Cys 


Arg Met 


Ala 


Phe 


Asn 


Gly 




20 










25 








30 






Cys Cys Pro 


Asp 


Cys 


Lys 


Val 


Pro 


Gly Asp 


Asp Cys 


Pro 


Leu 


Val 


Trp 


35 










40 








45 








Gly Gin Cys 


Ser 


His 


Cys 


Phe 


His 


Met 


His 


Cys He 


Leu 


Lys 


Trp 


Leu 


50 








55 








60 










His Ala Gin 


Gin 


Val 


Gin 


Gin 


His 


Cys 


Pro 


Met Cys 


Arg Gin Glu Trp 


65 






70 










75 








80 


Lys Phe Lys 


Glu 

























<210> 275 
<211> 84 
<212> PRT 

<213> Homo sapiens 



<400> 275 

Met Lys Val Lys He Lys Cys Trp 
1 5 
Val Ala Asn Asp Glu Asn Cys Gly 
20 

Cys Cys Pro Asp Cys Lys Val Pro 
35 40 



Asn Gly Val Ala Thr Trp Leu Trp 

10 15 
He Cys Arg Met Ala Phe Asn Gly 
25 30 
Gly Asp Asp Cys Pro Leu Val Trp 
45 
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Gly Gin Cys Ser His 


Cys Phe His 


Met 


His 


Cys 


He 


Leu 


Lys 


Trp Leu 


50 


55 








60 








His Ala Gin Gin Val 


Gin Gin His 


Cys 


Pro 


Met 


Cys 


Arg 


Gin 


Glu Trp 


bo 


70 






75 








80 


Lys Phe Lys Glu 


















<210> 276 


















<211> 80 


















<212> PRT 


















<213> Homo sapiens 


















<400> 276 


















Met Ala Thr Pro Thr 


Gin Thr Pro 


Thr Lys Ala 


Pro 


Glu 


Glu 


Pro Asp 


1 ^ 5 






10 










15 


Pro Phe Tyr Tyr Asp 


Tyr Asn Thr 


Val 


Gin 


Thr 


Val 


Gly 


Met 


Thr Leu 


20 




25 










30 




Ala Thr lie Leu Phe 


Leu Leu Gly 


He 


Leu 


He 


Val 


He 


Ser 


Lys Lys 


35 


40 










45 






Val Lys Cys Arg Lys 


Ala Asp Ser 


Arg 


Ser 


Glu 


Ser 


Pro 


Thr 


Cys Lys 


50 


55 








60 








Ser Cys Lys Ser Glu 


Leu Pro Ser 


Ser 


Ala 


Pro Gly Gly 


Gly Gly Val 


65 


70 






75 








80 



<210> 277 
<211> 127 
<212> PRT 

<213> Homo sapiens 



<400> 277 



Met 


Ser 


Phe 


Ser 


Gly 


Lys 


Tyr Gin 


Leu 


Gin 


Ser Gin Glu 


Asn 


Phe 


Glu 


1 








5 








10 






15 




Ala 


Phe 


Met 


Lys 
20 


Ala 


He 


Gly Leu 


Pro 
25 


Glu 


Glu Leu He 


Gin 
30 


Lys 


Gly 


Lys 


Asp 


He 
35 


Lys 


Gly 


Val 


Ser Glu 
40 


He 


Val 


Gin Asn Gly 
45 


Lys 


His 


Phe 


Lys 


Phe 
50 


Thr 


He 


Thr 


Ala 


Gly Ser 
55 


Lys 


Val 


He Gin Asn 
60 


Glu 


Phe 


Thr 


Val 


Gly 


Glu 


Glu 


Cys 


Glu 


Leu Glu 


Thr 


Met 


Thr Gly Glu 


Lys 


Val 


Lys 


65 










70 








75 






80 


Thr 


Val 


Val 


Gin 


Leu 


Glu Gly Asp 


Asn 


Lys 


Leu Val Thr 


Thr 


Phe 


Lys 










85 








90 






95 




Asn 


He 


Lys 


Ser 


Val 


Thr 


Glu Leu 


Asn Gly Asp He He 


Thr 


Asn 


Thr 








100 








105 






110 






Met 


Thr 


Leu 
115 


Gly 


Asp 


He 


Val Phe 
120 


Lys 


Arg 


He Ser Lys 
125 


Arg 


He 





<210> 278 
<211> 87 
<212> PRT 

<213> Homo sapiens 



<400> 278 



Asp 


Ala 


Val 


Ala 


Val 


Gly Lys Phe 


Ser 


Gin 


Trp 


Pro 


Pro 


Glu Arg 


Ser 


1 








5 








10 








15 




Pro 


Ala 


Asp 


Ala 


Leu 


Cys 


Asp Gly 


Lys 


Lys 


Ala 


Ala 


Gly 


Pro Thr 


Ser 








20 








25 










30 




Gly 


Ala 


Phe 


Tyr 


Asn 


Glu 


Leu Arg 


Ala 


Leu 


Glu 


Ser 


Leu 


Pro Gly Asn 






35 








40 










45 






Tyr 


Ser 


Ser 


Ala 


Ser 


Thr 


Val Val 


Leu 


Leu 


Trp 


He 


Ala 


Ser Arg 


Gin 




50 










55 








60 








Lys 


Ser 


Gin 


Ser 


Ser 


Arg 


Asn Ser 


Gin 


Cys 


Ser Arg 


Lys 


Asp Lys 


Gly 


65 










70 








75 








80 


Gly Lys 


Gin 


Glu 


Glu 


His 


Glu 
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85 



<210> 279 
<211> 96 
<212> PRT 

<213> Homo sapiens 



<400> 279 








Met Pro 


Gly 


Pro 


Thr Pro Ser 


Gly 


1 






5 




Ser Pro 


Ser 


Lys 


Ala Val Ala 


Ala 






20 






Gin Arg 


Lys 


Asn 


Ala Ser Cys 


Gly 




35 






40 


Ser Ala 


Gly Thr 


Gly Gly Met 


Trp 


50 






55 




Gly Leu 


Lys 


Val 


Gly Pro Val 


Pro 


65 






70 




lie Ala 


Ser 


Val 


Phe Met Leu 


His 



85 



Thr Asn Val Gly Ser Ser Gly Arg 

10 15 
Arg Ala Ala Gly Ser Thr Val Arg 
25 30 
Thr Arg Ser Ala Gly Arg Thr Thr 
45 

Arg Phe Tyr Thr Glu Asp Ser Pro 

60 

Val Leu Val Met Ser Leu Leu Phe 

75 80 
lie Trp Gly Lys Tyr Thr Arg Ser 
90 95 



<210> 280 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 280 

Met Ser Ser Trp Ser Leu Met Cys 

1 5 
Ala Arg Leu Leu Tyr Glu Ala Asn 
20 

Gin Thr Asp Gin Ala Val Glu Phe 

35 40 
Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



Met Gin Asp Met Gly Asn Thr Lys 

10 15 
Leu Pro Glu Asn Phe Arg Arg Pro 
25 30 
Phe lie Arg Asp Lys Tyr Glu Lys 
45 



<210> 281 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 281 

Met Ser Ser Trp Ser Leu Met Cys 
1 5 
Ala Arg Leu Leu Tyr Glu Ala Asn 
20 

Gin Thr Asp Gin Ala Val Glu Phe 

35 40 
Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



Met Gin Asp Met Gly Asn Thr Lys 

10 15 
Leu Pro Glu Asn Phe Arg Arg Pro 
25 30 
Phe lie Arg Asp Lys Tyr Glu Lys 
45 



<210> 282 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 282 

Met Ser Ser Trp Ser Leu Met Cys 
1 5 
Ala Arg Leu Leu Tyr Glu Ala Asn 
20 

Gin Thr Asp Gin Ala Val Glu Phe 
35 40 



Met Gin Asp Met Gly Asn Thr Lys 

10 15 
Leu Pro Glu Asn Phe Arg Arg Pro 
25 30 
Phe lie Arg Asp Lys Tyr Glu Lys 
45 
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Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



<210> 283 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 283 

Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met Gly Asn Thr Lys 

15 10 15 

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro 

20 25 30 

Gin Thr Asp Gin Ala Val Glu Phe Phe lie Arg Asp Lys Tyr Glu Lys 

35 40 45 

Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



<210> 284 
<211> 56 
<212> PRT 

<213> Homo sapiens 



<400> 284 

Met Ser Ser Trp Ser Leu Met Cys 
1 5 
Ala Arg Leu Leu Tyr Glu Ala Asn 
20 

Gin Thr Asp Gin Ala Val Glu Phe 

35 40 
Lys Lys Tyr Tyr Glu Lys Met Pro 

50 55 



Met Gin Asp Met Gly Asn Thr Lys 

10 15 
Leu Pro Glu Asn Phe Arg Arg Pro 
25 30 
Phe lie Arg Asp Lys Tyr Glu Lys 
45 



<210> 285 
<211> 56 
<212> PRT 

<213> Homo sapiens 



<400> 285 

Met Ser Ser Trp Ser Leu Met Cys 
1 5 
Ala Arg Leu Leu Tyr Glu Ala Asn 
20 

Gin Thr Asp Gin Ala Val Glu Phe 

35 40 
Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



Met Gin Asp Met Gly Asn Thr Lys 

10 15 
Leu Pro Glu Asn Phe Arg Arg Pro 
25 30 
Phe lie Arg Asp Lys Tyr Glu Lys 
45 



<210> 286 

<211> 56 
<212> PRT 

<213> Homo sapiens 



<400> 286 

Met Ser Ser Trp Ser Leu Met Cys 
1 5 
Ala Arg Leu Leu Tyr Glu Ala Asn 
20 

Gin Thr Asp Gin Ala Val Glu Phe 

35 40 
Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



Met Gin Asp Met Gly Asn Thr Lys 

10 15 
Leu Pro Glu Asn Phe Arg Arg Pro 
25 30 
Phe lie Arg Asp Lys Tyr Glu Lys 
45 



wo 02/083898 
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<210> 287 

<211> 56 

<212> PRT 

<213> Homo sapiens 

<400> 287 

Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met Gly Asn Thr Lys 

15 10 15 

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro 

20 25 30 

Gin Thr Asp Gin Ala Val Glu Phe Phe He Arg Asp Lys Tyr Glu Lys 

35 40 45 

Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 

<210> 288 

<211> 56 

<212> PRT 

<213> Homo sapiens 

<400> 288 

Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met Gly Asn Thr Lys 

15 10 15 

Ala Arg Leu Leu Tyr Glu Ala Asn Leu, Pro Glu Asn Phe Arg Arg Pro 

20 25 30 

Gin Thr Asp Gin Ala Val Glu Phe Phe He Arg Asp Lys Tyr Glu Lys 

35 40 45 

Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 

<210> 289 

<211> 56 

<212> PRT 

<213> Homo sapiens 

<400> 289 

Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met Gly Asn Thr Lys 

15 10 15 

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro 

20 25 30 

Gin Thr Asp Gin Ala Val Glu Phe Phe He Arg Asp Lys Tyr Glu Lys 

35 40 45 

Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 

<210> 290 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 290 

Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met Gly Asn Thr Lys 

15 10 15 

Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn Phe Arg Arg Pro 

20 25 30 

Gin Thr Asp Gin Ala Val Glu Phe Phe He Arg Asp Lys Tyr Glu Lys 

35 40 45 

Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 

<210> 291 
<211> 56 
<212> PRT 



wo 02/083898 

<213> Homo sapiens 
<400> 291 

Met Ser Ser Trp Ser Leu Met Cys Met 
1 5 

Ala Arg Leu Leu Tyr Glu Ala Asn Leu 

20 25 
Gin Thr Asp Gin Ala Val Glu Phe Phe 

35 40 
Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 
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Gin Asp Met Gly Asn Thr Lys 
10 15 
Pro Glu Asn Phe Arg Arg Pro 
30 

lie Arg Asp Lys Tyr Glu Lys 
45 



<210> 292 
<2H> 56 
<212> PRT 

<213> Homo sapiens 



<400> 292 

Met Ser Ser Trp Ser Leu Met Cys 

1 5 

Ala Arg Leu Leu Tyr Glu Ala Asn 

20 

Gin Thr Asp Gin Ala Val Glu Phe 

35 40 
Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



Met Gin Asp Met Gly Asn Thr Lys 

10 15 
Leu Pro Glu Asn Phe Arg Arg Pro 
25 30 
Phe lie Arg Asp Lys Tyr Glu Lys 
45 



<210> 293 

<211> 56 
<212> PRT 

<213> Homo sapiens 



<400> 293 

Met Ser Ser Trp Ser Leu Met Cys 
1 5 
Ala Arg Leu Leu Tyr Glu Ala Asn 
20 

Gin Thr Asp Gin Ala Val Glu Phe 

35 40 
Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



Met Gin Asp Met Gly Asn Thr Lys 

10 15 
Leu Pro Glu Asn Phe Arg Arg Pro 
25 30 
Phe lie Arg Asp Lys Tyr Glu Lys 
45 



<210> 294 
<211> 56 
<212> PRT 

<213> Homo sapiens 



<400> 294 

Met Ser Ser Trp Ser Leu Met Cys 

1 5 
Ala Arg Leu Leu Tyr Glu Ala Asn 
20 

Gin Thr Asp Gin Ala Val Glu Phe 

35 40 
Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



Met Gin Asp Met Gly Asn Thr Lys 

10 15 
Leu Pro Glu Asn Phe Arg Arg Pro 
25 30 
Phe He Arg Asp Lys Tyr Glu Lys 
45 



<210> 295 

<211> 122 

<212> PRT 

<213> Homo sapiens 



<220> 
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<221> UNSURE 
<222> 113 

<223> Xaa = Ala, Asp 



<400> 295 



Met 


Glu 


Ala 


Ser 


Ala 


Leu 


Thr 


Ser 


Ser 


1 








5 










Val 


Val 


Arg 


Val 


Ala 


Ser 


Gly Ser Ala 








20 










25 


He 


Ala 


Thr 


Val 


Val 


He 


Gly Gly Val 






35 










40 




Met 


Val 

50 


Leu 


Ser 


Ala 


Met 


Gly 

55 


Phe 


Thr 


Ser 


He 


Ala 


Ala 


Lys 


Met 


Met 


Ser 


Ala 


65 










70 








Gly 


Val 


Ala 


Ser 


Gly Ser 


Leu 


Val 


Ala 










85 










Thr 


Gly 


Leu 


Ser 


Gly Leu 


Thr 


Lys 


Phe 








100 










105 


Xaa 


He 


Ala 
115 


Ala 


Val 


He 


Ala 


Arg 
120 


Phe 



<210> 296 
<211> 122 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 79 

<223> Xaa = Gly, Arg 



<400> 296 



Met 


Glu 


Ala 


Ser 


Ala 


Leu 


Thr Ser Ser 


1 








5 






Val 


Val 


Arg 


Val 


Ala 


Ser 


Gly Ser Ala 








20 






25 


He 


Ala 


Thr 


Val 


Val 


He 


Gly Gly Val 






35 








40 


Met 


Val 


Leu 


Ser 


Ala 


Met 


Gly Phe Thr 




50 










55 


Ser 


He 


Ala 


Ala 


Lys 


Met 


Met Ser Ala 


65 










70 




Gly 


Val 


Ala 


Ser 


Gly 


Ser 


Leu Val Ala 










85 






Thr 


Gly 


Leu 


Ser 


Gly Leu 


Thr Lys Phe 








100 






105 


Ala 


He 


Ala 


Ala 


Val 


He 


Ala Arg Phe 






115 








120 



<210> 297 

<211> 74 

<212> PRT 

<213> Homo sapiens 

<400> 297 

Met Ala Ser Ser Thr Val Pro Val Ser 
1 5 

Thr Pro Glu He Pro Asp Asn Val Gly 

20 25 
Arg Phe Ala Thr Asp Arg Asn Asp Phe 

35 40 
Leu Gly Leu Phe Ala Ala Gly Val Trp 
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Ala 


Val 


Thr Ser 


Val Ala Lys 


10 






15 


Val 


Val 


Leu Pro Leu Ala Arg 








30 


Val 


Ala 


Met Ala 


Ala Val Pro 






45 




Ala 


Ala 


Gly He 


Ala Ser Ser 






60 




Ala 


Ala 


He Ala 


Asn Gly Gly 




75 




80 


Thr 


Leu 


Gin Ser 


Leu Gly Ala 


90 






95 


He 


Leu 


Gly Ser 


He Gly Ser 








110 


Tyr 








Ala 


Val 


Thr Ser 


Val Ala Lys 


10 






15 


Val 


Val 


Leu Pro 


Leu Ala Arg 








30 


Val 


Ala 


Met Ala 


Ala Val Pro 






45 




Ala 


Ala 


Gly He 


Ala Ser Ser 






60 




Ala 


Ala 


He Ala 


Asn Xaa Gly 




75 




80 


Thr 


Leu 


Gin Ser 


Leu Gly Ala 


90 






95 


He 


Leu 


Gly Ser 


He Gly Ser 








110 


Tvr 








Ala 


Ala 


Gly Ser Ala Asn Glu 


10 






15 


Asp 


Trp 


Leu Arg Gly Val Tyr 








30 


Arg 


Arg 


Asn Leu 


He Leu Asn 






45 




Leu 


Ala 


Arg Asn 


Leu Ser Asp 
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50 








55 










60 










lie Asp Leu 


Met 


Ala 


Pro 


Gin 


Pro 


Gly 


Val 














65 






70 






















<210> 298 




























<211> 74 




























<212> PRT 




























<213> Homo sapiens 
























<400> 298 




























Met Ala Ser 


Ser 


Thr 


Val 


Pro 


Val 


Ser 


Ala 


Ala 


Gly 


Ser 


Ala 


Asn 


Glu 


1 




5 










10 










15 




Thr Pro Glu 


He 


Pro 


Asp Asn 


Val 


Gly 


Asp 


Trp 


Leu Arg 


Gly Val 


Tyr 




20 










25 










30 






Arg Phe Ala 


Thr Asp 


Arg Asn 


Asp 


Phe 


Arg 


Arg 


Asn 


Leu 


He 


Leu 


Asn 


35 










40 










45 








Leu Gly Leu 


Phe 


Ala 


Ala Gly 


Val 


Trp 


Leu 


Ala 


Arg Asn 


Leu 


Ser Asp 


50 








55 










60 










He Asp Leu Met Ala 


Pro 


Gin 


Pro 


Gly 


Val 














65 






70 






















<210> 299 




























<211> 147 




























<212> PRT 




























<213> Homo sapiens 
























<400> 299 




























Met Asp Val 


Leu 


Val 


Ser 


Glu 


Cys 


Ser 


Ala 


Arg 


Leu 


Leu 


Gin 


Gin 


Glu 


1 




5 










10 










15 




Glu Glu He 


Lys 


Ser 


Leu 


Thr 


Ala 


Glu 


He 


Asp 


Ara 


Leu 


Lys 


Asn 


Cys 




20 










25 










30 






Gly Cys Leu 


Gly 


Ala 


Ser 


Pro 


Asn 


Leu 


Glu 


Gin 


Leu 


Gin 


Glu 


Glu 


Asn 


35 










40 










45 








Leu Lys Leu 


Lys 


Tyr 


Arg Leu 


Asn 


He 


Leu 


Arg 


Lys 


Ser 


Leu 


Gin 


Ala 


50 








55 










60 










Glu Arg Asn 


Lys 


Pro 


Thr 


Lys 


Asn 


Met 


He 


Asn 


He 


He 


Ser Arg 


Leu 


65 






70 










75 










80 


Gin Glu Val 


Phe 


Gly 


His 


Ala 


He 


Lys 


Ala 


Ala 


Tyr 


Pro 


Asp 


Leu 


Glu 






85 










90 










95 




Asn Pro Pro 


Leu 


Leu 


Val 


Thr 


Pro 


Ser 


Gin 


Gin 


Ala 


Lys 


Phe 


Gly Asp 




100 










105 










110 






Tyr Gin Cys 


Asn 


Ser 


Ala 


Met 


Gly 


He 


Ser 


Gin 


Val 


Met 


Tyr 


Cys 


His 


115 










120 










125 








Asp Ser Trp 


Leu 


Phe 


Asp 


Phe 


Phe 


Lys 


Tyr 


Tyr 


Tyr 


His 


His 


Cys 


His 


130 








135 










140 










Leu Gin Lys 




























145 




























<210> 300 




























<211> 97 




























<212> PRT 




























<213> Homo sapiens 
























<400> 300 




























Met Ser Ser 


Ala 


Pro 


Glu 


Pro 


Pro 


Thr 


Phe 


Lys 


Lys 


Glu 


Pro 


Pro 


Lys 


1 




5 










10 










15 




Glu Lys Glu 


Phe 


Gin 


Ser 


Pro 


Gly 


Leu 


Arg 


Gly 


Val Arg 


Thr 


Thr 


Thr 




20 










25 










30 






Leu Phe Arg Ala Val 


Asn 


Pro 


Glu 


Leu 


Phe 


He 


Lys 


Pro 


Asn 


Lys 


Pro 


35 










40 










45 








Val Met Ala 


Phe Gly 


Leu 


Val 


Thr 


Leu 


Ser 


Leu 


Cys 


Val 


Ala 


Tyr 


He 


50 








55 










60 










Gly Tyr Leu 


His 


Ala 


He 


Gin 


Glu 


Asn 


Lys 


Lys 


Asp 


Leu 


Tyr 


Glu 


Ala 
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65 70 75 

lie Asp Ser Glu Gly His Ser Tyr Met Arg Arg Lys Thr Ser 
85 90 

Asp 

<210> 301 
<211> 98 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 30 

<223> Xaa « Lys,Asn 



<400> 301 



Met 


Ala 


Ala 


Glu 


Pro 


Leu 


Thr Glu Leu Glu 


Glu 


Ser 


lie Glu 


Asn 


Val 


1 








5 




10 








15 




Val 


Thr 


Thr 


Phe 


Phe 


Thr 


Phe Ala Arg Gin Glu Gly Arg Xaa 


Asp 


Ser 








20 






25 






30 






Leu 


Ser 


Val 


Asn 


Glu 


Phe 


Lys Glu Leu Val 


Thr 


Gin 


Gin Leu 


Pro 


His 






35 








40 . 






45 






Leu 


Leu 


Lys 


Asp 


Val 


Gly Ser Leu Asp Glu 


Lys 


Met 


Lys Ser 


Leu 


Asp 




50 










55 




60 








Val 


Asn 


Gin 


Asp 


Ser 


Glu 


Leu Lys Phe Asn 


Glu 


Tyr 


Trp Arg 


Leu 


lie 


65 










70 




75 








80 


Gly Glu 


Leu 


Ala 


Lys 


Glu 


lie Arg Lys Lys 


Lys 


Asp 


Leu Lys 


lie Arg 










85 




90 








95 





Lys Lys 



<210> 302 
<211> 115 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 103 

<223> Xaa = Asp, Tyr 
<220> 

<221> UNSURE 
<222> 100 

<223> Xaa = Gly, Ser 
<400> 302 



Met 


Val 


Ala 


Ala 


Lys 


Lys Thr 


Lys 


Lys Ser 


Leu Glu 


Ser He 


Lys 


Ser 


1 








5 






10 






15 




Arg 


Leu 


Gin 


Leu 


Val 


Met Lys 


Ser Gly Lys 


Tyr Val 


Leu Gly 


Tyr 


Lys 








20 








25 




30 






Gin 


Thr 


Leu 


Lys 


Met 


He Arg 


Gin 


Gly Lys 


Ala Lys 


Leu Val 


He 


Leu 






35 








40 






45 






Ala 


Asn 


Asn 


Cys 


Pro 


Ala Leu Arg 


Lys Ser 


Glu He 


Glu Tyr 


Tyr 


Ala 




50 








55 






60 








Met 


Leu 


Ala 


Lys 


Thr Gly Val 


His 


His Tyr 


Ser Gly Asn Asn 


He 


Glu 


65 










70 






75 






80 


Leu Gly 


Thr 


Ala 


Cys 


Gly Lys 


Tyr 


Tyr Arg 


Val Cys 


Thr Leu 


Ala 


He 










85 






90 






95 




He 


Asp 


Pro 


Xaa 


Asp 


Ser Xaa 


He 


He Arg Ser Met 


Pro Glu 


Gin 


Thr 








100 








105 




110 







Gly Glu Lys 
115 
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80 

Lys Trp . 
95 
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<210> 303 
<211> 102 
<212> PRT 

<213> Homo sapiens 
<400> 303 



Met 

1 


Ala 


Gly 


Gin 


Ala 

5 


Phe 


Arg 


Lys 


Leu 


Val 


Glu 


Arg 


Ser 


Pro 


Ala 


Glu 








20 










Leu 


Pro 


Glu 


Lys 


Ser 


Gin 


Gly 


Lys 






35 










40 


Val 


Gly 


Ser 


Gly 


Ser 


Lys 


Gly 


Lys 




50 










55 




Val 


Lys 


Val 


Gly 


Asp 


Lys 


Val 


Leu 


65 










70 






Val 


Val 


Leu 


Asp 


Asn 


Lys Asp Tyr 










85 








Leu Gly 


Lys 


Tyr 


Val 


Asp 












100 











<210> 304 
<211> 102 
<212> PRT 

<213> Homo sapiens 



<400> 304 



Met Ala 


Gly 


Gin 


Ala 


Phe Arg Lys 


1 






5 




Leu Val 


Glu 


Arg 


Ser 


Ala Ala Glu 






20 






Leu Pro 


Glu 


Lys 


Ser 


Gin Gly Lys 




35 






40 


Val Gly 


Ser 


Gly 


Ser 


Lys Gly Lys 


50 








55 


Val Lys 


Val 


Gly 


Asp Lys Val Leu 


65 








70 


Val Val 


Leu 


Asp 


Asp 


Lys Asp Tyr 








85 




Leu Gly 


Lys 


Tyr 


Val 


Asp 






100 







<210> 305 

<211> 116 

<212> PRT 

<213> Homo sapiens 

<400> 305 



Met Ser 


Ala 


Thr 


Ala 


Ala Thr Ala 


1 






5 




Gly Gly 


Pro 


Pro 


Ala 


Pro Pro Pro 






20 






Gin Gin 


Thr 


Gin 


Ala Gin Val Asp 




35 






40 


Asn Val 


Asp 


Lys 


Val 


Leu Glu Arg 


50 








55 


Asp Arg 


Ala 


Asp 


Ala 


Leu Gin Ala 


65 








70 


Ala Ala 


Lys 


Leu 


Lys 


Arg Lys Tyr 








85 




lie He 


Leu 


Gly 


Val 


He Cys Ala 






100 






Tyr Phe 


Ser 


Thr 







179 



Phe 


Leu 


Pro 


Leu Phe Asp Arg 


Val 




10 




15 




Thr 


Val 


Thr 


Lys Gly Gly He 


Met 


25 






30 




Val 


Leu 


Gin 


Ala Thr Val Val 


Ala 








45 




Gly 


Gly 


Glu 


He Gin Pro Val 


Ser 








60 




Leu 


Pro 


Glu 


Tyr Gly Gly Thr 


Lys 






75 




80 


Phe 


Leu 


Phe Arg Asp Gly Asp 


He 




90 




95 





Phe 


Leu 


Pro 


Leu Phe Asp Arg 


Val 




10 




15 




Thr 


Val 


Thr 


Lys Gly Gly He 


Met 


25 






30 




Val 


Leu 


Gin 


Ala Thr Val Val 


Ala 








45 




Gly 


Gly 


Glu 


He Gin Pro Val 


Ser 








60 




Leu 


Pro 


Glu 


Tyr Gly Gly Thr 


Lys 






75 




80 


Phe 


Leu 


Phe Arg Asp Gly Asp 


He 




90 




95 





Pro 


Pro 


Ala 


Ala 


Pro 


Ala 


Gly 


Glu 




10 










15 




Asn 


Leu 


Thr 


Ser 


Asn 


Arg 


Arg 


Leu 


25 










30 






Glu 


Val 


Val 


Asp 


He 


Met 


Arg 


Val 










45 








Asp 


Gin 


Lys 


Leu 


Ser 


Glu 


Leu 


Asp 








60 










Gly 


Pro 


Ser 


Gin 


Phe 


Glu 


Thr 


Ser 






75 










80 


Trp 


Trp 


Lys Asn 


Leu 


Lys 


Met 


Met 




90 










95 




He 


He 


Leu 


He 


He 


He 


He 


Val 


105 










110 
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115 

<210> 306 

<211> 97 

<212> PRT 

<213> Homo sapiens 



<400> 306 



Met Ser 


Leu 


Ser 


Leu 


Val 


Phe Arg Ala Ala Ser Tyr 


Phe 


Lys 


Leu 


Val 


1 






5 




10 








15 




Pro Phe 


His 


Ser 


Ser 


Ser 


Ser Asn Gin Phe Leu 


Gin 


Pro 


Pro Gly Trp 






20 


















Val Val 


Leu 


Thr 


Gin 


Thr 


Leu Val Leu Leu His 


Phe 


Glu 


Arg 


Phe 


Ser 




35 








40 




45 








Tyr Gin 


Asn 


Val 


Pro 


Lys 


Ser Ala Gin Gly Lys 


Gly Asn 


Leu 


Gin 


Pro 


50 










55 


60 










Glu Thr 


Asn 


He 


His 


Leu 


Phe His Phe Leu Thr 


Phe 




Lys 


Gin 


He 


65 








70 


75 










80 


Ser Arg 


Asn 


Leu 


Phe 


Asn 


Ser Leu Leu Cys Leu 


Met 


Cys 


Leu 


Thr Tyr 








85 




90 








95 




Phe 






















<210> 307 




















<211> 110 




















<212> PRT 




















<213> Homo sapiens 
















<400> 307 




















Met Ser 


Met 


Thr 


Asp 


Leu Leu Asn Ala Glu Asp 


He 


jjys 


Lys 


Ala 


Val 


1 






5 




10 








15 




Gly Ala 


Phe 


Ser 


Ala 


Thr Asp Ser Phe Asp His 


Lys 




Phe 


Phe 


Gin 






20 






25 






30 






Met Val 


Gly 


Leu 


Lys 


Lys 


Lys Ser Ala Asp Asp Val 


T.\7C 


Lys 


Val 


Phe 




35 








40 




45 








His Met 


Leu 


Asp 


Lys 


Asp 


Lys Ser Gly Phe He 


Glu 


Glu 


Asp 


Glu 


Leu 


50 










55 


60 










Gly Phe 


He 


Leu 


Lys 


Gly Phe Ser Pro Asp Ala 


Arg 


Asp 


Leu 


Ser 


Ala 


65 








70 


75 










80 


Lys Glu 


Thr 


Lys 


Met 


Leu 


Met Ala Ala Gly Asp 


Lys 


Asp 


Gly Asp Gly 








85 




90 








95 




Lys lie 


Gly 


Val 


Asp 


Glu 


Phe Ser Thr Leu Val 


Ala 


Glu 


Ser 










100 






105 






110 






<210> 308 




















<211> 110 




















<212> PRT 




















<213> Homo sapiens 
















<400> 308 




















Met Ser 


Met 


Thr 


Asp 


Leu Leu Asn Ala Glu Asp 


He 


Lys 


Lys 


Ala 


Val 


1 






5 




10 








15 




Gly Ala 


Phe 


Ser 


Ala 


Thr Asp Ser Phe Asp His 


Lys 


Lys 


Phe 


Phe 


Gin 






20 






25 






30 






Met Val 


Gly 


Leu 


Lys 


Lys 


Lys Ser Ala Asp Asp Val 


Lys 


Lys 


Val 


Phe 




35 








40 




45 








His Met 


Leu 


Asp 


Lys 


Asp 


Lys Ser Gly Phe He 


Glu 


Glu 


Asp 


Glu 


Leu 


50 










55 


60 










Gly Phe 


He 


Leu 


Lys 


Gly 


Phe Ser Pro Asp Ala Arg Asp 


Leu 


Ser 


Ala 


65 








70 


75 










80 


Lys Glu 


Thr 


Lys 


Met 


Leu 


Met Ala Ala Gly Asp Lys 


Asp 


Gly Asp Gly 








85 




90 








95 




Lys lie 


Gly 


Val 


Asp 


Glu 


Phe Ser Thr Leu Val 


Ala 


Glu 


Ser 










100 






105 






110 
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<210> 309 
<211> 110 
<212> PRT 

<213> Homo sapiens 
<400> 309 



Met 


Ser 


Met 


Thr 


Asp 


Leu 


Leu 


Asn 


Ala 


Glu 


Asp 


He Lys Lys Ala Val 


1 








5 










10 




15 


Gly Ala 


Phe 


Ser 


Ala 


Thr 


Asp 


Ser 


Phe 


Asp 


His 


Lys Lys Phe Phe Gin 








20 










25 






30 


Met 


Val 


Gly 


Leu 


Lys 


Lys 


Lys 


Ser 


Ala 


Asp 


Asp 


Val Lys Lys Val Phe 






35 










40 








45 


His 


Met 


Leu 


Asp 


Lys 


Asp 


Lys 


Ser 


Gly 


Phe 


He 


Glu Glu Asp Glu Leu 




50 










55 










60 


Gly 


Phe 


He 


Leu 


Lys 


Gly 


Phe 


Ser 


Pro 


Asp 


Ala 


Arg Asp Leu Ser Ala 


65 










70 










75 


80 


Lys 


Glu 


Thr 


Lys 


Met 


Leu 


Met 


Ala 


Ala 


Gly 


Asp 


Lys Asp Gly Asp Gly 










85 










90 




95 


Lys 


He 


Gly 


Val 


Asp Glu 


Phe 


Ser 


Thr 


Leu 


Val 


Ala Glu Ser 








100 










105 






110 



<210> 310 
<211> 112 
<212> PRT 

<2.13> Homo sapiens 
<220> 

<221> UNSURE 

<222> 8,33 

<223> Xaa = Asp, Glu 



<400> 310 



Met 


Asp 


Pro 


Arg 


Lys 


Val 


Asn Xaa 


Leu 


Arg 


Ala 


Phe Val Lys 


Met Cys 


1 








5 








10 






15 


Lys 


Gin 


Asp 


Pro 


Ser 


Val 


Leu His 


Thr 


Glu 


Glu 


Met Arg Phe 


Leu Arg 








20 








25 






30 




Xaa 


Trp 


Val 


Glu 


Ser 


Met 


Gly Gly Lys 


Val 


Pro 


Pro Ala Thr 


Gin Lys 






35 








40 








45 




Ala 


Lys 


Ser 


Glu 


Glu 


Asn 


Thr Lys 


Glu 


Glu 


Lys 


Pro Asp Ser 


Lys Lys 




50 










55 








60 




Val 


Glu 


Glu 


Asp 


Leu 


Lys 


Ala Asp 


Glu 


Pro 


Ser 


Ser Glu Glu 


Ser Asp 


65 










70 








75 




80 


Leu 


Glu 


He 


Asp 


Lys 


Glu 


Gly Val 


He 


Glu 


Pro 


Asp Thr Asp 


Ala Pro 










85 








90 






95 


Gin 


Glu 


Met 


Gly 


Asp 


Glu 


Asn Ala 


Glu 


He 


Thr 


Ala Gin His 


Phe Leu 








100 








105 






110 





<210> 311 
<211> 118 
<212> PRT 

<213> Homo sapiens 
<400> 311 



Met 


Val 


Leu Leu Glu Ser 


Glu 


Gin Phe 


Leu 


Thr 


Glu Leu Thr Arg Leu 


1 




5 






10 




15 


Phe 


Gin 


Lys Cys Arg Thr 


Ser 


Gly Ser 


Val 


Tyr 


He Thr Leu Lys Lys 






20 




25 






30 


Tyr Asp Gly Arg Thr Lys 


Pro 


He Pro 


Lys 


Lys 


Gly Thr Val Glu Gly 






35 




40 






45 


Phe 


Glu 


Pro Ala Asp Asn Lys Cys Leu 


Leu 


Arg 


Ala Thr Asp Gly Lys 




50 




55 








60 


Lys 


Lys 


He Ser Thr Val 


Val 


Ser Ser 


Lys 


Glu 


Val Asn Lys Phe Gin 
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65 70 
Met Ala Tyr Ser Asn Leu Leu Arg Ala 
85 

Arg Asp Lys Lys Asn Glu Thr Lys Lys 
100 105 
Ala Ala Thr Ala Ala Gin 
115 
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182 

75 80 
Asn Met Asp Gly Leu Lys Lys 
90 95 
Thr Lys Ala Ala Ala Ala Ala 
110 



<210> 312 
<211> 109 
<212> PRT 

<213> Homo sapiens 
<400> 312 



Met Asp 


Ser 


Leu 


Pro 


Ser Gly Lys 


He 


His 


Arg 


Lys 


Val 


Lys 


He 


He 


1 






5 










10 










15 




Leu Gly 


Arg 


Asn 


Arg 


Lys 


Glu 


Asn 


Leu 


Glu 


Pro 


Asn 


Ala 


Glu 


Phe Asp 






20 










25 










30 






Lys Arg 


Thr 
35 


Glu 


Phe 


He 


Thr 


Gin 
40 


Glu 


Glu 


Asn 


Arg 


He 
45 


Cys 


Ser 


Ser 


Pro Val 


Gin 


Ser 


Leu 


Leu 


Asp 


Leu 


Phe 


Gin 


Thr 


Ser 


Glu 


Glu 


Lys 


Ser 


5b 










55 










60 










Glu Phe 


Leu 


Gly 


Phe 


Thr 


Ser 


Tyr 


Thr 


Glu 


Lys 


Ser 


Gly 


He 


Cys 


Asn 


65 








70 










75 










80 


Val Leu 


Asp 


He 


Trp 


Glu 


Glu 


Glu 


Asn 


Ser Asp 


Asn 


Leu 


Leu 


Thr 


Ala 








85 










90 










95 




Phe Phe 


Ser 


Ser 
100 


Pro 


Ser 


Thr 


Ser 


Thr 
105 


Phe 


Thr 


Gly 


Phe 









<210> 313 
<211> 103 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> 99 

<223> Xaa « Arg, Thr 
<220> 

<221> unsure 
<222> 103 

<223> Xaa = His, Asn 



<400> 313 










Met Ser 


Gly 


Asp 


Gly 


Ala Thr 


Glu 


1 






5 






Lys Val 


Lys 


Lys 


Ala 


Glu Lys 


Lys 






20 








Asp Ala 


Trp 


Ser 


He 


Leu He 


Arg 




35 








40 


Lys Ala 


Arg 


Lys 


Thr 


Tyr Glu Arg 


50 








55 




Gly Arg 


Phe 


Trp 


Lys 


Leu Tyr 


He 


65 








70 




Tyr Phe 


Phe 


Leu 


Tyr 


Gin Tyr 


Cys 








85 






Gin Val 


Xaa 


Asn 


He 


Ala Xaa 








100 









Gin 


Ala 


Ala 


Glu Tyr Val Pro 


Glu 




10 






15 




Leu 


Glu 


Glu 


Asn 


Pro Tyr Asp Leu 


25 








30 




Glu 


Ala 


Gin 


Asn 


Gin Pro He 


Asp 










45 




Leu 


Val 


Ala 


Gin 


Phe Pro Ser 


Ser 








60 






Glu 


Ala 


Glu 


Val 


Thr He Leu 


Phe 






75 






80 


Ser 


He 


His 


Cys 


Ser Asp Arg 


Lys 




90 






95 





<210> 314 
<211> 72 
<212> PRT 
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<213> Homo sapiens 



183 
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<400> 314 

Met Ser Giy Asp Gly Ala Thr Glu 
1 5 
Lys Val Lys Lys Ala Glu Lys Lys 
20 

Asp Ala Trp Ser He Leu He Arg 

35 40 
Lys Ala Arg Lys Thr Tyr Glu Arg 

50 55 
Gly Arg Phe Trp Lys Leu Tyr He 
65 70 



Gin Ala Ala Glu Tyr Val Ser Glu 

10 15 
Leu Glu Glu Asn Pro Tyr Asp Leu 
25 30 
Glu Ala Gin Asn Gin Pro He Asp 
45 

Leu Val Ala Gin Phe Pro Ser Ser 
60 



<210> 315 
<211> 106 
<212> PRT 

<213> Homo sapiens 



<400> 315 




Met 
1 


Leu 


Lys 


Ser 


Val 


Tyr 


Ala 


Arg 








20 


Ser 


Ser 


Ala 


Ala 






35 




Pro 


Gin 


Gin 


Gin 




50 






Ser Gly 


Leu 


Leu 


65 








Tyr 


Lys 


Met 


Leu 


Leu 


Lys 


Gin 


Arg 



100 



Asn Gly Glu Arg 

5 

Lys Met Ala Ala 

Gly Val Ser Gly 
40 

Pro Gin Pro Pro 
55 

Gin Gin Gin Gin 
70 

He Pro Gin Leu 
85 

Glu Ala Asn Trp 



Arg Ser Arg Asn 
10 

Ser Gin Gin Gin 
25 

Pro Ser Ser Ala 

Ala Gin Leu Val 
60 

Gin Asp Phe Asp 
75 

Lys Glu Ser Leu 

90 
He Trp 
105 



Ala Leu Pro Ala 
15 

Ala Ser Ala Ala 
30 

Gly Gly Pro Gly 
45 

Gly Pro Ala Gin 

Pro Val Gin Arg 
80 

Gin Val He Gly 
95 



<210> 316 
<211> 86 
<212> PRT 

<213> Homo sapiens 
<400> 316 



Met 


Ala 


Glu Asp 


Met 


Glu 


Thr 


Lys 


He Lys 


Asn 


Tyr 


Lys 


Thr 


Ala 


Pro 


1 






5 








10 










15 




Phe 


Asp 


Ser Arg 


Phe 


Pro 


Asn 


Gin 


Asn Gin 


Thr 


Arg 


Asn 


Cys 


Trp 


Gin 






20 










25 








30 






Asn 


Tyr 


Leu Asp 


Phe 


His 


Arg 


Cys 


Gin Lys 


Ala 


Met 


Thr 


Ala 


Lys 


Gly 






35 








40 








45 








Gly Asp 


He Ser 


Val 


Cys 


Glu 


Trp 


Tyr Gin Arg 


Val 


Tyr 


Gin 


Ser 


Leu 




50 








55 








60 










Cys 


Pro 


Thr Ser 


Trp 


Val 


Thr Asp 


Trp Asp 


Glu 


Gin Arg Ala 


Glu 


Gly 


65 








70 








75 










80 


Thr 


Phe 


Pro Gly 


Lys 


He 





















85 



<210> 317 
<211> 105 

<212> PRT 

<213> Homo sapiens 



<400> 317 

Met Cys Val Gly Leu Leu Met Val Ser He Leu Thr Asp He Asn Leu 

15 10 15 

Ser Asn Leu Val Ala Val Gin Tyr Ser Pro Asp Tyr Cys Asn Phe Arg 



wo 02/083898 
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20 








25 










30 




Lys Arg Ser 


Asp Lys 


Asn 


Gin 


ASD 


Ala 


Ser 


Thr 


Phe 


Cvs 


His 


Asn Cys 


35 








40 










45 






Asn Gin Phe 


His Leu 


Cys 


Leu 


Gin 


Tyr 


His 


His 


Lys 


He 


Val 


Leu Pro 


50 






55 










60 








Trp Ser Val 


Leu Ala 


He 


Leu 


Ser 


Gin 


Leu 


Phe 


Leu 


His 


He 


Thr Ser 


65 




70 










75 








80 


Arg lie Arg 


Pro Thr 


He 


Tvr 


Lys 


Ser 


Asn 


Lys 


Pro 


Lvs 


Ser 


He Glu 




85 










90 










95 


lie Leu lie 


Gly Phe 


Val 


Leu 


Asn 


Leu 
















100 








105 














<210> 318 
























<211> 101 
























<212> PRT 
























<213> Homo sapiens 






















<400> 318 
























Met Glu Arg 


Pro Asp Lys 


Ala 


Ala 


Leu 


Asn 


Ala 


Leu 


Gin 


Pro 


Pro Glu 


1 


5 










10 










15 


Phe Arg Asn 


Glu Ser 


Ser 


Leu 


Ala 


Ser 


Thr 


Leu 


Lvs 


Thr 


Leu 


Leu Phe 




20 








25 










30 




Phe Thr Ala 


Leu Met 


He 


Thr 


Val 


Pro 


He 


Gly 


Leu 


Tyr 


Phe 


Thr Thr 


35 








40 










45 






Lys Ser Tyr 


He Phe 


Glu 


Gly Ala 


Leu Gly Met 


Ser 


Asn 


Arg 


Asp Ser 


50 






55 










60 








Tyr Phe Tyr 


Ala Ala 


He 


Val 


Ala 


Val 


Val 


Ala 


Val 


His 


Val 


Val Leu 


65 




70 










75 








80 


Ala Leu Phe 


Val Tyr Val 


Ala 


Trp Asn Glu Gly Ser Arg 


Gin 


Tro Ara 




85 










90 










95 


Glu Gly Lys 


Gin Asp 
























100 






















<210> 319 
























<211> 92 
























<212> PRT 
























<213> Homo sapiens 






















<220> 
























<221> UNSURE 






















<222> 29 
























<223> Xaa = 


Gly, Arg 






















<220> 
























<221> UNSURE 






















<222> 28 
























<223> Xaa « 


Lys, Asn, Arg, 


Ser 


















<400> 319 
























Met Phe Arg 


Asp Phe 


Gly 


Arg Arg Leu Gin Arg Asp Leu 


Lys 


Arg Val 


1 


5 










10 










15 


Val Asp Ala 


Arg Leu 


Arg 


Leu 


Ser 


Glu 


Glu 


Leu 


Xaa 


Xaa 


Gly Arg He 




20 








25 










30 




Lys Pro Lys 


Pro Val 


Glu 


Val 


Gin 


Val 


Val 


Thr 


His 


His 


Met 


Gin Arg 


35 








40 










45 






Tyr Ala Val 


Trp Phe 


Gly 


Gly 


Ser 


Met 


Leu 


Ala 


Ser 


Thr 


Pro 


Glu Phe 


50 






55 










60 








Phe Gin Val 


Cys His 


Thr 


Lys 


Lys 


Asp 


Tyr 


Glu 


Glu 


Tyr 


Gly 


Pro Ser 


65 




70 










75 








80 


lie Cys Arg 


His Asn 


Pro 


Val 


Phe 


Gly Val 


Met 


Ser 









85 90 



<210> 320 



wo 02/083898 

<211> 92 

<212> PRT 

<213> Homo sapiens 

<400> 320 



Met Phe Arg Asp 


Phe Gly Arg 


Arg Leu 


1 




5 










Val Asp Ala 


Ara 


Leu Arg 


Leu 


Ser 


Glu 




20 










25 


Lys Pro Lys 


Pro 


Val 


Glu 


Val 


Gin 


Val 


35 










40 




Tyr Ala Val 


Trp 


Phe Gly Gly 


Ser 


Met 


50 








55 






Phe Gin Val 


Cys 


His 


Thr 


Lys 


Lys 


Asp 


65 






70 








lie Cys Arg 


His 


Asn 


Pro 


Val 


Phe 


Glv 






85 










<210> 321 














<211> 98 














<212> PRT 














<213> Homo sapiens 










<400> 321 














Met Ser Val 


Ser 


Phe 


His 


Thr 


His 


Thr 


1 




5 










Glu Asp Leu 


Gin 


Lys 


Glu 


Met 


Leu 


Glu 




20 










25 


Asp Ala Val 


Gin 


Glu 


Leu 


He 


Lys 


Gin 


35 










40 




Thr Leu Asp Ala 


Thr 


Leu 


Asn 


Val 


He 


50 








55 






Gin Gin Leu 


Arg 


Ser 


Ala 


Pro 


Pro 


Ser 


65 






70 








Arg Ser Ala 


Trp 


Ala 


Glu 


Leu 


Ser 


Ser 






85 










Val Arg 














<210> 322 














<211> 89 














<212> PRT 














<213> Homo sapiens 










<400> 322 














Met Val Ser Gly 


Asp Gly 


Phe 


Leu 


Val 


1 




5 










Leu Gly Pro Arg 


Gin 


Ala 


Val 


Arg 


Pro 




20 










25 


Arg Val Asp Gly 


Gly Gly Arg 


Ser 


Pro 


35 










40 




Arg Ser Arg 


Gin 


Ala 


Arg 


Phe 


Ser 


Pro 


50 








55 






Pro Asp Leu 


Leu 


Arg 


Ser 


Val 


Leu 


Gin 


65 






70 








Gly Val He 


Ala 


Ala Arg 


He 


Ser 


Val 



85 



<210> 323 
<211> 123 
<212> PRT 

<213> Homo sapiens 
<400> 323 
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185 



Gin 


Arg Asp 


Leu Lys 


Arg 


Val 


10 






15 




Glu 


Leu Ser Gly Gly Arg 


He 






30 






Val 


Thr His 


His Met 


Gin 


Arg 






45 






Leu 


Ala Ser 


Thr Pro 


Glu 


Phe 




60 








Tyr 


Glu Glu 


Tyr Gly 


Pro 


Ser 




75 






80 


Val 


Met Ser 








90 











Lys 


Glu Leu Trp Thr Trp 


Met 


10 


15 




Asp 


Val Cys Ala Asp Ser 


Val 




30 




Phe 


Gin Gin Gin Gin Thr 


Ala 




45 




Lys 


Glu Gly Glu Asp Leu 


He 




60 




Leu 


Gly Glu Pro Ser Glu 


Ala 




75 


80 


Gly 


Lys Cys Leu Gly Leu 


Asp 


90 


95 





Ser 


Arg 


Pro Glu 


Ala 


He 


His 


10 








15 




Ser 


Val 


Arg Ala 


Glu 


Ser 


Arg 








30 






Arg 


Glu 


Pro Asp 


Gly 


Arg 


Gly 






45 








Tyr 


Pro 


He Pro 


Ala 


Val 


Glu 






60 








Gin 


Arg 


Leu He 


Ala 


Leu 


Gly 




75 








80 
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186 



Met 


Ala 


Leu 


Asn 


Asn 


Val Ser Leu Ser Ser Gly Asp Gin 


Arg Ser Arg 


1 








5 


10 


15 


Val 


Ala 


Tyr 


Arg 


Ser 


Ser His Gly Asp Leu Arg Pro Arg 


Ala Ser Ala 








20 




25 


30 


Leu 


Ala 


Met 


Val 


Ser 


Gly Asp Gly Phe Leu Val Ser Arg 


Pro Glu Ala 






35 






40 45 




lie 


His 


Leu 


Gly 


Pro 


Arg Gin Ala Val Arg Pro Ser Val 


Arg Ala Glu 




50 








55 60 




Ser 


Arg 


Arg 


Val 


Asp 


Gly Gly Gly Arg Ser Pro Arg Glu 


Pro Asp Gly 


65 










70 75 


80 


Arg 


Gly 


Arg 


Ser 


Arg 


Gin Ala Arg Phe Ser Pro Tyr Pro 


He Pro Ala 










85 


90 


95 


Val 


Glu 


Pro 


Asp 


Leu 


Leu Arg Ser Val Leu Gin Gin Arg 


Leu He Ala 








100 




105 


110 


Leu 


Gly 


Gly 


Val 


He 


Ala Ala Arg He Ser Val 








115 






120 





<210> 324 
<211> 172 
<212> PRT 
<213> Homo sapiens 

<400> 324 



Met 


Ala 


Gly 


Ala 


Ala 


Thr 


Gin 


Ala Ser Leu Glu Ser Ala 


Pro Arg He 


1 








5 






10 


15 


Met 


Arg 


Leu 


Val 


Ala 


Glu 


Cys 


Ser Arg Ser Arg Ala Arg 


Ala Gly Glu 








20 








25 


30 


Leu 


Trp 


Leu 


Pro 


His 


Gly 


Thr 


Val Ala Thr Pro Val Phe 


Met Pro Val 






35 










40 45 




Gly 


Thr 


Gin 


Ala 


Thr 


Met 


Lys 


Gly He Thr Thr Glu Gin 


Leu Asp Ala 




50 










55 


60 




Leu 


Gly 


Cys 


Arg 


He 


Cys 


Leu 


Gly Asn Thr Tyr His Leu 


Gly Leu Arg 


65 










70 




75 


80 


Pro 


Gly 


Pro 


Glu 


Leu 


He 


Gin 


Lys Ala Asn Gly Leu His 


Gly Phe Met 










85 






90 


95 


Asn 


Trp 


Pro 


His 


Asn 


Leu 


Leu 


Thr Asp Ser Gly Gly Phe 


Gin Met Val 








100 








105 


110 


Ser 


Leu 


Val 


Ser 


Leu 


Ser 


Glu 


Val Thr Glu Glu Gly Val 


Arg Phe Arg 






115 










120 125 




Ser 


Pro 


Tyr 


Asp Gly 


Asn 


Glu 


Thr Leu Leu Ser Pro Glu 


Lys Ser Val 




130 










135 


140 




Gin 


He 


Gin 


Asn 


Ala 


Leu Gly Ser Asp He He Met Gin 


Leu Asp Asp 


145 










150 




155 


160 


Val 


Val 


Ser 


Ser 


Thr 


Val 


Thr 


Gly Pro Arg Val Glu 












165 






170 





<210> 325 

<211> 20 

<212> PRT 

• <213> Homo sapiens 

<400> 325 

Met Ala He Lys Phe Leu Glu Val He Lys Pro Phe Cys Val He Leu 
15 10 15 

Gin Leu Thr Ser 
20 

<210> 326 

<211> 23 

<212> PRT 

<213> Homo sapiens 



<400> 326 
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Phe Leu Asn Asn Ser Ser Pro Gin Glu Pro Leu Lys Leu Pro Pro Gly 
15 10 15 

Gin Leu Ser Ser Thr His Arg 
20 



<210> 327 
<211> 45 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 27 

<223> Xaa = Phe, Ser 



<400> 327 

Met Glu Ser Asp Asp Ala Arg Ala 

1 5 

Asp Cys Tyr Lys Pro Thr Glu Ala 

20 

lie Arg Gly Met Gin Lys Leu Ser 
35 40 



Ala Arg Val Lys Glu Leu Leu Val 

10 15 
Phe lie Xaa Gly Leu Leu Asp Lys 
25 30 
Thr Pro Gin Lys Lys 
45 



<210> 328 
<211> 109 
<212> PRT 

<213> Homo sapiens 



<400> 328 






Met 


Ala 


Ala 


Ser 


Ala Ala Arg Gly 


1 








5 


Asn 


Gin 


Pro 


Val 


Ala Phe Val Arg 








20 




Ser 


Gin 


Leu 


Lys 


Glu His Phe Ala 






35 




40 


He 


Leu 


Pro 


Phe 


Asp Lys Glu Thr 




50 






55 


Val 


Gin 


Phe 


Ser 


Ser Glu Glu Gly 


65 








70 


Asn 


His 


He 


He 


Asp Gly Val Lys 










85 


Lys 


Leu 


Pro 


Gin 


Thr Ser Asp Asp 








100 





Ala Ala Ala Leu Arg Arg Ser He 

10 15 
Arg He Pro Trp Thr Ala Ala Ser 
25 30 
Gin Phe Gly His Val Arg Arg Cys 
45 

Gly Phe His Arg Gly Leu Gly Trp 
60 

Leu Arg Asn Ala Leu Gin Gin Glu 

75 80 
Val Gin Val His Thr Arg Arg Pro 

90 95 
Glu Lys Lys Asp Phe 
105 



<210> 329 
<211> 51 
<212> PRT 

<213> Homo sapiens 



<400> 329 

Met Ala Ala Ser Ala Ala Arg Gly 
1 5 
Ser Ala Gly Cys Phe Cys Glu Lys 
20 

Ser Ala Glu Arg Thr Leu Cys Thr 

35 40 
Phe Thr Phe 
50 



Ala Ala Ala Leu His Lys Tyr Gin 

10 15 
Asn Ser Leu Asp Cys Gly Val Glu 
25 30 
Val Arg Pro Cys Gin Lys Val His 
45 



<210> 330 
<211> 99 
<212> PRT 

<213> Homo sapiens 
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<400> 330 

Pro Val Arg Gly Leu Asn Met Ser Glu Phe Val Cys Asn Leu Ser Ala 

15 10 15 

Arg Pro Tyr Val Tyr Asp Leu lie Ala Val Ser Asn His Tyr Gly Ala 

20 25 30 

Met Gly Val Gly His Tyr 
35 

<210> 331 
<211> 13 
<212> PRT 

<213> Homo sapiens 

<400> 331 

Met Leu Gly Phe Ala Lys Asn Trp lie Phe Ser lie Thr 
15 10 

<210> 332 
<211> 13 
<212> PRT 

<213> Homo sapiens 

<400> 332 

Met Leu Gly Phe Ala Lys Asn Trp lie Phe Ser lie Thr 
15 10 

<210> 335 
<211> 13 
<212> PRT 

<213> Homo sapiens 
<400> 335 

Met Leu Gly Phe Ala Lys Asn Trp lie Phe Ser lie Thr 
15 10 

<210> 336 
<211> 13 
<212> PRT 

<213> Homo sapiens 
<400> 336 

Met Leu Gly Phe Ala Lys Asn Trp lie Phe Ser lie Thr 
15 10 

<210> 337 
<211> 109 
<212> PRT 
<213> Homo sapiens 

<400> 337 

Met Ala Ala Ser Ala Ala Arg Gly Ala Ala Ala Leu Arg Arg Ser lie 

15 10 15 

Asn Gin Pro Val Ala Phe Val Arg Arg Xaa Pro Trp Thr Ala Ala Ser 

20 25 30 

Ser Gin Leu Lys Glu His Phe Ala Gin Phe Gly His Val Arg Arg Cys 

35 40 45 

lie Leu Pro Phe Asp Lys Glu Thr Gly Phe His Arg Gly Leu Gly Trp 

50 55 60 

Val Gin Phe Ser Ser Glu Glu Gly Leu Arg Asn Ala Leu Gin Gin Glu 
65 70 75 80 

Asn His He He Asp Gly Val Lys Val Gin Val His Thr Arg Arg Pro 
85 90 95 
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Lys Leu Pro Gin Thr Ser Asp Asp Glu Lys Lys Asp Phe 
100 105 

<210> 338 
<211> 59 
<212> PRT 

<213> Homo sapiens 
<400> 338 

Met Lys Phe Gly Cys Leu Ser Phe Arg 
1 5 

Leu Asn Gly He Lys Thr Val Glu Thr 

20 25 
Ser Gin Arg Asn Cys Thr He Ala Val 

35 40 
Glu Gly Asp Ala Cys Arg Glu Leu Leu 
50 55 



Gin Pro Tyr Ala Gly Phe Val 

10 15 

Arg Trp Arg Pro Leu Leu Ser 

30 

His lie Ala His Arg Asp Trp 
45 

Val Glu 



<210> 339 

<211> 506 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 88. .471 



<220> 

<221> sig_peptide 

<222> 88. .189 

<223> Von Heijne matrix 

score 13.6000003814697 
seq VLLLLSLLLTALA/SS 



<400> 339 

acttatctgc agacttgtag gcagcaactc accctcactc agaggtcttc tggttctgga 60 
aacaactcta gctcagcctt ctccacc atg age etc aga ctt gat acc acc cct 114 

Met Ser Leu Arg Leu Asp Thr Thr Pro 
-30 



tec 


tgt 


aac 


agt 


gcg 


aga 


cca 


ctt 


cat 


gee 


ttg 


cag 


gtg 


ctg 


ctg 


ctt 


162 


Ser 


Cys 


Asn 


Ser 


Ala 


Arg 


Pro 


Leu 


His 


Ala 


Leu 


Gin 


Val 


Leu 


Leu 


Leu 




-25 










-20 










-15 










-10 




ctg 


tea 


ttg 


ctg 


ctg 


act 


get 


ctg 


get 


tec 


tec 


acc 


aaa 


gga 


caa 


act 


210 


Leu 


Ser 


Leu 


Leu 


Leu 
-5 


Thr 


Ala 


Leu 


Ala 


Ser 
1 


Ser 


Thr 


Lys 


Gly 
5 


Gin 


Thr 




aag 


aga 


aac 


ttg 


gcg 


aaa 


ggc 


aaa 


gag 


gaa 


agt 


eta 


gac 


agt 


gac 


ttg 


258 


Lys Arg 


Asn 


Leu 


Ala 


Lys 


Gly 


Lys 


Glu 


Glu 


Ser 


Leu 


Asp 


Ser 


Asp 


Leu 








10 










15 










20 










tat 


get 


gaa 


etc 


cge 


tgc 


atg 


tgt 


ata 


aag 


aca 


acc 


tct 


gga 


att. 


cat 


306 


Tyr 


Ala 


Glu 


Leu 


Arg 


Cys 


Met 


Cys 


He 


Lys 


Thr 


Thr 


Ser 


Gly 


He 


His 






25 










30 










35 












ccc 


aaa 


aac 


ate 


caa 


agt 


ttg 


gaa 


gtg 


ate 


ggg 


aaa 


gga 


acc 


eat 


tgc 


354 


Pro 


Lys 


Asn 


He 


Gin 


Ser 


Leu 


Glu 


Val 


He 


Gly 


Lys 


Gly 


Thr 


His 


Cys 




40 










45 










50 










55 




aac 


caa 


gte 


gaa 


gtg 


ata 


gee 


aca 


ctg 


aag 


gat 


ggg 


agg 


aaa 


ate 


tgc 


402 


Asn 


Gin 


Val 


Glu 


Val 


He 


Ala 


Thr 


Leu 


Lys 


Asp 


Gly 


Arg 


Lys 


He 


Cys 












60 










65 










70 






ctg 


gac 


cca 


gat 


get 


ccc 


aga 


ate 


aag 


aaa 


att 


gta 


cag 


aaa 


aaa 


ttg 


450 


Leu Asp 


Pro 


Asp 


Ala 


Pro 


Arg 


He 


Lys 


Lys 


He 


Val 


Gin 


Lys 


Lys 


Leu 










75 










80 










85 








gca 


ggt 


gat 


gaa 


tct 


get 


gat 


taatttgtte tgtttctgcc aaaettcttt 


501 


Ala Gly 


Asp 


Glu 


Ser 


Ala 


Asp 























90 



wo 02/083898 
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190 



PCT/reOl/00914 
506 



<210> 340 
<211> 523 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 41. .235 

<220> 

<221> sig_peptide 
<222> 41. .94 

<223> Von Heijne matrix 

score 12.1999998092651 
seq LTLAVLFLTGSQA/RH 

<400> 340 

ttagagactg cgagaaggag gtcccccacg gcccttcagg atg aaa get gcg gtg 55 

Met Lys Ala Ala Val 
-15 



ctg 


acc 


ttg 


gcc 


gtg 


etc 


ttc 


ctg 


acg 


ggg 


age 


cag 


get 


egg 


eat ttc 


103 


Leu 


Thr 


Leu 


Ala 
-10 


Val 


Leu 


Phe 


Leu 


Thr 
-5 


Gly 


Ser 


Gin 


Ala 


Arg 
1 


His Phe 




tgg 


cag 


caa 


gat 


gaa 


ecc 


cec 


cag 


age 


ecc 


tgg 


gat 


cga 


gtg 


aag gac 


151 


Trp 


Gin 


Gin 


Asp 


Glu 


Pro 


Pro 


Gin 


Ser 


Pro 


Trp Asp Arg 


Val 


Lys Asp 






5 










10 










15 










ctg 


gcc 


act 


gtg 


tac 


gtg 


gat 


gtg 


etc 


aaa 


gac 


age 


ggc 


aga 


gac tat 


199 


Leu 


Ala 


Thr 


Val 


Tyr 


Val 


Asp 


Val 


Leu 


Lys 


Asp 


Ser 


Gly Arg 


Asp Tyr 




20 










25 










30 








35 




gtg 


tec 


cag 


ttt 


gaa 


gge 


tec 


get 


tgg 


gaa 


aac 


age 


taaacctaaa 


245 


Val 


Ser 


Gin 


Phe 


Glu 
40 


Gly 


Ser 


Ala 


Trp 


Glu 
45 


Asn 


Ser 











gctccttgac aactgggaca gcgtracete ercetteage aakctgegcg aacagctcgg 305 

ccctgtgacc caggagttct gggataacct ggaaaaggag acagagggcc tgaggcagga 365 

gatgagcaag gatctggagg aggtgaaggc caasgtgcag cectacctgg acgaettcca 425 

gaagaagtgg caggaggaga tggagctcta ccgeeagaag gcagctttet taactatcct 485 

aacaagectt ggaecaaatg gaaataaage tttttgat 523 



<210> 341 
<211> 580 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 134.. 373 

<220> 

<221> sig_peptide 
<222> 134 . .211 
<223> Von Heijne matrix 
score 12 

seq LLLALLLPTQIYS/SE 
<400> 341 

atttgtggct ttcctggtat ataaggtctc gccggctcgc egcgctcece aeettgcetg 60 
cgcccgcccg gagccagcgg ttctccaagc acccagcate etgctagacg cgccgegeac 120 
cgacggaggg gac atg ggc aga gea atg gtg gee agg etc ggg ctg ggg 169 
Met Gly Arg Ala Met Val Ala Arg Leu Gly Leu Gly 
-25 -20 -15 

ctg ctg ctg ctg gca ctg etc eta ecc acg cag att tat tec agt gaa 217 
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Leu Leu Leu Leu Ala Leu Leu Leu Pro Thr Gin lie Tyr Ser Ser Glu 

-10 -5 1 

aca aca act gga act tea agt aac tec tec cag agt act tec aac tet 

Thr Thr Thr Gly Thr Ser Ser Asn Ser Ser Gin Ser Thr Ser Asn Ser 

5 10 15 

aat gee acc ace aag gyg get ggt ggt 

Lys Xaa Ala Gly Gly 



act 



ggg ttg gee cea aat cea 
Gly Leu Ala Pro Asn 
20 

gee ctg cag tea aca gee 
Ala Leu Gin Ser Thr Ala Ser Leu Phe Val Val Ser Leu Ser Leu Leu 



Pro Thr Asn Ala Thr Thr 
25 



30 



agt etc tte gtg gte tea etc tet ctt ctg 



35 
cat 



40 



45 



50 



etc tac tet taagagactc aggeeaagaa aegtettcta aatttcccca 
His Leu Tyr Ser 

tcttctaaac ccaatccaaa tggcgtctgg aagtccaatg tggcaaggaa aaacaggtct 
tcatcgaatc tactaattcc acacctttta ttgacacaga aaatgttgag aatcccaaat 
ttgattgatt tgaagaacat gtgagaggkt tgaetagatg atggatg 



265 
313 
361 

413 

473 
533 
580 



<210> 342 
<211> 508 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 76.. 378 



<220> 

<221> sig_peptide 

<222> 76. . 156 

<223> Von Heijne matrix 

score 11.8000001907349 
seq LFLGLLLLPLVVA/FA 



<400> 342 

aattggeeae agagacccag cccgagtttc ccatcgeact gagcactgag atcctgctgg 60 
aagctctgcc gcagc atg age tec gca gee ggg ttc tgc gee tea cge ecc 111 
Met Ser Ser Ala Ala Gly Phe Cys Ala Ser Arg Pro 
-25 -20 



ggg 


ctg 


ctg 


tte 


ctg 


ggg 


ttg 


ctg 


etc 


ctg 


cea 


ctt 


gtg 


gte 


gee 


tte 


159 


Gly 


Leu 


Leu 


Phe 


Leu 


Gly 


Leu 


Leu 


Leu 


Leu 


Pro 


Leu 


Val 


Val 


Ala 


Phe 




-15 










-10 










-5 










1 




gee 


age 


get 


gaa 


get 


gaa 


gaa 


gat 


ggg 


gae 


ctg 


cag 


tgc 


ctg 


tgt 


gtg 


207 


Ala 


Ser 


Ala 


Glu 
5 


Ala 


Glu 


Glu 


Asp 


Gly 

10 


Asp 


Leu 


Gin 


Cys 


Leu 

15 


Cys 


Val 




aag 


acc 


acc 


tec 


cag 


gte 


cgt 


ccc 


agg 


cac 


ate 


ace 


age 


ctg 


gag 


gtg 


255 


Lys 


Thr 


Thr 
20 


Ser 


Gin 


Val 


Arg 


Pro 
25 


Arg 


His 


He 


Thr 


Ser 
30 


Leu 


Glu 


Val 




ate 


aag 


gee 


gga 


ccc 


cac 


tgc 


ccc 


act 


gee 


caa 


ctg 


ata 


gee 


acg 


ctg 


303 


He 


Lys 
35 


Ala 


Gly 


Pro 


His 


Cys 
40 


Pro 


Thr 


Ala 


Gin 


Leu 
45 


He 


Ala 


Thr 


Leu 




aag 


aat 


gga 


agg 


aaa 


att 


tgc 


ttg 


gae 


ctg 


caa 


gee 


eeg 


ctg 


tac 


aag 


351 


Lys 


Asn 


Gly 


Arg 


Lys 


He 


Cys 


Leu 


Asp 


Leu 


Gin 


Ala 


Pro 


Leu 


Tyr 


Lys 




50 










55 










60 










65 




aaa 


ata 


att 


aag 


aaa 


ctt 


ttg 


gag 


agt 


tagctactag ctgcctacgt 




398 


Lys 


He 


He 


Lys 


Lys 
70 


Leu 


Leu 


Glu 


Ser 



















gtgtgcattt gctatatagc atacttcttt tttccagttt caatctaact gtgaaagaac 458 
ttctgatatt tgtgttatce ttatgatttt aaataaacaa aataaatcta 508 



<210> 343 
<211> 459 
<212> DNA 
<213> Homo sapiens 



wo 02/083898 
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<220> 
<221> CDS 
<222> 75. .332 



<220> 

<221> sig_peptide 

<222> 75. . 128 

<223> Von Heijne matrix 

score 11.6999998092651 
seq VLLLLLLVEQAAA/LG 



<400> 343 

gactctttgg tcagggaact gctcgctgag cacagctgca cagtgctgtc agaacggccg 60 



atctccagcc caag 


atg 


att 


cca 


gca 


gtg 


gtc 


ttg 


etc 


tta 


etc 


ctt 


ttg 


110 










Met 


He 


Pro 


Ala 
-15 


Val 


Val 


Leu 


Leu 


Leu 
-10 


Leu 


Leu 


Leu 




gtt 


gaa 


caa 


gca 


gcg 


gcc 


ctg 


gga 


gag 


cct 


cag 


etc 


tgc 


tat 


ate 


ctg 


158 


Val 


Glu 
-5 


Gin 


Ala 


Ala 


Ala 


Leu 
1 


Gly 


Glu 


Pro 


Gin 
5 


Leu 


Cys 


Tyr 


He 


Leu 
10 




gat 


gcc 


ate 


ctg 


ttt 


ctg 


tat 


gga 


att 


gtc 


etc 


acc 


etc 


etc 


tac 


tgt 


206 


Asp Ala 


He 


Leu 


Phe 


Leu 


Tyr 


Gly 


He 


Val 


Leu 


Thr 


Leu 


Leu 


Tyr 


Cys 












15 










20 










25 






cga 


ctg 


aag 


ate 


caa 


gtg 


cga 


aag 


gca 


get 


ata 


ace 


age 


tat 


gag 


aaa 


254 


Arg 


Leu 


Lys 


He 
30 


Gin 


Val 


Arg 


Lys 


Ala 
35 


Ala 


He 


Thr 


Ser 


Tyr 
40 


Glu 


Lys 




tea 


gat 


ggt 


gtt 


tac 


acg 


ggc 


ctg 


age 


ace 


agg 


aac 


cag 


gag 


act 


tac 


302 


Ser 


Asp 


Gly 
45 


Val 


Tyr 


Thr 


Gly 


Leu 

50 


Ser 


Thr 


Arg 


Asn 


Gin 

55 


Glu 


Thr 


Tyr 




gag 


act 


ctg 


aag 


cat 


gag 


aaa 


cca 


cca 


cag 


tagecttaga atagatgegg 


352 


Glu 


Thr 
60 


Leu 


Lys 


His 


Glu 


Lys 
65 


Pro 


Pro 


Gin 

















tcatattctt ctttggcttc tggttcttce agccctcatg tttggcatca eatatgcctg 412 
catgecatta acaccagetg gcectaecee tataatgate ctgtgte 459 

<210> 344 
<211> 553 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 160. .435 



<220> 

<221> sig^peptide 

<222> 160. .231 

<223> Von Heijne matrix 

score 11.6999998092651 
seq FVLGLGLTPPTLA/QD 



<400> 344 

agatgeaaag eaggattcaa aagaacatct ttgcgttttc taccggctec ecatcatcgt 60 
aetagggagg aagaagcggg tgagaaacaa aacttctttc cattgtcctg ccegtttctg 120 
cggacttgtt ctgaggccga ggagcctgtg ttggaagag atg gtg atg ggc ctg 174 

Met Val Met Gly Leu 

-20 

ggc gtt ttg ttg ttg gtc ttc gtg ctg ggt ctg ggt ctg acc cca ccg 222 
Gly Val Leu Leu Leu Val Phe Val Leu Gly Leu Gly Leu Thr Pro Pro 

-15 -10 -5 

acc ctg get cag gat aac tee agg tac aca cac ttc ctg acc cag eac 270 
Thr Leu Ala Gin Asp Asn Ser Arg Tyr Thr His Phe Leu Thr Gin His 
1 5 10 
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tat gat gcc aaa cca cag ggc egg gat gac aga tac tgt gaa age ate 318 
Tyr Asp Ala Lys Pro Gin Gly Arg Asp Asp Arg Tyr Cys Glu Ser lie 

15 20 25 

atg agg aga egg ggc ctg ace tea ccc tgc aaa gac ate aae aca ttt 366 
Met Arg Arg Arg Gly Leu Thr Ser Pro Cys Lys Asp He Asn Thr Phe 
30 35 40 45 

att cat ggc aac aag cge aga tea agg cca tct gtg aaa aca aga atg 414 
He His Gly Asn Lys Arg Arg Ser Arg Pro Ser Val Lys Thr Arg Met 

50 55 60 

gaa acc etc aca gag aaa ace taagaataag caagtcttet ttceaggtca 4 65 

Glu Thr Leu Thr Glu Lys Thr 
65 

ccacttgcaa gctacatgga ggttccectg gectecatge cagtaccgag eeacagcggg 525 
gttcagaaac gttgttgttg cttgtgaa 553 

<210> 345 
<211> 499 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 39. .371 

<220> 

<221> sig_peptide 
<222> 39. .86 
<223> Von Heijne matrix 
score 11.5 

seq ILLSVALLAFSSA/QD 
<400> 345 

ageataaagt tgggagtgae aceagagcet tctgcaag atg ctt ctg att ctg ctg 56 

Met Leu Leu He Leu Leu 
-15 



tea 


gtg 


gcc 


ctg 


ctg 


gcc 


ttc 


age 


tea 


get 


cag 


gac 


tta 


gat 


gaa 


gat 


104 


Ser 


Val 


Ala 


Leu 


Leu 


Ala 


Phe 


Ser 


Ser 


Ala 


Gin 


Asp 


Leu 


Asp 


Glu 


Asp 




-10 










-5 










1 








5 






gtc 


age 


caa 


gaa 


gac 


gtt 


ccc 


ttg 


gta 


ata 


tea 


gat 


gga 


gga 


gac 


tct 


152 


Val 


Ser 


Gin 


Glu 
10 


Asp 


Val 


Pro 


Leu 


Val 
15 


He 


Ser 


Asp 


Gly 


Gly 

20 


Asp 


Ser 




gag 


cag 


ttc 


ata 


gat 


gag 


gag 


cgt 


cag 


gga 


cca 


cet 


ttg 


gga 


gga 


cag 


200 


Glu 


Gin 


Phe 


He 


Asp 


Glu 


Glu 


Arg 


Gin 


Gly 


Pro 


Pro 


Leu 


Gly 


Gly Gin 








25 










30 










35 










caa 


tct 


caa 


ccc 


tct 


get 


ggt 


gat 


ggg 


aac 


cag 


rat 


gat 


ggc 


cet 


cag 


248 


Gin 


Ser 
40 


Gin 


Pro 


Ser 


Ala 


Gly 
45 


Asp 


Gly 


Asn 


Gin 


Xaa 
50 


Asp 


Gly 


Pro 


Gin 




cag 


gga 


cca 


ccc 


caa 


caa 


gga 


ggc 


cag 


cag 


caa 


caa 


ggt 


cca 


cca 


cet 


296 


Gin 


Gly 


Pro 


Pro 


Gin 


Gin 


Gly 


Gly 


Gin 


Gin 


Gin 


Gin 


Gly 


Pro 


Pro 


Pro 




55 










60 










65 










70 




cet 


cag 


gga 


aag 


cca 


car 


gga 


cca 


cet 


ccc 


caa 


ggg 


ggc 


cge 


cca 


caa 


344 


Pro 


Gin 


Gly 


Lys 


Pro 
75 


Gin 


Gly 


Pro 


Pro 


Pro 
80 


Gin 


Gly 


Gly 


Arg 


Pro 
85 


Gin 




gga 


cet 


cca 


cag 


ggg 


cag 


tct 


cet 


cag 


taatctagga ttcaatgaca 




391 


Gly 


Pro 


Pro 


Gin 


Gly 


Gin 


Ser 


Pro 


Gin 



















90 95 



ggaagtgaat aagaagatga cagtgtttca aatgccttga aacataatgt gatcatgetc 451 
taacttcaat ataceaataa aataatcage ttgcaaaaaa aaaaaaaa 499 

<210> 346 
<211> 398 
<212> DNA 
<213> Homo sapiens 



wo 02/083898 



PCT/lBOl/00914 



194 

<220> 
<221> CDS 
<222> 36.. 353 

<220> 

<221> sig_peptide 
<222> 36. .83 
<223> Von Heijne matrix 
score 11.5 

seq ILLSVALLAFSSA/QD 
<400> 346 

ataaagttgg gagtgacacc agagccttct gcaag atg ctt ctg att ctg ctg 53 

Met Leu Leu lie Leu Leu 
-15 

tea gtg gcc ctg ctg gcc ttc age tea get cag gac tta gat gaa gat 101 

Ser Val Ala Leu Leu Ala Phe Ser Ser Ala Gin Asp Leu Asp Glu Asp 

-10 -5 15 

gtc age caa gaa gac gtt ccc ttg gta ata tea gat gga gga gac tct 149 

Val Ser Gin Glu Asp Val Pro Leu Val lie Ser Asp Gly Gly Asp Ser 

10 15 20 

gag cag ttc ata gat gag gag cgt cag gga cca cet ttg gga gga cag 197 
Glu Gin Phe lie Asp Glu Glu Arg Gin Gly Pro Pro Leu Gly Gly Gin 

25 30 35 

caa tct caa ccc tct get ggt gat ggg aac cag gat gat ggc cet cag 24 5 

Gin Ser Gin Pro Ser Ala Gly Asp Gly Asn Gin Asp Asp Gly Pro Gin 

40 45 50 

cag gga cea ccc caa caa gga gge cag cag caa caa ggt cca eca cet 293 
Gin Gly Pro Pro Gin Gin Gly Gly Gin Gin Gin Gin Gly Pro Pro Pro 
55 60 65 70 

cet cag gga aag cca caa gga cca ccc caa cag gga ggc cag tct tgt 341 
Pro Gin Gly Lys Pro Gin Gly Pro Pro Gin Gin Gly Gly Gin Ser Cys 

75 80 85 

tgc tgt gac aag taacagcctt ttttttaaat tgttttttat acaaaaaaaa aaaaa 398 
Cys Cys Asp Lys 
90 

<210> 347 
<211> 525 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 48. .470 

<220> 

<221> sig_peptide 

<222> 48. .119 

<223> Von Heijne matrix 

score 10.8999996185303 

seq LAGLCCLVPVSLA/ED 

<400> 347 

ctcctcagct teaggcacca ccactgacct gggacagtga atcgaea atg ccg tct 56 

Met Pro Ser 

tct gtc teg tgg ggc ate etc ctg ctg gea ggc ctg tgc tgc ctg gtc 104 
Ser Val Ser Trp Gly lie Leu Leu Leu Ala Gly Leu Cys Cys Leu Val 

-20 -15 -10 

cet gtc tec ctg get gag gat ccc cag gga gat get gcc cag aag aca 152 
Pro Val Ser Leu Ala Glu Asp Pro Gin Gly Asp Ala Ala Gin Lys Thr 
-5 15 10 



wo 02/083898 PCT/IBO 1/009 14 



195 



gat 


aca 


tec 


cac 


cat 


gat 


cag 


gat 


cac 


cca 


aec 


ttc 


aae 


aag 


ate 


acc 


200 


Asp 


Thr 


Ser 


His 
15 


His 


Asp 


Gin 


Asp 


His 
20 


Pro 


Thr 


Phe 


Asn 


Lys 
25 


He 


Thr 




ccc 


aae 


ctg 


get 


Qaq 


ttc 


gcc 


ttc 


age 


eta 


tac 


cgc 


cag 


etg 


gca 


cac 


248 


Pro 


Asn 


Leu 


Ala 


Glu 


Phe 


Ala 


Phe 


Ser 


Leu Tyr 


Arg 


Gin 


Leu 


Ala 


His 








30 










35 










40 










cag 


tec 


aae 


age 


ace 


aat 


ate 


ttc 


ttc 


tec 


cca 


gtg 


age 


ate 


get 


aca 


296 


Gin 


Ser 
45 


Asn 


Ser 


Thr 


Asn 


He 
50 


Phe 


Phe 


Ser 


Pro 


Val 
55 


Ser 


He 


Ala 


Thr 




gcc 


ttt 


gea 


atg 


etc 


tec 


ctg 


ggg 


ace 


aag 


get 


gae 


act 


cac 


gat 


gaa 


344 


Ala 


Phe 


Ala 


Met 


Leu 


Ser 


Leu 


Gly 


Thr 


Lys Ala 


Asp 


Thr 


His 


Asp 


Glu 




60 










65 










70 










75 




ate 


ctg 


gag 


ggc 


ctg 


aat 


ttc 


aae 


etc 


aeg 


gag 


att 


ccg 


gag 


get 


cag 


392 


He 


Leu 


Glu 


Gly 


Leu 
80 


Asn 


Phe 


Asn 


Leu 


Thr 
85 


Glu 


He 


Pro 


Glu 


Ala 
90 


Gin 




ate 


cat 


gaa 


ggc 


ttc 


cag 


gaa 


etc 


etc 


egt 


aec 


etc 


aae 


cag 


cca 


gae 


440 


He 


His 


Glu 


Gly 
95 


Phe 


Gin 


Glu 


Leu 


Leu 
100 


Arg 


Thr 


Leu 


Asn 


Gin 
105 


Pro 


Asp 




age 


cag 


etc 


cag 


ctg 


ace 


ace 


ggc 


aag 


aat 


taatgcttte ttaaetaagt 


490 


Ser 


Gin 


Leu 


Gin 


Leu 


Thr 


Thr 


Gly 


Lys 


Asn 

















110 115 
ttactttgag tattaacaag tagcttatae tgagg 525 



<210> 348 
<211> 500 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 91. .462 

<220> 

<221> sig^peptide 

<222> 91., 180 

<223> Von Heijne matrix 

score 9.89999961853027 

seq LLFGFTLVSGTGA/EK 

<400> 348 

acecctaccc cacgecccet cccgcgcgcg cggttaaatc cecgcacctg agcateggct 60 
cacacctgca ccccgcccgg geatagcacc atg cct get tgt cgc eta ggc ccg 114 

Met Pro Ala Cys Arg Leu Gly Pro 
-30 -25 



eta 


gcc 


gcc 


gcc 


etc 


etc 


etc 


age 


ctg 


ctg 


ctg 


ttc 


ggc 


ttc 


acc 


eta 


162 


Leu 


Ala 


Ala 
-20 


Ala 


Leu 


Leu 


Leu 


Ser 
-15 


Leu 


Leu 


Leu 


Phe 


Gly 
-10 


Phe 


Thr 


Leu 




gtc 


tea 


ggc 


aca 


gga 


gea 


gag 


aag 


act 


ggc 


gtg 


tgc 


ccc 


gag 


etc 


cag 


210 


Val 


Ser 
-5 


Gly 


Thr 


Gly 


Ala 


Glu 
1 


Lys 


Thr 


Gly 


Val 
5 


Cys 


Pro 


Glu 


Leu 


Gin 
10 




get 


gae 


cag 


aae 


tgc 


aeg 


caa 


gag 


tgc 


gtc 


teg 


gae 


age 


gaa 


tgc 


gcc 


258 


Ala 


Asp 


Gin 


Asn 


Cys 
15 


Thr 


Gin 


Glu 


Cys 


Val 
20 


Ser 


Asp 


Ser 


Glu 


Cys 
25 


Ala 




gae 


aac 


etc 


aag 


tgc 


tgc 


age 


gcg 


ggc 


tgt 


gee 


ace 


ttc 


tgc 


tet 


ctg 


306 


Asp 


Asn 


Leu 


Lys 
30 


Cys 


Cys 


Ser 


Ala 


Gly 
35 


Cys 


Ala 


Thr 


Phe 


Cys 
40 


Ser 


Leu 




ccc 


aat 


gat 


aag 


gag 


ggt 


tec 


tgc 


ccc 


cag 


gtg 


aac 


att 


aac 


ttt 


ccc 


354 


Pro 


Asn 


Asp 
45 


Lys 


Glu 


Gly 


Ser 


Cys 
50 


Pro 


Gin 


Val 


Asn 


He 
55 


Asn 


Phe 


Pro 




cag 


etc 


ggc 


etc 


tgt 


egg 


gae 


cag 


tgc 


cag 


gtg 


gae 


age 


cag 


tgt 


cct 


402 


Gin 


Leu 
60 


Gly 


Leu 


Cys 


Arg 


Asp 

65 


Gin 


Cys 


Gin 


Val 


Asp 
70 


Ser 


Gin 


Cys 


Pro 




ggc 


cag 


atg 


aaa 


tgc 


tgc 


cgc 


aat 


ggc 


tgt 


ggg 


aag 


gtg 


tec 


tgt 


gtc 


450 
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196 

Gly Gin Met Lys Cys Cys Arg Asn Gly Cys Gly Lys Val Ser Cys Val 

75 80 85 90 

act ccc aat ttc tgagctccag ccaccaccag gctgagcagt gaggagag 500 

Thr Pro Asn Phe 

<210> 349 
<211> 519 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 28. .399 

<220> 

<221> sig_pepticle 

<222> 28. .117 

<223> Von Heijne matrix 

score 9.89999961853027 

seq LLFGFTLVSGTGA/EK 

<400> 349 

acctgcaccc cgcccgggca tagcacc atg cct get tgt cgc eta ggc ccg eta 54 

Met Pro Ala Cys Arg Leu Gly Pro Leu 
-30 -25 



gcc 


gcc 


gcc 


etc 


etc 


etc 


age 


ctg 


ctg 


ctg 


ttc 


ggc 


ttc 


ace 


eta 


gte 


102 


Ala 


Ala 


Ala 


Leu 


Leu 


Leu 


Ser 


Leu 


Leu 


Leu 


Phe 


Gly 


Phe 


Thr 


Leu 


Val 






-20 










-15 










-10 












tea 


ggc 


aca 


gga 


gca 


gag 


aag 


act 


ggc 


gtg 


tge 


ccc 


gag 


etc 


eag 


get 


150 


Ser 


Gly 


Thr 


Gly 


Ala 


Glu 


Lys 


Thr 


Gly 


Val 


Cys 


Pro 


Glu 


Leu 


Gin 


Ala 




-5 










1 








5 










10 






gac 


cag 


aac 


tge 


aeg 


caa 


gag 


tge 


gte 


teg 


gac 


age 


gaa 


tge 


gcc 


gac 


198 


Asp 


Gin 


Asn 


Cys 


Thr 


Gin 


Glu 


Cys 


Val 


Ser 


Asp 


Ser 


Glu 


Cys 


Ala 


Asp 










15 










20 










25 








aac 


ate 


aag 


tge 


tge 


age 


gcg 


ggc 


tgt 


gee 


ace 


ttc 


tge 


tet 


ctg 


ccc 


246 


Asn 


He 


Lys 


Cys 


Cys 


Ser 


Ala 


Gly 


Cys 


Ala 


Thr 


Phe 


Cys 


Ser 


Leu 


Pro 








30 










35 










40 










aat 


gat 


aag 


gag 


ggt 


tec 


tge 


ccc 


eag 


gtg 


aac 


att 


aac 


ttt 


ccc 


cag 


294 


Asn 


Asp 


Lys 


Glu 


Gly 


Ser 


Cys 


Pro 


Gin 


Val 


Asn 


He 


Asn 


Phe 


Pro 


Gin 






45 










50 










55 












etc 


ggc 


etc 


tgt 


egg 


gac 


cag 


tge 


eag 


gtg 


gac 


age 


cag 


tgt 


cct 


ggc 


342 


Leu 


Gly 


Leu 


Cys 


Arg 


Asp 


Gin 


Cys 


Gin 


Val 


Asp 


Ser 


Gin 


Cys 


Pro 


Gly 




60 










65 










70 










75 




eag 


atg 


aaa 


tge 


tge 


cgc 


aat 


ggc 


tgt 


ggg 


aag 


gtg 


tee 


tgt 


gte 


act 


390 


Gin 


Met 


Lys 


Cys 


Cys 


Arg 


Asn 


Gly 


Cys 


Gly 


Lys 


Val 


Ser 


Cys 


Val 


Thr 





80 85 90 



ccc aat ttc tgagctccag ceacaggetg ageagtgagg agagaaagtt 439 
Pro Asn Phe 

tetgcctgge cctgeatctg gttecagecc acctgceetc ccctttttcg gggaetetgt 499 

attceetctt gggctgacea 519 

<210> 350 
<211> 482 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 40. .246 



<220> 

<221> sig_peptide 
<222> 40. .90 
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197 

<223> Von Heijne matrix 
score 9.5 

seq FVFLLFVISLAAA/AH 
<400> 350 

aaaaagtagg acaccggagc tagcagtggc aacccactt atg ttt cct ggc ttt 54 

Met Phe Pro Gly Phe 
-15 



gtt 


ttt 


ttg 


etc 


ttt 


gta 


ata 


agt 


ctt 


get 


get 


get 


get 


cat 


ctt tgg 


102 


Val 


Phe 


Leu 


Leu 


Phe 


Val 


He 


Ser 


Leu 


Ala 


Ala 


Ala 


Ala 


His 


Leu Trp 








-10 










-5 










1 








gtc 


ctg 


gee 


gcc 


ttt 


atg 


ggc 


cgt 


ate 


acc 


gtg 


aag 


gtc 


tgc 


age ttc 


150 


Val 


Leu 


Ala 


Ala 


Phe 


Met 


Gly 


Arg 


He 


Thr 


Val 


Lys 


Val 


Cys 


Ser Phe 




5 










10 










15 








20 




act 


cct 


gag 


gcc 


age 


aag 


acc 


gtg 


age 


eca 


cca 


gaa 


gga 


gcg 


aac aac 


198 


Thr 


Pro 


Glu 


Ala 


Ser 


Lys 


Thr 


Val 


Ser 


Pro 


Pro 


Glu 


Gly 


Ala 


Asn Asn 












25 










30 










35 




tog 


aga 


cgc 


act 


get 


ttt 


aag 


age 


tgt 


aac 


act 


cac 


cac 


aaa 


ggt ctg 


246 


Ser Arg 


Arg 


Thr 


Ala 


Phe 


Lys 


Ser 


Cys 


Asn 


Thr 


His 


His 


Lys 


Gly Leu 





40 45 50 



tagcatcact cetgaagtea gegggaccac gaacctaeca gaaagaagaa actccataca 306 

gatctgaaaa tctgaaggaa caaactcegg acacaccate tttaagaact gtaacactca 366 

ctgcgagggt ccaeggctte atttttgaag tcagccagac caagaaccea ecaattcege 426 

acacaettct getgtttcgt tccattctca aggaaaactg tgatgggtcc attttt 482 

<210> 351 
<211> 462 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 114.. 344 

<220> 

<221> sig_peptide 

<222> 114 . .212 

<223> Von Heijne matrix 

score 9.30000019073486 

seq MVLLSFALTLCSA/FW 

<400> 351 

wetgctcact gctgggtact gttctgctgt gggtgeccag gaagggacta cacctctteg 60 
cagtgtttta tacetttggt aatatcgeat caattgggag taceatcttc etc atg 116 

Met 



gga cca 


gtg 


aaa 


cag 


ctg 


aag 


ega 


atg 


ttt 


gag 


cct 


act 


cgt 


ttg 


att 


164 


Gly Pro 


Val 
-30 


Lys 


Gin 


Leu 


Lys 


Arg 
-25 


Met 


Phe 


Glu 


Pro 


Thr 
-20 


Arg 


Leu 


He 




gca act 


ate 


atg 


gtg 


ctg 


ttg 


agt 


ttt 


gca 


ctt 


acc 


ctg 


tgt 


tet 


gcc 


212 


Ala Thr 


He 


Met 


Val 


Leu 


Leu 


Ser 


Phe 


Ala 


Leu 


Thr 


Leu 


Cys 


Ser 


Ala 




-15 










-10 










-5 












ttt tgg 


tgg 


cat 


aac 


atg 


gga 


ctt 


gca 


ctt 


ate 


ttc 


tgc 


att 


ttg 


cag 


260 


Phe Trp 


Trp 


His 


Asn 


Met 


Gly 


Leu 


Ala 


Leu 


He 


Phe 


Cys 


He 


Leu 


Gin 




1 






5 










10 










15 






tet ttg 


gca 


ttg 


acg 


tgg 


tac 


age 


ctt 


tec 


ttc 


ata 


cca 


ttt 


gca 


agg 


308 


Ser Leu 


Ala 


Leu 

20 


Thr 


Trp 


Tyr 


Ser 


Leu 

25 


Ser 


Phe 


He 


Pro 


Phe 

30 


Ala 


Arg 




gat get 


gtg 


aag 


aag 


tgt 


ttt 


gcc 


gtg 


tgt 


ctt 


gca 


taattcatgg 




354 


Asp Ala 


Val 
35 


Lys 


Lys 


Cys 


Phe 


Ala 
40 


Val 


Cys 


Leu 


Ala 













ccagttttat gaagctttgg aaggcactat ggacagaagc tggtggacag ttttgtaact 414 
atcttcgaaa cctctgtctt acagacatgt gccttttatc ttgcagca 4 62 



■i 



1 
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198 

<210> 352 
<211> 496 
<212> DNA 

<213> Homo sapiens 

<220> 
<22l> CDS 
<222> 29. .466 

<220> 

<22l> sig_peptide 

<222> 29. .106 

<223> Von Heijne matrix 

score 8.89999961853027 

seq GLLWAFCAPGARA/EE 

<400> 352 

cgtctgcttc ggagaccgta agatattg atg acc atg aga tec ctg etc aga 52 

Met Thr Met Arg Ser Leu Leu Arg 
-25 -20 



acc ccc ttc 


ctg 


tgt 


ggc 


ctg 


etc 


tgg gee 


ttt 


tgt 


gee 


cea 


ggc 


gee 


100 


Thr Pro Phe 


Leu 


Cys 


Gly Leu Leu Trp Ala 


Phe 


Cys 


Ala 


Pro 


Gly 


Ala 






-15 










-10 








-5 








agg get gag 


gag 


ect 


gea 


gee 


age 


ttc tec 


caa 


ccc 


ggc 


age 


atg 


ggc 


148 


Arg Ala Glu 


Glu 


Pro 


Ala 


Ala 


Ser 


Phe Ser 


Gin 


Pro 


Gly 


Ser 


Met 


Gly 




1 








5 








10 












ctg gat aag 


aac 


aca 


gtg 


cac 


gac 


caa gag 


cat 


ate 


atg 


gag 


cat 


eta 


196 


Leu Asp Lys 


Asn 


Thr 


Val 


His 


Asp Gin Glu 


His 


He 


Met 


Glu 


His 


Leu 




15 






20 








25 










30 




gaa ggt gtc 


ate 


aac 


aaa 


cea 


gag 


gcg gag 


atg 


teg 


cea 


caa 


gaa 


ttg 


244 


Glu Gly Val 


lie 


Asn 


Lys 


Pro 


Glu 


Ala Glu 


Met 


Ser 


Pro 


Gin 


Glu 


Leu 








35 








40 










45 






cag etc cat 


tae 


ttc 


aaa 


atg 


cat 


gat tat 


gat 


ggc 


aat 


aat 


ttg 


ctt 


292 


Gin Leu His 


Tyr 


Phe 


Lys 


Met 


His 


Asp Tyr Asp 


Gly 


Asn 


Asn 


Leu 


Leu 






50 










55 








60 








gat ggc tta 


gaa 


etc 


tec 


aca 


gee 


ate act 


cat 


gtc 


eat 


aag 


gag 


gaa 


340 


Asp Gly Leu 


Glu 


Leu 


Ser 


Thr 


Ala 


He Thr 


His 


Val 


His 


Lys 


Glu 


Glu 




65 










70 








75 










ggg agt gaa 


cag 


gea 


cea 


eta 


atg 


agt gaa 


gat 


gaa 


ctg 


att 


aac 


ata 


388 


Gly Ser Glu 


Gin 


Ala 


Pro 


Leu 


Met 


Ser Glu Asp 


Glu 


Leu 


He 


Asn 


He 




80 








85 








90 












ata gat ggt 


gtt 


ttg 


aga 


gat 


gat 


gac aag 


aac 


aat 


gat 


gga 


tac 


att 


436 


lie Asp Gly Val 


Leu 


Arg Asp Asp Asp Lys Asn 


Asn 


Asp 


Gly 


Tyr 


He 




95 






100 








105 










110 




gac tat get 


gaa 


ttt 


gea 


aaa 


tea 


ctg cag 


tagatgttat ttggcatctc 


486 


Asp Tyr Ala 


Glu 


Phe 


Ala 


Lys 


Ser 


Leu Gin 




















115 








120 
















ctggttatat 


























496 


<210> 353 




























<211> 499 




























<212> DNA 




























<213> Homo sapiens 
























<220> 




























<221> CDS 




























<222> 155. .340 


























<220> 




























<221> sig peptide 
























<222> 155. .292 


























<223> Von Heijne 


1 matrix 























score 8.60000038146973 



wo 02/083898 

seq AVLLLILFAIVFG/LL 



199 
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<400> 353 

cttttcctct caacagttgc ttctttgagt cagggtgcag ctctggtcac ctggcggcct 60 
cttcagctca gccctccaca aagtgtgagc ctgaaggacc accctgaatt gcccttgtag 120 
gacccagaac agcwaccagc agaatcagat tctc atg gac caa ytg gta ttc aaa 175 

Met Asp Gin Leu Val Phe Lys 
-45 -40 



gag 


aca 


ate 


tgg 


aat 


gat 


gcg 


tte tgg cag aac 


cce tgg gac cag ggg 


223 


Glu 


Thr 


lie 


Trp 


Asn 


Asp 


Ala 


Phe Trp Gin Asn 


Pro Trp Asp Gin Gly 












-35 






-30 


-25 




ggc 


ctg 


gca 


gtg 


att 


ate 


tta 


ttc ate ace get 


gtc ctg ctt etc ate 


271 


Gly Leu 


Ala 


Val 


He 


He 


Leu 


Phe He Thr Ala 


Val Leu Leu Leu He 










-20 








-15 


-10 




tta 


ttt 


gcc 


ate 


gtg 


ttt 


ggt 


tta etc act tee 


aca gaa aac act cag 


319 


Leu 


Phe 


Ala 
-5 


He 


Val 


Phe 


Gly 


Leu Leu Thr Ser 


Thr Glu Asn Thr Gin 
c 




tgt 


gaa 


gcg 


ggt 


gaa 


gag 


gag 


tgacctgaet tgctggggac tgagatggca 


370 


Cys 


Glu 


Ala 


Gly 


Glu 


Glu 


Glu 








10 










15 











gcaggggagg cgagctgace tgcccccatt ccagtggtgg gcccttcgcg gttccctctg 430 
gctcaggggc caageetggt gtcttecttt cecaccagga aaaagtctag taaaatactg 4 90 
tatctggct 499 

<210> 354 
<211> 537 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 13. .498 

<220> 

<221> sig__peptide 

<222> 13. . 153 

<223> Von Heijne matrix 

score 8.30000019073486 

seq LALSSLLSLLLFA/GM 

<400> 354 

tagcgctgae gc atg cgc ata get aac cge ace egg tte age teg cet ttc 51 

Met Arg He Ala Asn Arg Thr Arg Phe Ser Ser Pro Phe 

-45 -40 -35 



ttg 


gcc 


aga 


ggc 


gcc 


ggt 


tgg 


act 


cae 


ggg 


egg 


ggc 


atg 


atg 


gtg 


gtg 


99 


Leu 


Ala 


Arg 


Gly 


Ala 
-30 


Gly 


Trp 


Thr 


His 


Gly 
-25 


Arg 


Gly 


Met 


Met 


Val 
-20 


Val 




ggt 


aeg 


ggc 


ace 


teg 


ctg 


gcg 


etc 


tec 


tec 


etc 


ctg 


tec 


ctg 


ctg 


etc 


147 


Gly 


Thr 


Gly 


Thr 
-15 


Ser 


Leu 


Ala 


Leu 


Ser 
-10 


Ser 


Leu 


Leu 


Ser 


Leu 

-5 


Leu 


Leu 




ttt 


get 


ggg 


atg 


cag 


atg 


tac 


age 


egt 


cag 


ctg 


gcc 


tec 


ace 


gag 


tgg 


195 


Phe 


Ala 


Gly 
1 


Met 


Gin 


Met 


Tyr 
5 


Ser 


Arg 


Gin 


Leu 


Ala 
10 


Ser 


Thr 


Glu 


Trp 




etc 


ace 


ate 


cag 


ggc 


ggc 


ctg 


ctt 


ggt 


teg 


ggt 


etc 


ttc 


gtg 


ttc 


teg 


243 


Leu 


Thr 


He 


Gin 


Gly 


Gly 


Leu 


Leu 


Gly 


Ser 


Gly 


Leu 


Phe 


Val 


Phe 


Ser 




15 










20 










25 










30 




etc 


act 


gee 


ttc 


aat 


aat 


ctg 


gag 


aat 


ctt 


gtc 


ttt 


ggc 


aaa 


gga 


ttc 


291 


Leu 


Thr 


Ala 


Phe 


Asn 
35 


Asn 


Leu 


Glu 


Asn 


Leu 
40 


Val 


Phe 


Gly 


Lys 


Gly 
45 


Phe 




caa 


gca 


aag 


ate 


tte 


cet 


gag 


att 


etc 


Ctg 


tge 


etc 


ctg 


ttg 


get 


etc 


339 


Gin 


Ala 


Lys 


He 
50 


Phe 


Pro 


Glu 


He 


Leu 
55 


Leu 


Cys 


Leu 


Leu 


Leu 
60 


Ala 


Leu 




ttt 


gca 


tet 


ggc 


etc 


ate 


cac 


cga 


gtc 


tgt 


gtc 


ace 


acc 


tge 


tte 


ate 


387 
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200 



435 



Phe 


Ala 


Ser 


Gly 


Leu 


He 


His 


Arg 


Val 


Cys 


Val 


Thr Thr 


Cys 


Phe He 






65 










70 








75 






ttc 


tec 


atg 


gtt 


ggt 


ctg 


tac 


tac 


ate 


aac 


aag 


ate tec 


tec 


ace ctg 


Phe 


Ser 


Met 


Val 


Gly 


Leu 


Tyr 


Tyr 


He 


Asn 


Lys 


He Ser 


Ser 


Thr Leu 




80 










85 










90 






tac 


eag 


gca 


gca 


get 


cca 


gtc 


etc 


aca 


cca 


gee 


aag gtc 


aca 


ggc aag 


Tyr 


Gin 


Ala 


Ala 


Ala 


Pro 


Val 


Leu 


Thr 


Pro 


Ala 


Lys Val 


Thr 


Gly Lys 


95 










100 










105 






110 


age 


aag 


aag 


aga 


aac 


tgacectgaa i 


tgttcaataa agttgattct 


: ttgeaaaaa 


Ser 


Lys Lys Arg 


Asn 




























115 





















483 



537 



<210> 355 
<211> 514 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 132. .401 



<220> 

<221> sig_peptide 

<222> 132. .236 

<223> Von Heijne matrix 

score 8.30000019073486 
seq LGLGLVTLTASWA/AL 



<400> 355 

cettcageke ecggccgaga geeeggtggc tgcgactgak agcetggtta ctgccggcac 60 

ctttggatcc tcggccaatt ctctgcttcg cctacacttg gcgtgcggtg taccagecte 120 

cccgegtcac c atg gaa aca gga gcc tct gca tec ate cca gag ctg ate 170. 

Met Glu Thr Gly Ala Ser Ala Ser He Pro Glu Leu He 

-35 -30 -25 

218 



266 



tgt 


gaa 


get 


atg 


aga 


aga 


ate 


tgg 


age 


ctg 


gga 


ttg 


ggg 


ttg 


gtg 


act 


Cys 


Glu 


Ala 


Met 


Arg 


Arg 


He 


Trp 


Ser 


Leu 


Gly 


Leu 


Gly 


Leu 


Val 


Thr 






-20 










-15 










-10 








ctg 


a eg 


gee 


age 


tgg 


gca 


get 


ett 


tte 


cae 


gat 


ggc 


ttt 


geg 


gtt 


ett 


Leu 


Thr 


Ala 


Ser 


Trp 


Ala 


Ala 


Leu 


Phe 


His 


Asp 


Gly 


Phe 


Ala 


Val 


Leu 




-5 










1 








5 










10 


gga 


gga 


aac 


att 


gtg 


age 


gat 


etc 


age 


aca 


gta 


aga 


ttt 


gtt 


gca 


caa 


Gly 


Gly 


Asn 


He 


Val 


Ser 


Asp 


Leu 


Ser 


Thr 


Val 


Arg 


Phe 


Val 


Ala 


Gin 










15 










20 










25 




cag 


eag 


cae 


tte 


cag 


etc 


ett 


gae 


gtg 


tgg 


ace 


agg 


aat 


tte 


egg 


aag 


Gin 


Gin 


His 


Phe 


Gin 


Leu 


Leu 


Asp 


Val 


Trp 


Thr 


Arg 


Asn 


Phe 


Arg 


Lys 








30 










35 










40 






cca 


ctg 


ggc 


age 


atg 


tgc 


ttt 


gtt 


tte 


ttg 


ttg 


etc 


cca 


taatcaatgt 


Pro 


Leu 


Gly 


Ser 


Met 


Cys 


Phe 


Val 


Phe 


Leu 


Leu 


Leu 


Pro 












45 










50 










55 









314 



362 



411 



tgggcateaa gatetaacee ttgaaecttc cacgaaecet gttgtcaata cctctgggtt 471 
tecgceggtt aegettaatt ttgaeatate ggteagaetg gtg 514 



<210> 356 
<211> 589 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 41. .223 



<220> 

<221> sig__peptide 
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201 

<222> 41. .115 

<223> Von Heijne matrix 

score 8.10000038146973 

seq ATLLLVLCLQLGA/NK 



<400> 356 

accacttctc tgggacacat tgccttctgt tttctccagc atg cgc ttg etc cag 

Met Arg Leu Leu Gin 
-25 

etc ctg ttc agg gcc age cct gcc acc ctg etc ctg gtt etc tgc etg 
Leu Leu Phe Arg Ala Ser Pro Ala Thr Leu Leu Leu Val Leu Cys Leu 



-20 



-15 



-10 



-5 



cag ttg ggg gcc aac aaa get cag gac aac act egg aag ate ata ata 
Gin Leu Gly Ala Asn Lys Ala Gin Asp Asn Thr Arg Lys lie lie lie 

15 10 
aag aat ttt gac att ccc aag tea gta cgt cca aat gac gaa gya ctg 
Lys Asn Phe Asp lie Pro Lys Ser Val Arg Pro Asn Asp Glu Xaa Leu 

15 20 25 

cag tgc ttg cag ttc aaa cag aat tgaaagaatg catggtggtt aaaacttaec 
Gin Cys Leu Gin Phe Lys Gin Asn 

30 35 
tcattagcag catecctcta caaggtgcat ttaaetataa gtatactgcc tgcctatgtg 
acgacaatcc aaaaaccttc tactgggact tttacaccaa cagaactgtg caaattgcag 
ccgtcgttga tgttattegg gaattaggca tetgcctgat gatgetgctg taatccccat 
caaaaacaac cggttttata ctattgaaat cetaaaggta gaataatgga agccctgtct 
gtttgccaea cceaggtgat ttcctctaaa gaaacttggc tggaatttct gctgtggtct 
ataaaataaa cttettaaca tacaaaaaaa aaaaaa 



55 



103 



151 



199 



253 



313 
373 
433 
4 93 
553 
589 



<210> 357 
<211> 465 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 19.. 378 



<220> 

<221> sig_peptide 

<222> 19.. 114 

<223> Von Heijne matrix 

score 7.59999990463257 
seq VTLGIGFFALASA/LW 



<400> 357 



cctaatcgraw aaagtcat 


atg 


get 


tea 


cct 


aaa 


ggt 


ttt 


ttt 


aat 


tac 


ttg 












Met 


Ala 


Ser 


Pro 


Lys 


Gly 


Phe 


Phe 


Asn 


Tyr 


Leu 
















-30 










-25 








acc 


tat 


ttt 


ctt 


get 


get 


ggt 


get 


gtc 


act 


ttg 


gga 


att 


ggt 


ttc 


ttt 


Thr 


Tyr 


Phe 


Leu 


Ala 


Ala 


Gly 


Ala 


Val 


Thr 


Leu 


Gly 


He 


Gly 


Phe 


Phe 




-20 










-15 










-10 










get 


ttg 


gea 


tea 


get 


ttg 


tgg 


ttc 


etg 


att 


tgc 


aaa 


cga 


aga 


gaa 


ata 


Ala 


Leu 


Ala 


Ser 


Ala 


Leu 


Trp 


Phe 


Leu 


He 


Cys 


Lys 


Arg 


Arg 


Glu 


He 


-5 










1 








5 










10 




ttt 


caa 


aat 


tec 


aaa 


ttt 


aaa 


gea 


att 


gat 


gag 


aga 


tgc 


agg 


caa 


aga 


Phe 


Gin 


Asn 


Ser 


Lys 


Phe 


Lys 


Ala 


He 


Asp 


Glu 


Arg 


Cys 


Arg 


Gin 


Arg 








15 










20 










25 






cca 


tea 


atg 


gcg 


aaa 


att 


aaa 


tct 


eat 


tct 


cag 


tgt 


gtt 


ttt 


att 


tct 


Pro 


Ser 


Met 


Ala 


Lys 


lie 


Lys 


Ser 


His 


Ser 


Gin 


Cys 


Val 


Phe 


He 


Ser 






30 










35 










40 








cga 


aat 


ttt 


cat 


act 


ggg 


aga 


ttc 


caa 


tta 


cag 


caa 


tta 


aag 


ate 


att 


Arg Asn 


Phe 


His 


Thr 


Gly 


Arg 


Phe 


Gin 


Leu 


Gin 


Gin 


Leu 


Lys 


He 


He 




45 










50 










55 
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202 

eta aag atg aac ccc aac ttg caa caa aaa ata tea ttt gtg ate cot 339 

Leu Lys Met Asn Pro Asn Leu Gin Gin Lys lie Ser Phe Val lie Pro 

60 65 70 75 

cag aga oca get cca caa caa ate gca gca gtg tta cat taagcttatc 388 

Gin Arg Pro Ala Pro Gin Gin He Ala Ala Val Leu His 

80 85 
aacattacca tctgattctt attacagcca aagtatagaa gcagctgatg actggttttc 448 
tgatgattct. ctagtga 465 

<210> 358 

<211> 438 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS . 
<222> 84. .317 

<220> 

<221> sig__peptide 

<222> 84. .140 

<223> Von Heijne matrix 

score 7.59999990463257 

seq ALVLALMISMISA/DS 

<400> 358 

atagaaaagg acatctcttg agacttcact teagcttcac tgacttcttg actctectct 60 
tgagtaaaag gactcagcca act atg aag ttt ttt gtc ttt get tta gtc ttg 113 

Met Lys Phe Phe Val Phe Ala Leu Val Leu 
-15 -10 
get etc atg att tec atg att age get gat tea eat gaa aag aga eat 161 
Ala Leu Met He Ser Met He Ser Ala Asp Ser His Glu Lys Arg His 

-5 15 
cat ggg tat aga aga aaa ttc cat gaa aag cat cat tea tac cat ate 209 
His Gly Tyr Arg Arg Lys Phe His Glu Lys His His Ser Tyr His He 

10 15 20 

aca eta eta cca ctt ttt gaa gaa tea tea aag age aat gca aat gaa 257 
Thr Leu Leu Pro Leu Phe Glu Glu Ser Ser Lys Ser Asn Ala Asn Glu 

25 30 35 

aaa cac tat aat tta ctg tat act ctt tgt ttc agg ata ctt gee ttt 305 
Lys His Tyr Asn Leu Leu Tyr Thr Leu Cys Phe Arg He Leu Ala Phe 
40 45 50 55 

tea att gtc act tgatgatata attgcaattt aaactgttaa gctgtgttca 357 
Ser He Val Thr 

gtactgtttc tgaataatag aaatcacttc tctaaaagca ataaatttca agcacatttt 417 
taaataaaaa aaatmtmaaa a 438 

<210> 359 
<211> 537 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 57. .317 

<220> 

<221> sig_peptide 

<222> 57. .116 

<223> Von Heijne matrix 

score 7.40000009536743 

seq LLVFLAGFPVLDA/ND 
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203 

<400> 359 

gatttctccc ggaacctctg ctcagcctgg tgaaccacac aggccagcgc tctgac atg 59 

Met 
-20 



cag 


aag 


gtg 


acc 


ctg 


ggc 


ctg 


ctt 


gtg 


ttc 


Ctg 


gca 


ggc 


ttt 


ect 


gtc 


107 


Gin 


Lys 


Val 


Thr 


Leu 
-15 


Gly 


Leu 


Leu 


Val 


Phe 
-10 


Leu 


Ala 


Gly 


Phe 


Pro 
-5 


Val 




ctg 


gac 


gcc 


aat 


gac 


eta 


gaa 


gat 


aaa 


aac 


agt 


ect 


ttc 


tac 


tat 


gac 


155 


Leu 


Asp 


Ala 


Asn 
1 


Asp 


Leu 


Glu 


Asp 
5 


Lys 


Asn 


Ser 


Pro 


Phe 
10 


Tyr 


Tyr 


Asp 




tgg 


cac 


age 


etc 


cag 


gtt 


ggc 


ggg 


etc 


ate 


tge 


get 


ggg 


gtt 


ctg 


tge 


203 


Trp 


His 
15 


Ser 


Leu 


Gin 


Val 


Gly 

20 


Gly 


Leu 


He 


Cys 


Ala 

25 


Gly 


Val 


Leu 


Cys 




gcc 


atg 


ggc 


ate 


ate 


ate 


gtc 


atg 


agt 


gea 


aaa 


tge 


aaa 


tge 


aag 


ttt 


251 


Ala 


Met 


Gly 


He 


He 


He 


Val 


Met 


Ser 


Ala 


Lys 


Cys 


Lys 


Cys 


Lys 


Phe 




30 










35 










40 










45 




ggc 


cag 


aag 


tec 


ggt 


cac 


cat 


eca 


ggg 


gag 


act 


eca 


ect 


etc 


ate 


ace 


299 


Gly 


Gin 


Lys 


Ser 


Gly 
50 


His 


His 


Pro 


Gly 


Glu 
55 


Thr 


Pro 


Pro 


Leu 


He 
60 


Thr 





eca ggc tea gcc caa age tgatgaggae agaccagetg aaattgggtg 347 
Pro Gly Ser Ala Gin Ser 

65 



gaggaecgtt ctetgtccce aggtcctgte tctgeacaga aacttgaaet ecaggatgga 407 

attettecte ctctgctggg aetectttgc atggeagggc teatetcaec tetcgeaaga 467 

gggtctettt gtteaatttt ttttaatcta aaatgattgt gectctgece aaaaaaaaaa 527 

aaaaaaaget 537 

<210> 360 

<211> 499 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 69. .383 

<220> 

<221> sig_peptide 

<222> 69. .125 

<223> Von Heijne matrix 

score 7.19999980926514 

seq VSIMLLLVTVSDC/AV 

<400> 360 

acaaggctga gcgggaggaa gcgagaggca tctaagcagg cagtgttttg ccttcacccc 60 
aagtgaee atg aga ggt gee aeg ega gtc tea ate atg etc etc eta gta 110 
Met Arg Gly Ala Thr Arg Val Ser He Met Leu Leu Leu Val 
-15 -10 



act 


gtg 


tet 


gac 


tgt 


get 


gtg 


ate 


aca 


ggg 


gee 


tgt 


gag 


ega 


gat 


gtc 


158 


Thr 


Val 


Ser 


Asp 


Cys 


Ala 


Val 


He 


Thr 


Gly 


Ala 


Cys 


Glu 


Arg 


Asp 


Val 




-5 










1 








5 










10 






cag 


tgt 


ggg 


gca 


ggc 


ace 


tge 


tgt 


gcc 


ate 


age 


ctg 


tgg 


ctt 


ega 


ggg 


206 


Gin 


Cys 


Gly 


Ala 


Gly 


Thr 


Cys 


Cys 


Ala 


He 


Ser 


Leu 


Trp 


Leu 


Arg 


Gly 










15 










20 










25 








ctg 


egg 


atg 


tge 


acc 


ecg 


ctg 


ggg 


egg 


waa 


ggc 


gag 


gag 


tge 


cac 


ecc 


254 


Leu 


Arg 


Met 


Cys 


Thr 


Pro 


Leu 


Gly 


Arg 


Xaa 


Gly 


Glu 


Glu 


Cys 


His 


Pro 








30 










35 










40 










ggc 


age 


cac 


aag 


ate 


ecc 


ttc 


ttc 


agg 


aaa 


cge 


aag 


cac 


cac 


ace 


tgt 


302 


Gly Ser 


His 


Lys 


He 


Pro 


Phe 


Phe 


Arg 


Lys 


Arg 


Lys 


His 


His 


Thr 


Cys 






45 










50 










55 












ect 


tge 


ttg 


ecc 


aac 


ctg 


ctg 


tge 


tec 


agg 


ttc 


ccg 


gac 


ggc 


agg 


tac 


350 


Pro 


Cys 


Leu 


Pro 


Asn 


Leu 


Leu 


Cys 


Ser 


Arg 


Phe 


Pro 


Asp 


Gly 


Arg 


Tyr 




60 










65 










70 










75 
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204 

cgc tgc tec atg gac ttg aag aac ate aat ttt taggcgcttg cctggtctca 403 
Arg Cys Ser Met Asp Leu Lys Asn lie Asn Phe 

80 85 

ggatacccac catccttttc ctgagcacag cctggatttt tatttctgcc atgaaaccca 4 63 

gctcccatga ctctcccagt ccctasactg actacc 4 99 

<210> 361 
<211> 479 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 129.. 434 

<220> 

<221> sig_peptide 
<222> 129.. 236 
<223> Von Heijne matrix 
score 7 

seq PFFLITLVGVWA/W 
<400> 361 

tccggggccc gggagccaac cgagggcgtt cctgtcgggg ctgcagcggc gggagggagc 60 
ccagtggagg cgccctcccg aagcgccact gcycatgctg accacccagc cctccggctg 120 
ctgatgtc atg agt aac acc act gtg ccc aat gcc ccc cag gcc aac age 170 
Met Ser Asn Thr Thr Val Pro Asn Ala Pro Gin Ala Asn Ser 
-35 -30 -25 

gac tec atg gtg ggc tat gtg ttg ggg ece ttc ttc etc ate ace ctg 218 
Asp Ser Met Val Gly Tyr Val Leu Gly Pro Phe Phe Leu lie Thr Leu 

-20 -15 -10 

gtc ggg gtg gtg gtg get gtg gta atg tat gta cag aag aaa aag egg 266 
Val Gly Val Val Val Ala Val Val Met Tyr Val Gin Lys Lys Lys Arg 

"5 15 10 

gtg gac egg ctg cgc cat cac ctg etc ccc atg tac age tat gac cea 314 
Val Asp Arg Leu Arg His His Leu Leu Pro Met Tyr Ser Tyr Asp Pro 

15 20 25 

get gag gaa ctg ert gag get gag eag gag ctg etc tet gac atg gga 362 
Ala Glu Glu Leu Xaa Glu Ala Glu Gin Glu Leu Leu Ser Asp Met Gly 

30 35 40 

gac ccc aag gtg gta cat ggc tgg cag agt ggc tac cag cac aag egg 410 
Asp Pro Lys Val Val His Gly Trp Gin Ser Gly Tyr Gin His Lys Arg 

45 50 55 

atg cca ctg ctg gat gtc aag acg tgacetgace eccttgeece accctteaga 4 64 

Met Pro Leu Leu Asp Val Lys Thr 

60 65 
gcctggggtc ctgga 479 

<210> 362 
<211> 453 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 30.. 440 

<220> 

<221> sig_pepticie 
<222> 30. .83 
<223> Von Heijne matrix 
score 7 

seq VLGLVLLSVTVQG/KV 



wo 02/083898 
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205 

<400> 362 



cagcctagca ctctgaccta gcagtcaac 


atg 


aag 


get 


etc 


att 


gtt 


ctg 


ggg 


53 


















Met 


Lys 


Ala 


Leu 
-15 


He 


Val 


Leu 


Gly 




ctt 


gtc 


etc 


ctt 


tct 


gtt 


aeg 


gtc 


cag 


ggc 


aag 


gtc 


ttt 


gaa 


agg 


tgt 


101 


Leu 


Val 


Leu 


Leu 


Ser 


Val 


Thr 


Val 


Gin 


Gly 


Lys 


Val 


Phe 


Glu 


Arg 


Cys 




-10 










-5 










1 








5 






gag 


ttg 


gee 


aga 


act 


ctg 


aaa 


aga 


ttg 


gga 


atg 


gat 


ggc 


tac 


agg 


gga 


149 


Glu 


Leu 


Ala 


Arg 


Thr 


Leu 


Lys 


Arg 


Leu 


Gly 


Met 


Asp 


Gly 


Tyr 


Arg 


Gly 










10 










15 










20 








ate 


age 


eta 


gca 


aac 


tgq 


atg 


tgt 


ttg 


gee 


aaa 


tgg 


gag 


agt 


ggt 


tac 


197 


lie 


Ser 


Leu 


Ala 


Asn 


Trp 


Met 


Cvs 


Leu 


Ala 


Lys 


Trp 


Glu 


Ser 


Gly 


Tyr 








25 










30 










35 










aac 


aca 


caa 


CJCt 


aea 


aac 


tac 


aat 


get 


gga 


gae 


aga 


age 


act 


gat 


tat 


245 


Asn 


Thr 


Arg 


Ala 


Thr 


Asn 


Tvr 


Asn 


Ala 


Gly 


Asp 


Arg 


Ser 


Thr 


Asp 


Tyr 






40 










45 










50 












ggg 


ata 


ttt 


cag 


ate 


aat 


age 


cgc 


tac 


tgg 


tgt 


aat 


gat 


ggc 


aaa 


ace 


293 


Gly 


lie 


Phe 


Gin 


He 


Asn 


Ser 


Arg 


Tyr 


Trp 


Cys 


Asn 


Asp 


Gly 


Lys 


Thr 




55 










60 










65 










70 




cca 


gga 


gca 


gtt 


aat 


gee 


tgt 


cat 


tta 


tec 


tgc 


agt 


ggg 


tgg 


eat 


gga 


341 


Pro 


Gly 


Ala 


Val 


Asn 


Ala 


Cys 


His 


Leu 


Ser 


Cys 


Ser 


Gly 


Trp 


His 


Gly 












75 










80 










85 






gaa 


ate 


gtt 


gtc 


aaa 


aca 


gag 


atg 


tec 


gtc 


agt 


atg 


ttc 


aag 


gtt 


gtg 


389 


Glu 


He 


Val 


Val 


Lys 


Thr 


Glu 


Met 


Ser 


Val 


Ser 


Met 


Phe 


Lys 


Val 


Val 










90 










95 










100 








gag 


tgt 


aac 


tec 


aga 


att 


ttc 


ctt 


ctt 


cag 


etc 


att 


ttg 


tct 


etc 


tea 


437 


Glu 


Cys 


Asn 


Ser 


Arg 


He 


Phe 


Leu 


Leu 


Gin 


Leu 


He 


Leu 


Ser 


Leu 


Ser 





105 110 115 

cat taaggggagt agg 453 
His 



<210> 363 

<211> 417 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 82. .384 

<220> 

<221> sig_peptide 

<222> 82. .168 

<223> Von Heijne matrix 

score 6.90000009536743 

seq VCMCVFSTQGALG/EM 

<220> 

<221> misc_feature 
<222> 404 

<223> n=a, g, c or t 
<400> 363 

gctacctcct ggtgtctgtt gcctggtgtg tgggaaaccc 
gtcttctgtg ttgactgttg a atg agg gaa gag aaa 

Met Arg Glu Glu Lys 
-25 

gag aga gag agt gtg tgt gtg tgt atg tgt gtg 
Glu Arg Glu Ser Val Cys Val Cys Met Cys Val 

-15 -10 
get ttg ggg gaa atg get gca cae ttc ata gat 
Ala Leu Gly Glu Met Ala Ala His Phe He Asp 



ceacacgctg gaeeacagaa 60 
aag cec ttt gag aga 111 
Lys Pro Phe Glu Arg 

-20 

ttt tec act caa gga 159 
Phe Ser Thr Gin Gly 
-5 

gaa aag ctg agg ccc 207 
Glu Lys Leu Arg Pro 



wo 02/083898 

1 5 
agt gag ggg aat ggt cac aga ggg aca 
Ser Glu Gly Asn Gly His Arg Gly Thr 

15 20 
caa gag tec tac ate ecc tee aee gcc 
Gin Glu Ser Tyr lie Pro Ser Thr Ala 
30 35 
gag ctg ctg cac aag aac ttg ccc gtg 
Glu Leu Leu His Lys Asn Leu Pro Val 
50 

ccc tct gac etc gcg ate eca gcc get 
Pro Ser Asp Leu Ala lie Pro Ala Ala 

65 70 
gctgccctcg ctg 



PCT/IBO 1/009 14 

206 

10 

ctg gat age ctg agt teg gae 255 
Leu Asp Ser Leu Ser Ser Asp 
25 

gac ccc ace cag get gge cet 303 
Asp Pro Thr Gin Ala Gly Pro 

40 45 
acc tec cga tec cag ecg ctg 351 
Thr Ser Arg Ser Gin Pro Leu 
55 60 
gee etc tgaceteeeg ateeeagecn 404 
Ala Leu 

417 



<210> 364 
<211> 529 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 114. .497 



<220> 

<221> sig_peptide 

<222> 114 . .191 

<223> Von Heijne matrix 

score 6.80000019073486 
seq VFLSLFLIQLLIS/FS 



<400> 364 

cteagcacce agggeggtgg taggtcaeag tctctgggeg ggteteagtg tecaacaetg 60 
tagctggtgc ctgceaggtt eccagtgget ggggteaeea ggtctgaaga gag atg 116 

Met 



tgc 


tgg 


ctg 


egg 


gca 


tgg 


gge 


cag 


ate 


etc 


ctg 


eca 


gtt 


ttc 


etc 


tec 


164 


Cys 


Trp 


Leu 


Arg 


Ala 


Trp 


Gly 


Gin 


He 


Leu 


Leu 


Pro 


Val 


Phe 


Leu 


Ser 




-25 










-20 










-15 










-10 




etc 


ttt 


etc 


ate 


caa 


ttg 


ett 


ate 


age 


ttc 


tea 


gag 


aat 


ggt 


ttt 


ate 


212 


Leu 


Phe 


Leu 


lie 


Gin 

-5 


Leu 


Leu 


He 


Ser 


Phe 
1 


Ser 


Glu 


Asn 


Gly 

5 


Phe 


He 




cac 


age 


ccc 


agg 


aac 


aat 


cag 


aaa 


eca 


aga 


gat 


ggg 


aat 


gaa 


gag 


gaa 


260 


His 


Ser 


Pro 


Arg 


Asn 


Asn 


Gin 


Lys 


Pro 


Arg 


Asp Gly 


Asn 


Glu 


Glu 


Glu 








10 










15 










20 










tgt 


get 


gta 


aag 


aag 


agt 


tgt 


caa 


ttg 


tgc 


aca 


gaa 


gat 


aag 


aaa 


tgt 


308 


Cys 


Ala 


Val 


Lys 


Lys 


Ser 


Cys 


Gin 


Leu 


Cys 


Thr 


Glu 


Asp 


Lys 


Lys 


Cys 






25 










30 










35 












gtt 


tgg 


tgt 


agt 


gaa 


gaa 


aaa 


gca 


tgc 


aaa 


aaa 


tac 


tgt 


ttt 


ccc 


tat 


356 


Val 


Trp 


Cys 


Ser 


Glu 


Glu 


Lys 


Ala 


Cys 


Lys 


Lys 


Tyr 


Cys 


Phe 


Pro 


Tyr 




40 










45 










50 










55 




ttc 


ggt 


tgt 


cga 


ttc 


agt 


tct 


ata 


tat 


tgg 


tta 


aac 


tgt 


aaa 


gtt 


gac 


404 


Phe 


Gly 


Cys 


Arg 


Phe 


Ser 


Ser 


He 


Tyr 


Trp 


Leu 


Asn 


Cys 


Lys 


Val 


Asp 












60 










65 










70 






atg 


ttt 


gga 


ate 


atg 


atg 


ett 


eta 


etc 


att 


gea 


gta 


tta 


att 


aca 


gga 


452 


Met 


Phe 


Gly 


lie 


Met 


Met 


Leu 


Leu 


Leu 


He 


Ala 


Val 


Leu 


He 


Thr 


Gly 










75 










80 










85 








ttc 


gtt 


tgg 


tac 


tgc 


tgc 


gcc 


tat 


cac 


ttt 


tac 


ctg 


cag 


gat 


ata 




4 97 


Phe 


Val 


Trp 


Tyr 


Cys 


Cys 


Ala 


Tyr 


His 


Phe 


Tyr 


Leu 


Gin 


Asp 


He 







90 95 100 

tgatgaatag ataattgraa agagatcetc ea 529 



<210> 365 
<211> 505 



1 



wo 02/083898 PCT/IBO 1/009 14 

207 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 71. .496 

<220> 

<221> sig_peptide 
<222> 71. .142 
<223> Von Heijne matrix 
score 6.5 

seq LSICLSAVATATG/AE 

<400> 365 

gacgaggacs cgcctgcgca gaggcggcag caccaccggg gttgactccg ggggcgcggc 60 
gaggagagac atg agg ctg age tgg ttc egg gte ctg aea gta etg tec 109 
Met Arg Leu Ser Trp Phe Arg Val Leu Thr Val Leu Ser 

















-20 










-15 










ate 


tgc 


ctg 


age 


gee 


gtg 


gee 


acg 


gee 


aeg 


ggg 


gee 


gag 


ggc 


aaa 


agg 


157 


He 


Cys 
-10 


Leu 


Ser 


Ala 


Val 


Ala 
-5 


Thr 


Ala 


Thr 


Gly 


Ala 
1 


Glu 


Gly 


Lys 


Arg 
5 




aag 


ctg 


cag 


ate 


ggg 


gtc 


aag 


aag 


egg 


gtg 


gac 


cac 


tgt 


cce 


ate 


aaa 


205 


Lys 


Leu 


Gin 


He 


Gly Val 


Lys 


Lys Arg 


Val 


Asp 


His 


Cys 


Pro 


He 


Lys 












10 










15 










20 






teg 


cgc 


aaa 


ggg 


gat 


gtc 


ctg 


cac 


atg 


cac 


tac 


acg 


ggg 


aag 


ctg 


gaa 


253 


Ser Arg 


Lys 


Gly 


Asp 


Val 


Leu 


His 


Met 


His 


Tyr 


Thr 


Gly 


Lys 


Leu 


Glu 










25 










30 










35 








gat 


ggg 


aca 


gag 


ttt 


gac 


age 


age 


etg 


eee 


cag 


aae 


eag 


cce 


ttt 


gtc 


301 


Asp 


Gly 


Thr 
40 


Glu 


Phe 


Asp 


Ser 


Ser 
45 


Leu 


Pro 


Gin 


Asn 


Gin 
50 


Pro 


Phe 


Val 




ttc 


tee 


ctt 


ggc 


aca 


ggc 


cag 


gtc 


ate 


aag 


ggc 


tgg 


gac 


cag 


ggg 


ctg 


349 


Phe 


Ser 
55 


Leu 


Gly 


Thr 


Gly 


Gin 
60 


Val 


He 


Lys 


Gly 


Trp 

65 


Asp 


Gin 


Gly 


Leu 




ctg 


ggg 


atg 


tgt 


gag 


ggg 


gaa 


aag 


cgc 


aas 


ykg 


gtg 


ate 


cca 


tec 


gag 


397 


Leu Gly 


Met 


Cys 


Glu 


Gly 


Glu 


Lys 


Arg 


Xaa 


Xaa 


Val 


He 


Pro 


Ser 


Glu 




70 










75 










80 










85 




eta 


ggg 


tat 


gga 


gag 


egg 


gga 


get 


cce 


eea 


aag 


att 


eca 


ggc 


ggt 


gca 


445 


Leu Gly 


Tyr 


Gly 


Glu Arg 


Gly Ala 


Pro 


Pro 


Lys 


He 


Pro 


Gly 


Gly 


Ala 












90 










95 










100 






ace 


ctg 


gtg 


ttc 


gag 


gtg 


gag 


ctg 


etc 


aaa 


ata 


gag 


ega 


ega 


act 


gag 


4 93 


Thr 


Leu 


Val 


Phe 
105 


Glu 


Val 


Glu 


Leu 


Leu 
110 


Lys 


He 


Glu 


Arg 


Arg 
115 


Thr 


Glu 





ctg taaceagac 505 
Leu 



<210> 366 
<2H> 539 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 65. .274 

<220> 

<221> sig_peptide 
<222> 65. .136 
<223> Von Heijne matrix 
score 6.5 

seq ILLTRPACLGSWA/EI 



<400> 366 



wo 02/083898 PCT/IBO 1/009 14 

208 

cactgattct ttggccatcg ctactacaat tttagaattg ctggcaattt tttctgcaca 60 
ctgg atg get tct tec aec agt gtc tea aca gga caa ate ttg etg aca 109 
Met Ala Ser Ser Thr Ser Val Ser Thr Gly Gin lie Leu Leu Thr 
-20 -15 -10 

aga cot get tgc ttg ggg tec tgg get gag ate egg tea ceg gtg agg 157 
Arg Pro Ala Cys Leu Gly Ser Trp Ala Glu lie Arg Ser Pro Val Arg 

-5 15 
acc ate tec ate gee age gac ttc cca aca gca egg gtg agt etc tgg 205 
Thr lie Ser lie Ala Ser Asp Phe Pro Thr Ala Arg Val Ser Leu Trp 

10 15 20 

gtg ceg ccc gca cct ggg atg gtt cet att aag ate tec ggc tgt gca 253 
Val Pro Pro Ala Pro Gly Met Val Pro lie Lys lie Ser Gly Cys Ala 

25 30 35 

aac tgg gcc ttc tea ceg gca tagatgatat cacacateat ggcaagctca 304 
Asn Trp Ala Phe Ser Pro Ala 
40 45 

cagcceccgc caaaggcata gecattgaca geagcgatga ctggattctt gacctgggtg 364 
a99tggtcce agtgcttcaa gaacttgetg gagtaacagt eetggaaaet caggttctge 424 
atttccttga tatcagctet agtgaaaagg geggtagttg gtggtggaac cagaaaegga 484 
cgecggtgct tkgagcggtt ettaaattat atttaaaaag taactttttg tataa 539 



<210> 367 
<211> 460 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 14.. 178 



<220> 

<221> sig^peptide 

<222> 14 . .82 

<223> Von Heijne matrix 

score 6.40000009536743 
seq LGALSGAAALGFA/SY 



<400> 367 

ctgttctaea get atg gcc ggg cca get gca. get 
Met Ala Gly Pro Ala Ala Ala 
-20 

gee ttg tec gga get gcg gee tta ggc ttc get 
Ala Leu Ser Gly Ala Ala Ala Leu Gly Phe Ala 

-10 -5 
ggc gem aat tec cag atg cct acg gga agg age 
Gly Ala Asn Ser Gin Met Pro Thr Gly Arg Ser 

10 15 
aca aac ace act tct tac aca gee tgg ccc tgt 
Thr Asn Thr Thr Ser Tyr Thr Ala Trp Pro Cys 

25 30 
aagceactct gggctgggtt attgctagct tccggaacga 
tactaccagg ctctgagtgg agaccccagc atccagaett 
etgetactet tgggetgget tgcettgget etttgagcte 
ttttctgggc agtttttttt ttaaagagtt ggagtaagaa 
aaataaaaaa aaaaaaaaaa gc 



ttc cge cge ttg ggc 4 9 

Phe Arg Arg Leu Gly 
-15 

tec tac ggg gcg cac 97 
Ser Tyr Gly Ala His 
1 5 
tgt ttg aca agg cca 145 
Cys Leu Thr Arg Pro 
20 

taggggtgcc ccattgcaga 198 



ccttattctg caccagcttt 258 
tggeecetgc gggagggacc 318 
cettttgctt aattactggg 378 
gaggattaaa aaggaaaggc 438 

460 



<210> 368 
<211> 482 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 



wo 02/083898 
<222> 14. .178 



209 



PCT/TBOl/00914 



<220> 

<221> sig_peptide 

<222> 14. .82 

<223> Von Heijne matrix 

score 6.40000009536743 
seg LGALSGAAALGFA/SY 



<400> 368 

ctgttctaca get atg gcc ggg cca get gca get ttc cge cgc ttg gge 49 
Met Ala Gly Pro Ala Ala Ala Phe Arg Arg Leu Gly 
-20 -15 
gee ttg tec gga get geg gcc tta ggc ttc get tec tac ggg geg cac 97 
Ala Leu Ser Gly Ala Ala Ala Leu Gly Phe Ala Ser Tyr Gly Ala His 

-10 -5 15 

gge gem aat tec cag atg cct acg gga agg age tgt ttg aca agg cca 145 
Gly Ala Asn Ser Gin Met Pro Thr Gly Arg Ser Cys Leu Thr Arg Pro 

10 15 20 

aca aac ace act tct tac aca gee tgg cec tgt taggggtgcc ceattgcaga 198 
Thr Asn Thr Thr Ser Tyr Thr Ala Trp Pro Cys 

25 30 
aagceactet gggctgggtt attgctaget tceggaaega cettattctg caccagettt 258 
tactaccagg ctctgagtgg agaccccagc ateeagaett tggeecctgc gggagggaec 318 
etgetactct tgggetggct tgecttgget etttgagctc cettttgctt aattaetggg 378 
ttttctggge agtttttttt ttaaagagtt ggagtaagaa gaggattaaa aaggaaaggc ■ 4 38 
aaataaaett tggagtcttt gtteateaaa aaaaaaaaaa aaaa 482 

<210> 369 
<211> 497 
<212> DNA 
<213> Horao sapiens 

<220> 
<221> CDS 
<222> 161. .322 



<220> 

<221> sig_peptide 

<222> 161. .211 

<223> Von Heijne matrix 

score 6.40000009536743 
seq LTFAQLLFATVLG/IA 



<400> 369 

agatgggggt aggagctetc eagaeagtgt eccetcacte acececgegg cgcctggegt 60 
cttetectgg gtgttgaagt aeaagatttg aagagatttc etgggtgtgg agtgtgactt 120 
tccaaaacea gcttttcett gagctgtatt tgttgcagca atg ttt agg aga ttg 175 

Met Phe Arg Arg Leu 
-15 

act ttt gca eaa ctg ett ttt gee act gtc ctt gga att get gga gga 223 
Thr Phe Ala Gin Leu Leu Phe Ala Thr Val Leu Gly He Ala Gly Gly 

-10 -5 1 

gta tat att ttt caa cca gta ttt gaa cag tat gcc aaa gat cag aag 271 
Val Tyr He Phe Gin Pro Val Phe Glu Gin Tyr Ala Lys Asp Gin Lys 
5 10 15 20 

gaa tta aaa gaa aag atg cag ttg gta eaa gaa tea gaa gag aag aaa 319 
Glu Leu Lys Glu Lys Met Gin Leu Val Gin Glu Ser Glu Glu Lys Lys 

25 30 35 

agt taatactaea tggagttagg ectggegcag tggeteaege etgtaatcec 372 
Ser 

agcactttgg gaggccgagg cgggtggatc aggtggtcag gagtteaaga ccagcctgac 432 
caacatggtg aaacctgtct ctactgaaaa tacaaaaate agceaggett ggtggcatgc 492 



wo 02/083898 

210 

gctgt 

<210> 370 
<211> 422 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 47. .295 

<220> 

<221> sig__peptide 

<222> 47. ,91 

<223> Von Heijne matrix 

score 6.30000019073486 

seq LFLSLPVLVWLS/IV 

<400> 370 

aaccaagccc tccagcaagg attcagagtg cccctccggc 



ttc ctg teg ctt ccg gtc ctg gtg gtg gtt ctg 
Phe Leu Ser Leu Pro Val Leu Val Val Val Leu 

-10 -5 
ggc oca gcc oca gcc cag ggg ace cca gac gtc 
Gly Pro Ala Pro Ala Gin Gly Thr Pro Asp Val 
5 10 15 

aag ctg aag gag ttt gga aac aca ctg gag gac 
Lys Leu Lys Glu Phe Gly Asn Thr Leu Glu Asp 

25 30 
ate age cgc ate aaa eag agt gaa ctt tct gee 
He Ser Arg He Lys Gin Ser Glu Leu Ser Ala 

40 45 
ttt tea gag aca ttt eag aaa gtg aag gat aaa 
Phe Ser Glu Thr Phe Gin Lys Val Lys Asp Lys 

55 60 
tgaggacctg aagggtgaca tcccaggagg ggcctctgaa 
tgtgctgagg aetccctcca tgtggcccca ggtgceacca 
taatata 



PCT/IBOl/00914 
4 97 



ctcgcc atg agg etc 
Met Arg Leu 
-15 

teg ate gtc ttg gaa 
Ser He Val Leu Glu 
1 

tec agt gee ttg gat 
Ser Ser Ala Leu Asp 
20 

aag get egg gaa etc 
Lys Ala Arg Glu Leu 
35 

aag atg egg gag tgg 
Lys Met Arg Glu Trp 
50 

etc aag att gac tea 
Leu Lys He Asp Ser 
65 

atttcccaca ccccagcgcc 
ataaaaatcc tacagaaaac 



55 



103 



151 



199 



247 



295 



355 
415 
422 



<210> 371 
<211> 495 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 66. .359 

<220> 

<221> sig^peptide 
<222> 66. .194 
<223> Von Heijne matrix 
score 6 

seq FLIYVALLRVSEC/LP 
<400> 371 

gggcgcgagg cggccaccgt ggagagcaga gcgcggcggc tggaagctgc taagtcagag 
ecgcg atg ttc egg att gag ggc etc gcg ccg aag ctg gac ccg gag gag 
Met Phe Arg He Glu Gly Leu Ala Pro Lys Leu Asp Pro Glu Glu 
-40 -35 -30 

atg aaa egg aag atg cgc gag gat gtg ate tec tee ata egg aac ttt 



60 
110 



158 



wo 02/083898 PCT/IBO 1/009 14 



211 



Met Lys 


Arg 


Lvs 


Met 


Ara 


Glu 


ASD 


Val 


He 


Ser 


Ser 


He 


Arg 


Asn 


Phe 








-25 










-20 










-15 








etc ate 


tac 


gtg 


qcc 


etc 


CtQ 


caa 


gte 


agt 


gag 


tgt 


etc 


ecg 


ggc 


tgt 


206 


Leu lie 


Tyr 


Val 


Ala 


Leu 


Leu 


Ara 


Val 


Ser 


Glu 


Cys 


Leu 


Pro 


Gly 


Cys 






-10 










-5 










1 










gac tgt 


gat 


aee 


age 


ggg 


gag 


etc 


ace 


gac 


ggg 


cac 


ccc 


tta 


act 


eta 


254 


Asp Cys 


AsD 


Thr 


Ser 


Glv 


Glu 


Leu 


Thr 


Asp 


Glv 


His 


Pro 


Leu 


Thr 


Leu 




5 








10 










15 










20 




agg ggt 


cat 


egg 


ggc 


ctt 


cga 


act 


gag 


ctg 


aac 


ggt 


age 


ggg 


gag 


caa 


302 


Arg Gly 


His 


Arg 


Gly 


Leu 


Arg 


Thr 


Glu 


Leu 


Asn 


Gly 


Ser 


Gly 


Glu 


Gin 










25 










30 










35 






gga ggc 


tec 


att 


tat 


ctt 


aaa 


gaa 


att 


gga 


cag 


cat 


atg 


aag 


aca 


gga 


350 


Gly Gly 


Ser 


lie 


Tyr 


Leu 


Lys 


Glu 


He 


Gly 


Gin 


His 


Met 


Lys 


Thr 


Gly 








40 










45 










50 









cat cac ata tgaatgcacg atatgaagag cctggttaca gtttcgactc 399 
His His He 



55 

ctctctgcaa gtgaataggc ccagaaaggt gtaagagact ctttgaatgg acataaaatt 459 
ctgcttgtta agaacaagtt tggctctggt aactga 4 95 

<210> 372 
<211> 515 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 115. .360 

<220> 

<221> sig_peptide 

<222> 115. .243 

<223> Von Heijne matrix 

score 5.59999990463257 

seq VVSFALIATLVYA/LF 

<400> 372 

ataaggggac gtctagtggg ttgeccggga gggtggcggg agggtcctgg aaataatctg 60 
tcctctgtcg ccgggaactg gcgaggtagt tccttcgegg tggagagacc tgga atg 117 

Met 



gcc 


aaa 


tat 


caa 


ggt 


gaa 


gtt 


caa 


agt 


ttg 


aaa 


ctg 


gat 


gat 


gat 


tea 


165 


Ala 


Lys 


Tyr 
-40 


Gin 


Gly 


Glu 


Val 


Gin 
-35 


Ser 


Leu 


Lys 


Leu 


Asp 

-30 


Asp 


Asp 


Ser 




gtt 


ata 


gaa 


gga 


gta 


age 


gac 


caa 


gta 


ctt 


gtg 


gca 


gtt 


gtg 


gte 


agt 


213 


Val 


He 
-25 


Glu 


Gly 


Val 


Ser 


Asp 
-20 


Gin 


Val 


Leu 


Val 


Ala 
-15 


Val 


Val 


Val 


Ser 




ttc 


get 


ttg 


att 


get 


acc 


ctg 


gta 


tat 


gca 


ctt 


ttc 


aga 


aat 


gta 


cat 


261 


Phe 


Ala 


Leu 


He 


Ala 


Thr 


Leu 


Val 


Tyr 


Ala 


Leu 


Phe 


Arg 


Asn 


Val 


His 




-10 










-5 










1 








5 






caa 


aac 


att 


cac' 


cca 


gaa 


aac 


cag 


gag 


eta 


gta 


agg 


gta 


ctt 


cga 


gaa 


309 


Gin 


Asn 


He 


His 


Pro 


Glu 


Asn 


Gin 


Glu 


Leu 


Val 


Arg 


Val 


Leu 


Arg 


Glu 










10 










15 








20 








cag 


ctt 


caa 


aca 


gaa 


cag 


gat 


gca 


ect 


get 


gac 


teg 


aca 


gca 


gtt 


eta 


357 


Gin 


Leu 


Gin 
25 


Thr 


Glu 


Gin 


Asp 


Ala 
30 


Pro 


Ala 


Asp 


Ser 


Thr 
35 


Ala 


Val 


Leu 





cac tgacatgtac tgtcccatct gcctgeacca agcctccttc ccggtggaga 410 
His 



ccaactgtgg acatcttttt tgtggtgcct geattattgc ttactggcga tatggttcat 470 
ggcttggggc aatcagttgt ccaatctgta gacaaaegag acatg 515 



<210> 373 
<211> 513 
<212> DNA 



wo 02/083898 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 56.. 352 

<220> 

<221> sig_peptide 
<222> 56. .139 
<223> Von Heijne matrix 
score 5.5 

seq LGYLVLSEGAVLA/SS 



<400> 373 

ctgaagccgg aagctaccta tctggtaggg agctccccca gcaccgaaga ctgcg atg 58 

Met 



act 


tct 


gca 


ctg 


acc 


cag 


ggg 


ctg 


gag 


cga 


ate 


cca 


gae 


cag 


etc 


ggc 


106 


Thr 


Ser 


Ala 
-25 


Leu 


Thr 


Gin 


Gly 


Leu 
-20 


Glu 


Arg 


He 


Pro 


Asp 
-15 


Gin 


Leu 


Gly 




tac 


ctg 


gta 


ctg 


agt 


gaa 


ggt 


gca 


gtg 


ctg 


gcg 


tea 


tct 


ggg 


gac 


ctg 


154 


Tyr 


Leu 
-10 


Val 


Leu 


Ser 


Glu 


Gly 
-5 


Ala 


Val 


Leu 


Ala 


Ser 
1 


Ser 


Gly 


Asp 


Leu 

5 




gag 


aat 


gat 


gag 


cag 


gca 


gcc 


agt 


gcc 


ate 


tct 


gag 


ctg 


gtc 


age 


aca 


202 


Glu 


Asn 


Asp 


Glu 


Gin 
10 


Ala 


Ala 


Ser 


Ala 


He 

15 


Ser 


Glu 


Leu 


Val 


Ser 
20 


Thr 




gcc 


tgc 


ggt 


ttc 


egg 


ctg 


cac 


cgc 


ggc 


atg 


aat 


gtg 


cec 


ttc 


aag 


cgc 


250 


Ala 


Cys 


Gly 


Phe 
25 


Arg 


Leu 


His 


Arg 


Gly 
30 


Met 


Asn 


Val 


Pro 


Phe 
35 


Lys 


Arg 




ctg 


tct 


gtg 


gtc 


ttt 


gga 


gaa 


cac 


aca 


ctg 


ctg 


gtg 


acg 


gtg 


tea 


gga 


298 


Leu 


Ser 


Val 
40 


Val 


Phe 


Gly 


Glu 


His 
45 


Thr 


Leu 


Leu 


Val 


Thr 
50 


Val 


Ser 


Gly 




cag 


agg 


gtg 


ttt 


gtg 


gtg 


aag 


agg 


cag 


aac 


cga 


ggt 


egg 


gag 


cec 


att 


346 


Gin 


Arg 
55 


Val 


Phe 


Val 


Val 


Lys 
60 


Arg 


Gin 


Asn 


Arg 


Gly 
65 


Arg 


Glu 


Pro 


He 





gat gtc tgagcetgee ggaggcgagg gteggagaag eggattgggt cctgggcctc 402 

Asp Val 

70 

tgtgatgagg caggcaeacc tgteggtett ggcttgctgc tagaaactag ggcettetgc 4 62 
tegeecaeet cceaeeecet acctggaegg geeeaggett ggggaetctg a 513 

<210> 374 
<211> 632 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 2.. 559 

<220> 

<221> sig_peptide 
<222> 2. .46 

<223> Von Heijne matrix 

score 5.30000019073486 
seq VMAASMARGGVSA/RV 

<400> 374 

g atg cat gtc atg gcc gee tec atg gee egg gga ggc gtg agt gee agg 4 9 

Met His Val Met Ala Ala Ser Met Ala Arg Gly Gly Val Ser Ala Arg 
-15 -10 -5 1 

gtt eta ctg cag get gcc agg ggc ace tgg tgg aac aga eet ggg ggc 97 
Val Leu Leu Gin Ala Ala Arg Gly Thr Trp Trp Asn Arg Pro Gly Gly 
5 10 15 



PCT/IBOl/00914 

212 



wo 02/083898 



PCT/IBOl/00914 



213 



act 


tec 


9gg 


teg 


ggg 


gag 


ggg 


gtg 


gcg 


ctg 


999 


aca 


acc 


aga 


aag 


ttt 


145 


Thr 


Ser 


Gly 


Ser 


Gly 


Glu 


Gly 


Val 


Ala 


Leu 


Gly 


Thr 


Thr 


Arg 


Lys 


Phe 








20 










25 










30 










caa 


gcg 


aca 


ggc 


teg 


cgc 


ccg 


get 


gga 


gag 


gag 


gac 


gcg 


ggc 


ggc 


ccg 


193 


Gin 


Ala 


Thr 


Gly 


Ser 


Arg 


Pro 


Ala 


Gly 


Glu 


Glu 


Asp 


Ala 


Gly 


Gly 


Pro 






35 










40 










45 












gag 


egg 


ccc 


999 


gac 


gtg 


gtg 


aac 


gtg 


gtg 


ttc 


gta 


gac 


cgc 


tea 


ggc 


241 


Glu 


Arg 


Pro 


Gly 


Asp 


Val 


Val 


Asn 


Val 


Val 


Phe 


Val 


Asp 


Arg 


Ser 


Gly 




50 










55 










60 










65 




cag 


egg 


ate 


cca 


gtg 


agt 


ggc 


aga 


gtc 


ggg 


gac 


aat 


gtt 


ctt 


cac 


ctg 


289 


Gin Arg 


He 


Pro 


Val 


Ser 


Gly 


Arg 


Val 


Gly 


Asp 


Asn 


Val 


Leu 


His 


Leu 












70 










75 










80 






gcc 


cag 


cgc 


cac 


ggg 


gtg 


gac 


ctg 


gaa 


ggg 


gcc 


tgt 


gaa 


gcc 


tec 


ctg 


337 


Ala 


Gin 


Arg 


His 


Gly 


Val 


Asp 


Leu 


Glu 


Gly 


Ala 


Cys 


Glu 


Ala 


Ser 


Leu 










85 










90 










95 








gcc 


tgc 


tee 


ace 


tgc 


cat 


gtg 


tat 


gtg 


agt 


gaa 


gac 


cac 


ctg 


gat 


etc 


385 


Ala 


Cys 


Ser 


Thr 


Cys 


His 


Val 


Tyr 


Val 


Ser 


Glu 


Asp 


His 


Leu 


Asp 


Leu 








100 










105 










110 










ctg 


cct 


cct 


ccc 


gag 


gag 


agg 


gaa 


gac 


gac 


atg 


eta 


gac 


atg 


gcc 


ccc 


433 


Leu 


Pro 


Pro 


Pro 


Glu 


Glu 


Arg 


Glu 


Asp 


Asp 


Met 


Leu 


Asp 


Met 


Ala 


Pro 






115 










120 










125 












etc 


etc 


cag 


gag 


aac 


teg 


egg 


ctg 


ggc 


tgc 


cag 


att 


gtg 


ctg 


aca 


ccg 


481 


Leu 


Leu 


Gin 


Glu 


Asn 


Ser 


Arg 


Leu 


Gly 


Cys 


Gin 


He 


Val 


Leu 


Thr 


Pro 




130 










135 










140 










145 




gag 


ctg 


saa 


gga 


gcg 


gaa 


ttc 


acc 


ctg 


ccc 


aag 


ate 


acc 


agg 


aac 


ttc 


529 


Glu 


Leu 


Xaa 


Gly 


Ala 


Glu 


Phe 


Thr 


Leu 


Pro 


Lys 


He 


Thr 


Arg 


Asn 


Phe 












150 










155 










160 






tac 


gtg 


gat 


ggc 


cat 


gtc 


ccc 


aag 


ccc 


cac 


tgacatgaae acctggacca 


579 


Tyr Val 


Asp 


Gly 


His 


Val 


Pro 


Lys 


Pro 


His 

















165 170 
ttccacattg ccatggeccc agggccagat tgagggaata gcaggtgcag cct 632 



<210> 375 
<211> 503 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 59. .427 



<220> 

<221> sig^peptide 

<222> 59. .202 

<223> Von Heijne matrix 

score 5.19999980926514 
seq LLMLLLFLSELQY/YL 



<400> 375 

aggceggaag agggagtctg taggggcggg cegggegtec cctttccggc cggtcccc 58 

106 



154 



atg 


gag 


gcg 


ctg 


999 


aag 


ctg 


aag 


cag 


ttc 


gat 


gcc 


tac 


ccc 


aag 


act 


Met 


Glu 


Ala 


Leu 


Gly 


Lys 


Leu 


Lys 


Gin 


Phe 


Asp 


Ala 


Tyr 


Pro 


Lys 


Thr 








-45 










-40 










-35 






ttg 


gag 


gac 


ttc 


egg 


gtc 


aag 


acc 


tgc 


ggg 


ggc 


gcc 


acc 


gtg 


acc 


att 


Leu 


Glu 


Asp 


Phe 


Arg 


Val 


Lys 


Thr 


Cys 


Gly 


Gly 


Ala 


Thr 


Val 


Thr 


He 






-30 










-25 










-20 








gtc 


agt 


ggc 


ctt 


etc 


atg 


ctg 


eta 


ctg 


ttc 


ctg 


tee 


gag 


ctg 


cag 


tat 


Val 


Ser 


Gly 


Leu 


Leu 


Met 


Leu 


Leu 


Leu 


Phe 


Leu 


Ser 


Glu 


Leu 


Gin 


Tyr 




•15 










-10 










-5 










tac 


etc 


acc 


acg 


gag 


gtg 


cat 


cct 


gag 


etc 


tac 


gtg 


gac 


aag 


teg 


egg 


Tyr 


Leu 


Thr 


Thr 


Glu 


Val 


His 


Pro 


Glu 


Leu 


Tyr 


Val 


Asp 


Lys 


Ser 


Arg 


1 








5 










10 










15 




gga 


gat 


aaa 


ctg 


aag 


ate 


aac 


ate 


gat 


gta 


ctk 


ttt 


ccg 


cac 


atg 


cct 



202 



250 



298 



wo 02/083898 PCT/IBO 1/009 14 

214 



Gly Asp 


Lys 


Leu 


Lys 


He 


Asn 


He Asp 


Val 


Leu 


Phe Pro His Met Pro 










20 








25 






30 




tgt 


qcc 


tat 


ctg 


agt 


att 


gat 


gcc atg 


gat 


gtg 


gcc gga gaa cag cag 


346 


Cys 


Ala 


Tyr 


Leu 


Ser 


He 


Asp 


Ala Met 


Asp 


Val 


Ala Gly Glu Gin Gin 








35 










40 






45 




ctg 


gat 


gtg 


gaa 


cac 


aac 


ctg 


ttc aag 


caa 


cga 


eta gat aaa gat ggc 


394 


Leu 


Asp 


Val 


Glu 


His 


Asn 


Leu 


Phe Lys 


Gin 


Arg 


Leu Asp Lys Asp Gly 






50 










55 








60 




ate 


ccc 


gtg 


age 


tea 


gag 


get 


gag egg 


cat 


gat 


taatgeagga caecagtgga 


447 


He 


Pro 


Val 


Ser 


Ser 


Glu 


Ala 


Glu Arg 


His 


Asp 






65 










70 








75 







accaacataa tttgctatag gactggatgt ggagtggagg agagcaaagt catggt 503 



<210> 376 
<211> 521 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 145. .375 

<220> 

<221> sig_peptide 

<222> 145. .231 

<223> Von Heijne matrix 

score 5.09999990463257 

seq LLLKLQRLPQAEP/VE 

<400> 376 

gacaaacttg ttctgcgggc tgcggatggg tgcgagggtg gaatctcggt gctgcgacga 60 
gtgtggggec arccgtggag gctccaggtg ttctctctgc cccagcagag cccggcagga 120 
gceccaaeag gaagccageg eggc atg get gcc acc gac ttc gtg cag gag 171 

Met Ala Ala Thr Asp Phe Val Gin Glu 

























•25 












atg 


cgc 


gcc 


gtg 


ggc 


gag 


agg 


ctg 


ctg 


etc 


aag 


ctg 


cag 


aga 


ctg 


ccc 


219 


Met 


Arg 


Ala 


Val 


Gly 


Glu 


Arg 


Leu 


Leu 


Leu 


Lys 


Leu 


Gin 


Arg 


Leu 


Pro 




-20 










-15 










-10 










-5 




cag 


get 


gag 


ccc 


gtg 


gag 


ate 


gtg 


gcc 


ttc 


tea 


gtc 


ate 


ate 


ctt 


ttc 


267 


Gin 


Ala 


Glu 


Pro 


Val 

1 


Glu 


He 


Val 


Ala 

5 


Phe 


Ser 


Val 


He 


He 

10 


Leu 


Phe 




aca 


get 


act 


gtt 


ctg 


ctg 


ttg 


ctg 


ctg 


ata 


gcc 


tgc 


age 


tgc 


tgc 


tgc 


315 


Thr 


Ala 


Thr 


Val 


Leu 


Leu 


Leu 


Leu 


Leu 


He 


Ala 


Cys 


Ser 


Cys 


Cys 


Cys 








15 










20 










25 










act 


cac 


tgc 


tgc 


tgc 


cet 


gag 


erg 


aga 


ggc 


agg 


aag 


gtc 


cag 


gtg 


cag 


363 


Thr 


His 


Cys 


Cys 


Cys 


Pro 


Glu 


Xaa 


Arg 


Gly 


Arg 


Lys 


Val 


Gin 


Val 


Gin 






30 










35 










40 













ccg aca cea cea tgacggackg gegatggctg aggagaaget ggagaggaga 415 
Pro Thr Pro Pro 

45 

tggccaatgc catgacacag gccatcagcc tggccctgea geccttacce cteaagacsa 475 
ggetcectgg eccagctetg geceagceag gtacctggac actgae 521 

<210> 377 
<211> 461 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 17. .337 



<220> 



wo 02/083898 



PCT/IBOl/00914 



215 

<22l> sig_peptide 

<222> 17. .97 

<223> Von Heijne matrix 

score 4.90000009536743 

seq VLSTLSLVQFSPS/GR 

<400> 377 

actttagctg ctgcag atg ctg ttt gaa gaa get ctg ccc etc age tgc tct 52 
Met Leu Phe Glu Glu Ala Leu Pro Leu Ser Cys Ser 
-25 -20 



gat 


cct 


Qta 


ett 


aac 


act 


ett 


aac 


cto 


Ota 


cag 


ttc 


aac 


ccc 


aat 


aaa 


100 


Asp 


Pro 


Val 


Leu 


Ser 


Thr 


Leu 


Ser 


Leu 


Val 


Gin 


Phe 


Ser 


Pro 


Ser 


Gly 




-15 










-10 










-5 










1 




agg 


ace 


cag 


gac 


ctg 


etc 


tct 


cea 


ggg 


gtg 


gag 


aac 


ctg 


teg 


gtg 


ctg 


148 


Arg 


Thr 


Gin 


Asp 
5 


Leu 


Leu 


Ser 


Pro 


Gly 

10 


Val 


Glu 


Asn 


Leu 


Ser 
15 


Val 


Leu 




gac 


gtg 


tec 


cct 


ctg 


ggc 


ttg 


gee 


tgc 


tgt 


ctg 


etc 


act 


etc 


acc 


atg 


196 


Asp 


Val 


Ser 
20 


Pro 


Leu 


Gly 


Leu 


Ala 
25 


Cys 


Cys 


Leu 


Leu 


Thr 
30 


Leu 


Thr 


Met 




tec 


tgc 


cea 


ggg 


cct 


gac 


cct 


cct 


gag 


ggg 


ccc 


ggg 


acc 


cag 


cgt 


gtg 


244 


Ser 


Cys 
35 


Pro 


Gly 


Pro 


Asp 


Pro 
40 


Pro 


Glu 


Gly 


Pro 


Gly 
45 


Thr 


Gin 


Arg 


Val 




tgg 


caa 


ggg 


get 


eta 


egg 


ate 


eta 


cag 


etc 


cca 


gga 


gee 


cca 


gat 


ggg 


292 


Trp Gin 


Gly 


Ala 


Leu 


Arg 


He 


Leu 


Gin 


Leu 


Pro Gly 


Ala 


Pro 


Asp 


Gly 




50 










55 










60 










65 




gtt 


tea 


cca 


tat 


cag 


cca 


gtc 


tgg 


tct 


cga 


act 


cct 


gac 


etc 


aag 




337 


Val 


Ser 


Pro 


Tyr 


Gin 
70 


Pro 


Val 


Trp 


Ser 


Arg 
75 


Thr 


Pro 


Asp 


Leu 


Lys 
80 







tgattcgeca gcctegtcet cccaaagtge tgggattaea gatgtgagcc accatgccca 397 
tggagaatgt attacttttg taatttaaaa aataatagta ataaaataaa gatctttaea 457 
atgt 461 



<210> 378 
<211> 495 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 148.. 483 

<220> 

<221> sig_peptide 

<222> 148. .264 

<223> Von Heijne matrix 

score 4.80000019073486 

seq IILFSAIVGFIYG/YV 

<220> 

<221> misc_feature 
<222> 43,112,122,154 
<223> n=a, g, c or t 

<400> 378 

cctaatcgaa aattcgctct cccgggetta gaaggcccgg etnctgaege gcagtgecag 60 
accttacccc tcacggtcct taagtctegg tcgccctcgc etcgcagcet gncacccgcg 120 
mneagctgcc cgcctcctca gccagce atg ctg nag cat ctg age teg ctg ccc 174 

Met Leu Xaa His Leu Ser Ser Leu Pro 
-35 

acg cag atg gat tac aag ggc cag aag eta get gaa cag atg ttt cag 222 

Thr Gin Met Asp Tyr Lys Gly Gin Lys Leu Ala Glu Gin Met Phe Gin 

-30 -25 -20 -15 

gga att att ett ttt tct gca ata gtt gga ttt ate tac ggg tac gtg 270 



i 



wo 02/083898 PCT/IBO 1/009 14 

216 



Gly 


He 


He 


Leu 


Phe 
-10 


Ser 


Ala 


He 


Val 


Gly 
-5 


Phe 


He 


Tyr Gly Tyr Val 
1 




get 


gaa 


cag 


ttc 


ggg 


tgg 


act 


gtc 


tat 


ata 


gtt 


atg 


gcc gga ttt get 


318 


Ala 


Glu 


Gin 


Phe 


Gly Trp Thr Val 


Tyr 


He 


Val 


Met 


Ala Gly Phe Ala 








5 










10 










15 




ttt 


tea 


tgt 


ttg 


ctg 


aca 


ctt 


cct 


cea 


tgg 


ccc 


ate 


tat cgc egg cat 


366 


Phe 


Ser 


Cys 


Leu 


Leu 


Thr 


Leu 


Pro 


Pro Trp 


Pro 


He 


Tyr Arg Arg His 






20 










25 










30 






cet 


etc 


aag 


tgg 


tta 


cct 


gtt 


caa 


gea 


cag 


acg 


aca 


aga aac cag ggg 


414 


Pro 


Leu 


Lys 


Trp 


Leu 


Pro 


Val 


Gin 


Ala 


Gin 


Thr 


Thr 


Arg Asn Gin Gly 




35 










40 










45 




50 




aaa 


gaa 


aaa 


tta 


aga 


ggc 


atg 


eta 


aaa 


ata 


att 


gag 


gtt ttc atg att 


462 


Lys 


Glu 


Lys 


Leu 


Arg Gly Met 


Leu Lys 


He 


He 


Glu 


Val Phe Met He 












55 










60 






65 




cag 


cac 


ctg 


ctt 


ttg 


ttt 


ctg 


tgagatgagc ta 






495 


Gin 


His 


Leu 


Leu 
70 


Leu 


Phe 


Leu 

















<210> 379 
<211> 541 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 60. .368 

<220> 

<221> sig_peptide 

<222> 60. .122 

<223> Von Heijne matrix 

score 4.69999980926514 

seq ALRSLNLAPPTVA/AP 

<400> 379 

acagtcaacg tcatctagga gcaccgagca gcttggctaa aagtaagggt gtcgtgctg 59 



atg 


gcc 


ctg 


tgc 


gea 


ctg 


acc 


cgc 


get 


ctg 


cgc 


tct 


ctg 


aac 


ctg 


gcg 


107 


Met 


Ala 
-20 


Leu 


Cys 


Ala 


Leu 


Thr 
-15 


Arg 


Ala 


Leu 


Arg 


Ser 
-10 


Leu 


Asn 


Leu 


Ala 




ccc 


ccg 


acc 


gtc 


gcc 


gcc 


cct 


gcc 


ccg 


agt 


ctg 


ttc 


ccc 


gcc 


gcc 


cag 


155 


Pro 


Pro 


Thr 


Val 


Ala 


Ala 


Pro 


Ala 


Pro 


Ser 


Leu 


Phe 


Pro 


Ala 


Ala 


Gin 




-5 










1 








5 










10 






atg 


atg 


aac 


aat 


ggc 


etc 


etc 


caa 


cag 


ccc 


tct 


gcc 


ttg 


atg 


ttg 


etc 


203 


Met 


Met 


Asn 


Asn 
15 


Gly 


Leu 


Leu 


Gin 


Gin 
20 


Pro 


Ser 


Ala 


Leu 


Met 
25 


Leu 


Leu 




ccc 


tgc 


cgc 


cca 


gtt 


ctt 


act 


tct 


gtg 


gcc 


ctt 


aat 


gcc 


aac 


ttt 


gtg 


251 


Pro Cys 


Arg 


Pro 


Val 


Leu 


Thr 


Ser 


Val 


Ala 


Leu 


Asn 


Ala 


Asn 


Phe 


Val 








30 










35 










40 










tec 


tgg 


aag 


agt 


cgt 


acc 


aag 


tac 


acc 


att 


aca 


cca 


gtg 


aag 


atg 


agg 


299 


Ser 


Trp 
45 


Lys 


Ser 


Arg 


Thr 


Lys 
50 


Tyr 


Thr 


He 


Thr 


Pro 
55 


Val 


Lys 


Met 


Arg 




aag 


tct 


ggg 


ggc 


cga 


gac 


cac 


aca 


ggt 


get 


gga 


aac 


gtg 


cgt 


age 


aac 


347 


Lys 


Ser 


Gly 


Gly 


Arg 


Asp 


His 


Thr 


Gly 


Ala 


Gly Asn 


Val 


Arg 


Ser 


Asn 




60 










65 










70 










75 




agt 


agg 


ccg 


agt 


ate 


caa 


cgt 


tgatcataac aaacgggtca ttggcaaggc 


398 


Ser Arg 


Pro 


Ser 


He 


Gin 


Arg 























80 



aggtcgeaac cgctggctgg geaagaggcc taacagtggg cggtggcacc gcaagggggg 458 
ctgggctggc cgaaagatte ggccactacc ccccatgaag agttacgtga agctgccttc 518 
tgcttctgcc aaagctgata tec 541 



<210> 380 
<211> 545 



wo 02/083898 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 80.. 355 

<220> 

<221> sig_peptide 
<222> 80. .124 
<223> Von Heijne matrix 

score 4.599999904 63257 

seq VLAGSLLGPTSRS/AA 

<400> 380 

cttttccttt gttccagggc tggagcggct ctgggctccg gaatcgcccg cagccggtac 60 
tgcgggaccc actgcggat atg get gtc ttg get gga tee ctg ttg gge ccc 112 

Met Ala Val Leu Ala Giy Ser Leu Leu Gly Pro 













-15 








-10 








-5 




acg 


agt 


agg 


teg 


gca 


gcg 


ttg 


ctg 


ggt 


gge 


agg 


tgg 


etc 


cag 


ccc 


egg 


160 


Thr 


Ser 


Arg 


Ser 


Ala 
1 


Ala 


Leu 


Leu 


Gly 
5 


Gly 


Arg 


Trp 


Leu 


Gin 
10 


Pro 


Arg 




gcc 


tgg 


ctg 


ggg 


ttc 


eca 


gae 


gee 


tgg 


gge 


etc 


ccc 


ace 


ccg 


cag 


cag 


208 


Ala 


Trp 


Leu 


Gly 


Phe 


Pro 


Asp 


Ala 


Trp 


Gly 


Leu 


Pro 


Thr 


Pro 


Gin 


Gin 








15 










20 










25 










gcc 


egg 


gge 


aag 


get 


cgc 


ggg 


aat 


gag 


tat 


cag 


ccg 


age 


aac 


ate 


aaa 


256 


Ala 


Arg 


Gly 


Lys 


Ala 


Arg 


Gly 


Asn 


Glu 


Tyr 


Gin 


Pro 


Ser 


Asn 


He 


Lys 






30 










35 










40 












cgc- 


aag 


aac 


aag 


cac 


gge 


tgg 


gte 


egg 


cgc 


ctg 


age 


acg 


ccg 


gcc 


gge 


304 


Arg 


Lys 


Asn 


Lys 


His 


Gly 


Trp 


Val 


Arg 


Arg 


Leu 


Ser 


Thr 


Pro 


Ala 


Gly 




45 










50 










55 










60 




gtg 


cag 


gtc 


ate 


ctt 


cgc 


cga 


atg 


etc 


aag 


gge 


cgc 


aag 


teg 


ctg 


age 


352 


Val 


Gin 


Val 


lie 


Leu 


Arg 


Arg 


Met 


Leu 


Lys 


Gly 


Arg 


Lys 


Ser 


Leu 


Ser 












65 










70 










75 






cat 


tgaggatcgc gacgcagtcg gegggaccet eatggaagca 


tegecctege 




405 



His 

ctcggacctt gcetggegct atttttgcag ggagctgggg agcaggaacg cetcggacet 4 65 

gagtgetcte catattgtgg ggttgaagtc tggatgggag ettgecaagt ccctttttag 525 

gctttttaat taggaagcat 545 

<210> 381 
<211> 498 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 137.. 397 

<220> 

<221> sig_peptide 

<222> 137. .253 

<223> Von Heijne matrix 

score 4.59999990463257 

seq LQGARVLLTGANA/GV 

<400> 381 

ggggtgcggt cggggacecg gcaggaggcg gccgagaaga gaggaccgtg ggggcgtteg 60 
cgtggeteec agceegggae cecacccccg ctggacagtg ggggaaactg aggcctgagc 120 
gggcecacac aggace atg aag gtg ctt etc etc aca ggg ctg ggg gcc ctg 172 
Met Lys Val Leu Leu Leu Thr Gly Leu Gly Ala Leu 
-35 -30 
ttc ttc gee tat tat tgg gat gae aac ttc gac eca gcc age etc cag 220 



PCT/IBO 1/009 14 

217 



wo 02/083898 PCT/IBO 1/009 14 



218 



Phe 


Phe 


Ala 


Tyr 


Tyr 


Trp 


Asp 


Asp 


Asn 


Phe 


Asp Pro 


Ala 


Ser 


Leu 


Gin 








-25 










-20 








-15 










gga 


gcg 


cga 


gtg 


ctg 


ctg 


aca 




gcc 


aac 


get ggt 


gtt 


ggt 


gag 


gag 


268 


Gly Ala 


Arg 


Val 


Leu 


Leu 


Thr 


Gly 


Ala 


Asn 


Ala Gly 


Val 


Gly 


Glu 


Glu 






"10 










-5 








1 








5 




ctg 


gcc 


tat 


cac 


tac 


gcg 


cgt 


ctg 


ggc 


tec 


cac ctg 


gtg 


etc 


act 


gee 


316 


Leu 


Ala 


Tyr 


His 


Tyr 


Ala 


Arg 


Leu 


Gly 


Ser 


His Leu 


Val 


Leu 


Thr 


Ala 












10 










15 








20 






cac 


act 


gag 


get 


etc 


ctg 


cag 


aag 


gca 


egg 


tgg etc 


aeg 


ett 


gta 


gtc 


364 


His 


Thr 


Glu 


Ala 


Leu 


Leu 


Gin 


Lys 


Ala 


Arg 


Trp Leu 


Thr 


Leu 


Val 


val 










25 










30 








35 








age 


act 


ttg 


gga 


ggc 


cga 


gga 


aag 


tgg 


ate 


ace tgaggteagg agtteaagae 


417 


Ser 


Thr 


Leu 


Gly 


Gly 


Arg 


Gly 


Lys 


Trp 


lie 


Thr 













40 45 
cagcctggcc aaeatggcga aacctggtct ctactaagaa gaeaaaaatt aaggccagge 477 
atggtggcte atgectgtaa t 498 



<210> 382 
<211> 459 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 43. .417 



<220> 

<221> sig_peptide 

<222> 43. .87 

<223> Von Heijne matrix 

score 4.59999990463257 

seq LLTHNLLSSHVRG/VG 

<400> 382 

acctttttcc ggttceggcc tggcgagagt ttgtgcggcg ac atg aaa ctg ett 54 

Met Lys Leu Leu 

























-15 










acc 


cac 


aat 


ctg 


ctg 


age 


teg 


cat 


gtg 


egg 


ggg gtg 


ggg 


tee 


egt 


ggc 


102 


Thr 


His 


Asn 


Leu 


Leu 


Ser 


Ser 


His 


Val 


Arg 


Gly Val 


Gly 


Ser 


Arg 


Gly 






-10 










-5 








1 








5 




ttc 


ccc 


ctg 


cgc 


etc 


eag 


gee 


ace 


gag 


gtc 


cgt ate 


tgc 


ect 


gtg 


gaa 


150 


Phe 


Pro 


Leu 


Arg 


Leu 


Gin 


Ala 


Thr 


Glu 


Val 


Arg He 


Cys 


Pro 


Val 


Glu 












10 










15 








20 






ttc 


aac 


ccc 


aac 


ttc 


gtg 


gcg 


cgt 


atg 


ata 


cet aaa 


gtg 


gag 


tgg 


teg 


198 


Phe 


Asn 


Pro 


Asn 


Phe 


Val 


Ala 


Arg 


Met 


He 


Pro Lys 


Val 


Glu 


Trp 


Ser 










25 










30 








35 








gcg 


ttc 


ctg 


gag 


gcg 


gee 


gat 


aac 


ttg 


cgt 


ctg ate 


cag 


gtg 


ecg 


aaa 


246 


Ala 


Phe 


Leu 


Glu 


Ala 


Ala 


Asp 


Asn 


Leu 


Arg 


Leu He 


Gin 


Val 


Pro 


Lys 








40 










45 








50 










ggg 


ecg 


gtt 


gag 


gga 


tat 


gag 


gag 


aat 


gag 


gag ttt 


ctg 


agg 


ace 


atg 


294 


Gly 


Pro 


Val 


Glu 


Gly 


Tyr 


Glu 


Glu 


Asn 


Glu 


Glu Phe 


Leu 


Arg 


Thr 


Met 






55 










60 








65 












eac 


cac 


ctg 


ctg 


ctg 


gag 


gtg 


gaa 


gtg 


ata 


gag ggc 


acc 


ctg 


cag 


tgc 


342 


His 


His 


Leu 


Leu 


Leu 


Glu 


Val 


Glu 


Val 


He 


Glu Gly 


Thr 


Leu 


Gin 


Cys 




70 










75 










80 








85 




ecg 


gaa 


tet 


gga 


cgt 


atg 


ttc 


ccc 


ate 


age 


cge ggg 


ate 


ecm 


ame 


atg 


390 


Pro 


Glu 


Ser 


Gly 


Arg 


Met 


Phe 


Pro 


He 


Ser 


Arg Gly 


He 


Pro 


Xaa 


Met 












90 










95 








100 






ctg 


ctg 


agt 


gaa 


gag 


gaa 


act 


gag 


agt 


tgattgtgsc aggcgccagt 




437 


Leu 


Leu 


Ser 


Glu 


Glu 


Glu 


Thr 


Glu 


Ser 

















105 110 



ttttcttgty atgactgtgt at 459 



1 

i 
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<210> 383 
<211> 533 
<212> DNA 

<213> Homo sapiens 

<220> 
<22l> CDS 
<222> 82.. 237 



<220> 

<221> sig^peptide 
<222> 82. .147 
<223> Von Heijne matrix 
score 4 . 5 

seq FFLLITIALGTKT/ES 
<400> 383 

aawtttctca acaattcctc accgcagagc ctctgaagct cccaccaggc cagctctcct 60 



cccacaacag cttcccacag c 


atg 


aag 


ate 


tee 


gtg get gee att ccc ttc 


111 










Met 


Lys 


He 
-20 


Ser 


Val Ala Ala He Pro Phe 
-15 




ttc etc 


etc ate 


acc 


ate 


gee 


eta 


ggg 


acc 


aag act gaa tee tec tea 


159 


Phe Leu 


Leu He 
-10 


Thr 


He 


Ala 


Leu 
-5 


Gly 


Thr 


Lys Thr Glu Ser Ser Ser 
1 




egg gga 


cct tac 


cac 


ccc 


tea 


gag 


tgc 


tgc 


ttc ace tac act acc tac 


207 


Arg Gly 


Pro Tyr 


His 


Pro 


Ser 


Glu 


Cys 


Cys 


Phe Thr Tyr Thr Thr Tyr 




5 






10 










15 20 




aag ate 


ccg egt 


cag 


awt 


tat 


gga 


tta 


eta 


tgagaecaac agccagtgct 


257 


Lys He 


Pro Arg 


Gin 
25 


Xaa 


Tyr 


Gly 


Leu 


Leu 
30 







ceaagccegg aattgtctte ateaccaaaa ggggccattc cgtctgtacc aaccccagtg 317 

aeaagtgggt ccaggactat ateaaggaca tgaaggagaa ctgagtgacc cagaaggggt 377 

ggcgaaggea cagctcagag acataaagag aagatgccaa ggccccctcc tccacccace 437 

getaaetctc agccceagte acectettgg agctteectg ctttgaatta aagaccactc 4 97 

atgctctcca aaaaaaaaaa aaaaaggtca aaagct 533 



<210> 384 
<211> 472 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 92.. 307 



<220> 

<221> sig_peptide 

<222> 92.. 217 

<223> Von Heijne matrix 

score 4.40000009536743 
seq LRLEVAAAPGTPA/QP 



<400> 384 

aaaagaaatt cccgggcccg gccagcctgt gccgctgtta tgaggagaag caaagcccct 60 
tgcagaagca cttgeggtet gcagcctacg c atg aat agg ttg gca ggt gtg 112 

Met Asn Arg Leu Ala Gly Val 
-40 

ggc tgg egg gtg gae tac ace ctg age tee age ctg etg caa tec gtg 160 
Gly Trp Arg Val Asp Tyr Thr Leu Ser Ser Ser Leu Leu Gin Ser Val 
-35 -30 -25 -20 

gaa gag ccc atg gtg cac etg egg etg gag gtg gca get gee cca ggg 208 
Glu Glu Pro Met Val His Leu Arg Leu Glu Val Ala Ala Ala Pro Gly 
-15 -10 -5 



wo 02/083898 PCT/IBO 1/009 14 

220 

acc cca gcc cag cct gtt gcc atg tec etc tea gca gac aag ttc cag 256 
Thr Pro Ala Gin Pro Val Ala Met Ser Leu Ser Ala Asp Lys Phe Gin 

1 5 10 

gtc etc ctg gca gaa ctg aag cag gcc cag acc ctg atg age tec ctg 304 
Val Leu Leu Ala Glu Leu Lys Gin Ala Gin Thr Leu Met Ser Ser Leu 

15 20 25 

ggc tgaggagaag ggtgttccag gcctgtgtgg agccgccctg cccgtatgga 357 
Gly 
30 

gtcacgccct ctgaactgct ettcgggagg cagcectggt tctaggatgc tgaggccctg 417 
gcccggactc tggcctccca gatccccage tgcetcactt etctcttgag aactt 472 

<210> 385 
<211> 514 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 113.. 364 



<220> 

<221> sig_peptide 

<222> 113.. 172 

<223> Von Heijne matrix 

score 4.30000019073486 
seq SLLLSLPPHQGLT/FS 



<400> 385 

ttttttacat ggtgttccca cagctgggag gacacccaca tggtcggcgt gcaggatatt 60 
tcgctggacc etagaaaagc caccacgacc tgtgggecat gatgctacce ca atg get 118 

Met Ala 
-20 



get 


get 


get 


gtt 


cct 


tct 


ett 


ett 


ett 


tct 


ett 


cct 


cct 


cac 


cag 


ggg 


166 


Ala 


Ala 


Ala 


Val 
-15 


Pro 


Ser 


Leu 


Leu 


Leu 
-10 


Ser 


Leu 


Pro 


Pro 


His 
-5 


Gin 


Gly 




etc 


act 


ttc 


tec 


aac 


aaa 


ata 


caa 


cct 


ttt 


gga 


get 


caa 


gga 


gtc 


ttg 


214 


Leu 


Thr 


Phe 
1 


Ser 


Asn 


Lys 


He 
5 


Gin 


Pro 


Phe 


Gly 


Ala 
10 


Gin 


Gly 


Val 


Leu 




cat 


ceg 


gaa 


cca 


gga 


ctg 


ega 


gac 


tgg 


ctg 


ctg 


cca 


acg 


tgc 


tec 


aga 


262 


His 


Pro 


Glu 


Pro 


Gly 


Leu 


Arg 


Asp 


Trp 


Leu 


Leu 


Pro 


Thr 


Cys 


Ser 


Arg 




15 










20 










25 










30 




caa 


ttg 


ega 


gtc 


gca 


ctg 


ceg 


gag 


aag 


ggg 


tec 


gag 


ggc 


agt 


ctg 


tgt • 


310 


Gin 


Leu 


Arg 


Val 


Ala 
35 


Leu 


Pro 


Glu 


Lys 


Gly 
40 


Ser 


Glu 


Gly 


Ser 


Leu 
45 


Cys 




caa 


aeg 


cag 


ctg 


cca 


get 


act 


cca 


tgc 


ttc 


ctg 


cct 


teg 


aat 


acg 


gtg 


358 


Gin 


Thr 


Gin 


Leu 

50 


Pro 


Ala 


Thr 


Pro 


Cys 
55 


Phe 


Leu 


Pro 


Ser 


Asn 

60 


Thr 


Val 





aga acg tgaagtcatg agetgctget aaggcatgtg gcaaeettga agagaaggtc ' 414 
Arg Thr 

aagagctace agccaccaaa agaatgecag eacttcetgt gtetttgctt tggatteatg 474 

agaaatatac gttcctattt gcttcaaaaa aaaaaaaaaw 514 

<210> 386 
<211> 508 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 83.. 244 



<220> 



wo 02/083898 



PCT/IBOl/00914 



221 

<221> sig_peptide 

<222> 83. .139 

<223> Von Heijne matrix 

score 4.19999980926514 

seq LLILSKVLDDTFQ/SV 



<400> 386 

actgctatag cgatataact cagatgttct caggacagca agtttacatg tgaatagaga 60 
gggagggacg tgttaggaag ac atg aaa ttc aat ttt gta ctt ttg ata ctt 112 

Met Lys Phe Asn Phe Val Leu Leu He Leu 
-15 -10 
tec aaa gtt eta gat gac act ttc caa agt gtc aag aaa tgg tta aat 160 
Ser Lys Val Leu Asp Asp Thr Phe Gin Ser Val Lys Lys Trp Leu Asn 

-5 15 
tat ttt cag ttt act tta aga aat ggt tta atg tgg cca ggt gcg gtg 208 
Tyr Phe Gin Phe Thr Leu Arg Asn Gly Leu Met Trp Pro Gly Ala Val 

10 15 20 

get cat gcc tgt aat ccc age act ggc tea ege ctg taateccagc 254 
Ala His Ala Cys Asn Pro Ser Thr Gly Ser Arg Leu 

25 30 35 

actttgggag tccgaggtgg gcagatcatg aggtcaggag atcgagacca tcctggctaa 314 
catggtgaaa ccccgtctct actaaaaata caaaaaaatt agccaggcat ggtggcgggc 374 
gcctgtagtc ccagetactc gggaggctga ggcaggagaa tggcgtgaac ctgggaggcg 434 
gagettacag tgagccgaga tcgcgceact gcactccagc ctgggcgaea gagcgagact 494 
ctgttcaaaa aaaa 508 



<210> 387 
<211> 505 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 55., 327 



<220> 

<221> sig_peptide 

<222> 55. .111 

<223> Von Heijne matrix 

score 4,19999980926514 
seq FVLLLSEIVSISA/LS 



<400> 387 

ttgtctttgg tagttttttt gcactaactt caggaaccag ctcatgatct cagg atg 57 

Met 



tat 


gga 


aaa 


ata 


ate 


ttt 


gta 


tta 


eta 


ttg 


tea 


gaa 


att 


gtg 


age 


ata 


105 


Tyr 


Gly 


Lys 


He 
-15 


He 


Phe 


Val 


Leu 


Leu 
-10 


Leu 


Ser 


Glu 


He 


Val 
-5 


Ser 


He 




tea 


gca 


tta 


agt 


ace 


act 


gag 


gtg 


gca 


atg 


eac 


act 


tea 


ace 


tet 


tet 


153 


Ser 


Ala 


Leu 
1 


Ser 


Thr 


Thr 


Glu 
5 


Val 


Ala 


Met 


His 


Thr 
10 


Ser 


Thr 


Ser 


Ser 




tea 


gtc 


aca 


aag 


agt 


tac 


ate 


tea 


tea 


cag 


aca 


aat 


gga 


gaa 


ayg 


gga 


201 


Ser 


Val 


Thr 


Lys 


Ser 


Tyr 


He 


Ser 


Ser 


Gin 


Thr 


Asn 


Gly 


Glu 


Xaa 


Gly 




15 










20 










25 










30 




caa 


ctt 


gtc 


cat 


egt 


ttc 


act 


gta 


cca 


get 


ect 


gta 


gtg 


ata 


ata 


etc 


249 


Gin 


Leu 


Val 


His 


Arg 
35 


Phe 


Thr 


Val 


Pro 


Ala 
40 


Pro 


Val 


Val 


He 


He 
45 


Leu 




att 


att 


ttg 


tgt 


gtg 


atg 


get 


ggt 


att 


att 


gga 


aeg 


ate 


etc 


tta 


att 


297 


He 


He 


Leu 


Cys 


Val 


Met 


Ala 


Gly 


He 


He 


Gly Thr 


He 


Leu 


Leu 


He 










50 










55 










60 








tet 


tac 


agt 


att 


ege 


ega 


ctg 


ata 


aag 


gca 


tgaggatgtg gcctgcatgc 


347 


Ser 


Tyr 


Ser 


He 


Arg 


Arg 


Leu 


He 


Lys 


Ala 

















65 70 
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tgcctgatct tgcctagaac cggctgcacc tgctgttctc ttgtttatgc aaactggctg 407 

cacctgctat tcctttgctt atgcccctac ccctggctat cctaattccc tgttctcctg 467 
cctcactatt actgtattct ctacttctaa ataaaaat 505 

<210> 388 
<211> 775 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 395.. 736 



<220> 

<221> sig_peptide 

<222> 395. .490 

<223> Von Heijne matrix 

score 4.19999980926514 
seq LLGSGLFVFSLTA/FN 



<400> 388 

atgatgggta acaggaccgg tgggstcccc aggaagtcct agagggggtc ggggtttggg 60 
tggacaagct ttcctcgtcc tctcccgaca gagctgacgt gtcctgggtt ccaccgggag 120 
cgggcatttc caccggacgg gagggttcgg ggtgtccggg gctggggaat acgtaggggt 180 
tgccgcgcgg tgtggggagt tggggcgtgt ggctgcagtc ccgggagttc ttggaggggg 240 
tcggcccacc gagcttccgg accggctgat ctgcccgtag cttgccggag gagggcggas 300 
tgactctccg tcccttctcc catcccctcc agtggtgggt acgggcacct cgctggcgct 360 
ctcctccctc ctgtccctgc tgctctttgc tggg atg cag atg tac age cgt cag 415 

Met Gin Met Tyr Ser Arg Gin 
-30 

etc acc ate cag ggc ggc 
Leu Thr lie Gin Gly Gly 
-15 

etc act gcc ttc aat aat 
Leu Thr Ala Phe Asn Asn 
1 

caa gea aag ate ttc cct 
Gin Ala Lys lie Phe Pro 
15 

ttt gca tct ggc etc ate 
Phe Ala Ser Gly Leu lie 
30 35 

ttc tee atg gtt ggt ctg 
Phe Ser Met Val Gly Leu 
50 

tac cag gca gca get cca 
Tyr Gin Ala Ala Ala Pro 

65 

age aag aag aga aac tgaccctgaa tgttcaataa 756 
Ser Lys Lys Arg Asn 
80 

agttgattct ttgtaaaaa 775 



ctg 


gcc 


tec 


ace 


gag 


tgg 


Leu 


Ala 


Ser 


Thr 


Glu 


Trp 


-25 










-20 


ggt 


etc 


ttc 


gtg 


ttc 


teg 


Gly 


Leu 


Phe 


Val 


Phe 
-5 


Ser 


gtc 


ttt 


ggc 


aaa 


gga 


ttc 


Val 


Phe 


Gly 


Lys 


Gly 


Phe 






10 








tgc 


etc 


ctg 


ttg 


get 


etc 


Cys 


Leu 


Leu 


Leu 


Ala 


Leu 




25 










gtc 


ace 


acc 


tgc 


ttc 


ate 


Val 


Thr 


Thr 


Cys 


Phe 


He 


40 










45 


aag 


ate 


tec 


tec 


acc 


ctg 


Lys 


He 


Ser 


Ser 


Thr 


Leu 










60 




gcc 


aag 


gtc 


aca 


ggc 


aag 


Ala 


Lys 


Val 


Thr 


Gly 


Lys 








75 







ctg 


ctt 


ggt 


teg 


4 63 


Leu 


Leu 


Gly 


Ser 










-10 




ctg 


gag 


aat 


ctt 


511 


Leu 


Glu 

5 


Asn 


Leu 




gag 


att 


etc 


ctg 


559 


Glu 


He 


Leu 


Leu 




20 










cac 


cga 


gtc 


tgt 


607 


His 


Arg 


Val 


Cys 




tac 


tac 


ate 


aac 


655 


Tyr 


Tyr 


He 


Asn 










55 




gtc 


etc 


aca 


cca 


703 


Val 


Leu 


Thr 


Pro 








70 







<210> 389 
<211> 466 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 166... 324 



<220> 



wo 02/083898 



PCT/IBOl/00914 



223 

<22l> sig_peptide 

<222> 166. .204 

<223> Von Heijne matrix 

score 4.09999990463257 

seq MAVAFVLSLGVAA/LY 

<400> 389 

attctgcgcc tgcgcgcggc tacagcacgg ttcgtttttc ctttagtcag gaaggacgtt 60 
ggtgttgagg ttagcatacg tatcaaggac agtaactacc atggctcccg aagttttgcc 120 
aaaacctcgg atgcgtggcc ttctggcagg cgtctgcgaa atcat atg get gta gca 177 

Met Ala Val Ala 
-10 



ttc 


gtg eta tec 


etg 


ggg gtt gca get 


ttg 


tat 


aag 


ttt 


cgt gtg 


get 


225 


Phe 


Val Leu Ser 


Leu 
-5 


Gly Val Ala Ala 


Leu 
1 


Tyr 


Lys 


Phe 


Arg Val 
5 


Ala 




gat 


caa aga aag 


aag 


gca tac gca gat 


ttc 


tac 


aga 


aac 


tac gat 


gtc 


273 


Asp Gin Arg Lys 


Lys 


Ala Tyr Ala Asp 


Phe 


Tyr 


Arg 


Asn 


Tyr Asp 


Val 






10 




15 








20 








atg 


aaa gat ttt 


gag 


gag atg agg aag 


get 


ggt 


ate 


ttt 


cag agt 


gta 


321 


Met 


Lys Asp Phe 


Glu 


Glu Met Arg Lys 


Ala 


Gly 


He 


Phe 


Gin Ser 


Val 






25 




30 






35 










aag 


taatcttgga atataaagaa tttcttcagg ttgaattacc 


tagaagtttg 




374 



Lys 
40 

tcactgactt gtgttcctga actatgacac atgaatatgt gggctaagaa atagttcctc 4 34 

ttgataaata aacaattaac aaatactttt ga 466 

<210> 390 
<211> 522 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 
<222> 94. .417 

<220> 

<221> sig_peptide 
<222> 94 . .213 
<223> Von Heijne matrix 
score 4 

seq VTAPSMVAPVTFA/SI 
<400> 390 

etgtaatetc tgcgaaagct tccaggttcg etcatcgtag gtcactggaa aaaattgtag 60 
cattgctaaa gagtatttca gagaaaattt tgc atg caa aaa ttt agg aag atg 114 

Met Gin Lys Phe Arg Lys Met 
-40 -35 



agt gaa 


act 


eat 


eat 


tct 


gta 


ate 


tct 


gtg 


aaa 


get 


tec 


agt 


ccc 


tgg 


162 


Ser Glu 


Thr 


His 
-30 


His 


Ser 


Val 


He 


Ser 
-25 


Val 


Lys 


Ala 


Ser 


Ser 
-20 


Pro 


Trp 




eta tct 


tct 


tea 


gtg 


act 


get 


cea 


tee 


atg 


gta 


gee 


cea 


gtc 


act 


ttt 


210 


Leu Ser 


Ser 
-15 


Ser 


Val 


Thr 


Ala 


Pro 
-10 


Ser 


Met 


Val 


Ala 


Pro 
-5 


Val 


Thr 


Phe 




gca tct 


att 


gta 


gaa 


gaa 


gaa 


eta 


caa 


caa 


gaa 


gca 


get 


ett 


att 


aga 


258 


Ala Ser 


He 


Val 


Glu 


Glu 


Glu 


Leu 


Gin 


Gin 


Glu 


Ala 


Ala 


Leu 


He 


Arg 




1 








5 










10 










15 




agt cga 


gaa 


aaa 


ccg 


ttg 


get 


etg 


att 


cag 


att 


gag 


gag 


cat 


gee 


ata 


306 


Ser Arg 


Glu 


Lys 


Pro 
20 


Leu 


Ala 


Leu 


He 


Gin 
25 


He 


Glu 


Glu 


His 


Ala 
30 


He 




caa gat 


tta 


ttg 


gtt 


ttc 


tat 


gag 


gca 


ttt 


ggc 


aac 


cct 


gaa 


gag 


ttt 


354 


Gin Asp 


Leu 


Leu 
35 


Val 


Phe 


Tyr 


Glu 


Ala 
40 


Phe 


Gly 


Asn 


Pro 


Glu 
45 


Glu 


Phe 
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gtc att gtt gaa agg aca ccg cag gga cca ctg gca gta cct atg tgg 402 
Val He Val Glu Arg Thr Pro Gin Gly Pro Leu Ala Val Pro Met Trp 

50 55 60 

aat aag cat gga tgc tagttcactg tggagttgag atgcatttta cataattatg 457 
Asn Lys His Gly Cys 
65 

agtttgttca tataaagaaa gctgtggaaa agagtcttag agatttgaat atcgtctaaa 517 
tagat 522 

<210> 391 
<2H> 409 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 9. .389 

<220> 

<221> sig_peptide 

<222> 9. ,149 

<223> Von Heijne matrix 

score 3.90000009536743 

seq LSSDLQQVGGASA/RI 

<220> 

<221> misc_feature 

<222> 8,403 

<223> n^^a, q, c or t 

<400> 391 

tgggagtn atg ttc acg cct mtg ama gtg aaa tam gcg tac tac gac act 50 
Met Phe Thr Pro Xaa Xaa Val Lys Xaa Ala Tyr Tyr Asp Thr 
-45 -40 -35 

gaa cgc ate gga gtt gac mtg ate atg aag acc tgm ttt age eee aac 98 
Glu Arg He Gly Val Asp Xaa He Met Lys Thr Xaa Phe Ser Pro Asn 

-30 -25 -20 

aga gtg att gga etc tea agt gac ttg eag caa gta gga ggg gca tea 146 
Arg Val He Gly Leu Ser Ser Asp Leu Gin Gin Val Gly Gly Ala Ser 

-15 -10 . -5 

get cgc ate mag gat gee ctg agt aca gtg ttg caa tat gca gag gat 194 
Ala Arg He Xaa Asp Ala Leu Ser Thr Val Leu Gin Tyr Ala Glu Asp 

15 10 15 

gta ctg tct gga aag gtg tea get gac aat act gtg ggc cgc ttc ctg 242 
Val Leu Ser Gly Lys Val Ser Ala Asp Asn Thr Val Gly Arg Phe Leu 

20 25 30 

atg age ctg gtt aac caa gta ccg aaa ata gtt ccc gat gac ttt gag 290 
Met Ser Leu Val Asn Gin Val Pro Lys He Val Pro Asp Asp Phe Glu 

35 40 45 

acc atg etc aac age aac ate aat gac mtt ttg atg gtg acc tac mtg 338 
Thr Met Leu Asn Ser Asn He Asn Asp Xaa Leu Met Val Thr Tyr Xaa 

50 55 60 

gcc aac etc aca cag tea cag att gca etc aat gaa aaa ctt gta aac 386 
Ala Asn Leu Thr Gin Ser Gin He Ala Leu Asn Glu Lys Leu Val Asn 

65 70 75 

ctg tgaatggacc ceangcagta 409 
Leu 
80 

<210> 392 
<211> 431 
<212> DNA 
<213> Homo sapiens 



wo 02/083898 



225 



PCT/IBOl/00914 



<220> 
<221> CDS 
<222> 60.. 314 



<220> 

<221> sig_pepticle 

<222> 60. .155 

<223> Von Heijne matrix 

score 3.79999995231628 
seq LAHIQALISGIEA/QL 



<220> 

<221> misc_feature 
<222> 381 

<223> n==a, g, c or t 



<400> 392 



cataagaaca gggaggttag aagtagggtc ttgagattga 


gctgeagtea eagctgagc 


59 


atg aaa 


get 


gcc 


ttg 


gaa 


gac 


aca 


ctg 


gea 


gaa 


aeg 


gag 


gcg 


cgc 


ttt 


107 


Met Lys 


Ala 
-30 


Ala 


Leu 


Glu 


Asp 


Thr 
-25 


Leu 


Ala 


Glu 


Thr 


Glu 
-20 


Ala 


Arg 


Phe 




gga gcc 


cag 


ctg 


gcg 


cat 


ate 


cag 


gcg 


ctg 


ate 


age 


ggt 


att 


gaa 


gcc 


155 


Gly Ala 


Gin 


Leu 


Ala 


His 


He 


Gin 


Ala 


Leu 


He 


Ser 


Gly 


He 


Glu 


Ala 




-15 










-10 










-5 












cag ctg 


ggc 


gat 


gtg 


ega 


get 


gat 


agt 


gag 


egg 


cag 


aat 


cag 


gag 


tac 


203 


Gin Leu 


Gly 


Asp 


Val 


Arg 


Ala 


Asp 


Ser 


Glu 


Arg 


Gin 


Asn 


Gin 


Glu 


Tyr 




1 






5 










10 










15 






cag egg 


etc 


atg 


gac 


ate 


aag 


teg 


egg 


ctg 


gag 


cag 


gag 


att 


gee 


ace 


251 


Gin Arg 


Leu 


Met 
20 


Asp 


He 


Lys 


Ser 


Arg 
25 


Leu 


Glu 


Gin 


Glu 


He 
30 


Ala 


Thr 




tac cgc 


age 


ctg 


etc 


gag 


gga 


cag 


gaa 


gat 


cac 


tac 


aac 


aat 


ttg 


tct 


299 


Tyr Arg 


Ser 
35 


Leu 


Leu 


Glu 


Gly 


Gin 
40 


Glu 


Asp 


His 


Tyr 


Asn 
45 


Asn 


Leu 


Ser 





gcc tec aag gte etc tgaggcagea ggetctgggg ettctgetgt cetttggagg 354 
Ala Ser Lys Val Leu 
50 

gtgtcttctg ggtagaggga tgggaangaa gggaccctta cccccggctc ttctcctgac 414 
ctgccaataa aaattta 431 



<210> 393 
<211> 551 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 232. .534 



<220> 

<221> sig_peptide 

<222> 232. .306 

<223> Von Heijne matrix 

score 3.70000004768372 
seq AKTCLVLCSRVLS/VI 



<400> 393 

tatcaetgtt aegaaccaag gatttacaga tcaetggcaa aaattctgag aactttcaea 60 
ccagtatact gtccaagccc attaagtggc ateacacctc tcttttatgt agctcagaca 120 
agacagtcta atatcttcaa aatactactg caatatggaa tcttagaaag agaaaaaaac 180 
cctatcaaca ttgtcttaac aatagtactc tacccttcga gagtaagagt a atg gtt 237 

Met Val 
-25 



wo 02/083898 PCT/IBO 1/009 14 
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gat 


cgt 


gaa 


ttg 


get 


gac 


ate 


cat 


gaa gat 


gee 


aaa 


aca 


tgt 


ttg 


gta 


285 


Asp Arg 


Glu 


Leu 


Ala 


Asp 


He 


His 


Glu Asp 


Ala 


Lys 


Thr 


Cys 


Leu 


Val 










-20 










-15 








-10 








eta 


tgt 


tec 


aga 


gtg 


ett 


tct 


gtc 


att tea 


gtc 


aag 


gaa 


ata 


aag 


aca 


333 


Leu 


Cvs 


Ser 
-5 


Arg 


Val 


Leu 


Ser 


Val 
1 


He Ser 


Val 


Lys Glu 


He 


Lys 


Thr 




cag 


ctq 


agt 


tta 


QQa 


aga 


cat 


cea 


att att 


tea 


aat 


tgg 


ttt 


gat 


tac 


381 


Gin 


Leu 


Ser 


Leu 


Gly 


Arg 


His 


Pro 


He He 


Ser Asn Trp 


Phe Asp 


Tyr 




10 










15 








20 










25 




att 


cct 


tea 


aca 


aga 


tac 


aaa 


gat 


eca tgt 


gaa 


eta 


tta 


cat 


ctt 


tgc 


429 


He 


Pro 


Ser 


Thr 


Arg 


Tyr 


Lys 


Asp 


Pro Cys 


Glu 


Leu 


Leu 


His 


Leu Cys 












30 








35 










40 






aga 


eta 


acc 


ate 


agg 


aat 


caa 


eta 


tta acc 


aac 


aat 


atg 


etc 


cea 


gat 


477 


Arg 


Leu 


Thr 


He 


Arg 


Asn 


Gin 


Leu 


Leu Thr 


Asn 


Asn 


Met 


Leu 


Pro 


Asp 










45 










50 








55 








gga 


ata 


ttt 


tea 


ctt 


eta 


att 


cct 


get cgt 


eta 


caa 


aac 


tat 


ctg 


aat 


525 


Gly lie 


Phe 


Ser 


Leu 


Leu 


He 


Pro Ala Arg 


Leu 


Gin 


Asn 


Tyr 


Leu 


Asn 





60 65 70 

tta gaa ate taacatacgt cagtgte 551 

Leu Glu He 



75 

<210> 394 
<211> 529 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 78.. 281 

<220> 

<221> sig_peptide 

<222> 78.. 218 

<223> Von Heijne matrix 

score 3.70000004768372 

seq PLLSLHSRGGSSS/ES 

<400> 394 

ttttttgcga acggcgagea gcggegegcg cggagagacg agcggaggtt ttcctggttt 60 
cggaccccag eggcegg atg gtg aaa tec tee ctg cag egg ate etc aat 110 
Met Val Lys Ser Ser Leu Gin Arg He Leu Asn 
-45 -40 



age 


cac 


tgc 


ttc 


gee 


aga 


gag 


aag 


gaa 


ggg gat 


aaa 


ecc 


age gee 


ace 


158 


Ser 


His 
-35 


Cys 


Phe 


Ala 


Arg 


Glu 
-30 


Lys 


Glu 


Gly Asp 


Lys 
-25 


Pro 


Ser Ala 


Thr 




ate 


cac 


gee 


age 


egc 


ace 


atg 


ccg 


etc 


eta age 


ctg 


cac 


age cgc 


ggc 


206 


He 


His 


Ala 


Ser 


Arg 


Thr 


Met 


Pro 


Leu 


Leu Ser 


Leu 


His 


Ser Arg Gly 




-20 










-15 








-10 








-5 




ggc 


age 


age 


agt 


gag 


agt 


tec 


agg 


gtc 


tec etc 


cac 


tgc 


tgt agt 


aac 


254 


Gly 


Ser 


Ser 


Ser 


Glu 
1 


Ser 


Ser 


Arg 


Val 
5 


Ser Leu 


His 


Cys 


Cys Ser 
10 


Asn 




ccg 


ggt 


ccg 


ggg 


cct 


egg 


tgg 


tgc 


tec 


tgatgcceet cacecacecc 




301 


Pro Gly 


Pro 


Gly 


Pro 


Arg 


Trp Cys 


Ser 


















15 










20 

















tgaagatece aggtgggcga gggaatagtc agagggatea caatctttea getaaettat 361 

tetactccga tgatcggctg aatgtaacag aggaactaac gtccaacgae aagacgagga 421 

ttctcaaegt ecagtccagg etcacagacg ceaaacgeat taactggega acagtgctga 481 

gtggcggeas tcctaeatcg agatcecggg eggcgcgctg ccgagggg 529 



<210> 395 
<211> 491 
<212> DNA 



wo 02/083898 
<213> Homo sapiens 



227 



PCT/IBOl/00914 



<220> 
<221> CDS 
<222> 80. .283 

<220> 

<221> sig_peptide 

<222> 80. .220 

<223> Von Heijne matrix 

score 3.70000004768372 

seq PLLSLHSRGGSSS/ES 

<400> 395 

aktttgcgaa cggcgagcag cggcggcggc gcggagagac gcagcggagg ttttcctggt 60 
ttcggacccc agcggccgg atg gtg aaa tec tee etg cag egg ate etc aat 112 

Met Val Lys Ser Ser Leu Gin Arg He Leu Asn 
-45 -40 



age 


cac 


tgc 


ttc 


gcc 


aga 


gag 


aag 


gaa 


ggg gat 


aaa 


ccc 


age 


gcc 


acc 


160 


Ser 


His 
-35 


Cys 


Phe 


Ala 


Arg 


Glu 
-30 


Lys 


Glu 


Gly Asp 


Lys 
-25 


Pro 


Ser 


Ala 


Thr 




ate 


cac 


gcc 


age 


cgc 


ace 


atg 


ccg 


etc 


eta age 


etg 


cac 


age 


cgc 


ggc 


208 


He 


His 


Ala 


Ser 


Arg 


Thr 


Met 


Pro 


Leu 


Leu Ser 


Leu 


His 


Ser 


Arg 


Gly 




-20 










-15 








-10 










-5 




ggc 


age 


age 


agt 


gag 


agt 


tec 


agg 


gtc 


tee etc 


cac 


tgs 


tgt 


agt 


aac 


256 


Gly Ser 


Ser 


Ser 


Glu 


Ser 


Ser 


Arg Val 


Ser Leu 


His 


Xaa 


Cys 


Ser 


Asn 












1 








5 








10 








ceg 


ggt 


ccg 


ggg 


cct 


egg 


tgg 


tgc 


tec 


tgatgcccct cacecacccc 




303 


Pro 


Gly 


Pro 


Gly 


Pro 


Arg 


Trp 


Cys 


Ser 

















15 20 



tgaagatcce aggtgggcga gggaatagtc agagggatea caatctttca gctaacttat 363 

tctactcega tgatcggctg aatgtaaeag aggaactaac gtseaacgac aagacgagga 423 

ttetcaacgt tcagtccagg etcacaggaa cgccaaacgc attaactgcc gaacagtgct 483 

gagtggcg 491 

<210> 396 
<211> 503 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 9. .395 

<220> 

<221> sig_peptide 
<222> 9.. 56 

<223> Von Heijne matrix 

score 3.59999990463257 
seq LLLLLRALRRGPG/PG 

<400> 396 

aggecaac atg gcc gtg etg etg etg etg etc cgt gee etc cgc egg ggt 50 
Met Ala Val Leu Leu Leu Leu Leu Arg Ala Leu Arg Arg Gly 
-15 -10 -5 

eca ggc ceg ggt cct egg ccg etg tgg ggc eca ggc ccg gee tgg agt 98 
Pro Gly Pro Gly Pro Arg Pro Leu Trp Gly Pro Gly Pro Ala Trp Ser 

15 10 
eca ggg ttc ccc gcc agg ccc ggg agg ggg egg ccg tae atg gee age 14 6 

Pro Gly Phe Pro Ala Arg Pro Gly Arg Gly Arg Pro Tyr Met Ala Ser 
15 20 25 30 

agg cct ccg ggg gae etc gcc gag get gga ggc cga get etg cag age 194 
Arg Pro Pro Gly Asp Leu Ala Glu Ala Gly Gly Arg Ala Leu Gin Ser 
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35 










40 










45 






tta 


caa 


ttg 


aga 


ctg 


eta 


acc 


cct 


acc 


ttt 


gaa 


ggg 


ate 


aac 


gga 


ttg 


242 


Leu 


Gin 


Leu 


Arg 
50 


Leu 


Leu 


Thr 


Pro 


Thr 
55 


Phe 


Glu 


Gly 


lie 


Asn 
60 


Gly 


Leu 




ttg 


ttg 


aaa 


caa 


cat 


tta 


gtt 


cag 


aat 


cca 


gtc 


aga 


etc 


tgg 


caa 


ctt 


290 


Leu 


Leu 


Lys 
65 


Gin 


His 


Leu 


Val 


Gin 
70 


Asn 


Pro 


Val 


Arg 


Leu 
75 


Trp 


Gin 


Leu 




tta 


ggt 


ggt 


act 


ttc 


tat 


ttt 


aac 


acc 


tea 


agg 


ttg 


aag 


cag 


aag 


aat 


338 


Leu Gly 


Gly 


Thr 


Phe 


Tyr 


Phe 


Asn 


Thr 


Ser 


Arg 


Leu 


Lys 


Gin 


Lys 


Asn 






80 










85 










90 












aag 


gag 


aag 


gat 


aag 


teg 


aag 


ggg 


aag 


gcg 


cct 


gaa 


gag 


gae 


gaa 


ggt 


386 


Lys 


Glu 


Lys 


Asp 


Lys 


Ser 


Lys 


Gly 


Lys 


Ala 


Pro 


Glu 


Glu 


Asp 


Glu 


Gly 




95 










100 










105 










110 





ata ttc ate tgatgttctt cagtcagtag ctgcctctgg atgtctttac 435 
lie Phe lie 

atttctgttt wccttttagc aaggtgaaac cagtctggae aatggggaga tgggccgggt 4 95 

gcagtggc 503 



<210> 397 
<211> 517 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 93. .419 



<220> 

<221> sig_peptide 

<222> 93. .191 

<223> Von Heijne matrix 

score 3.59999990463257 
seq LAVFQMLKSMCAG/QR 



<400> 397 

gaaaaaaagc gaaggccggc cgggcgggga agggaaatgg cgaggcagga gtgcggggga 60 
gggastggtc cttagctgaa tgcgcctgcg tt atg gcg gee tec ggc gee cca 113 

Met Ala Ala Ser Gly Ala Pro 
-30 



agg 


ate 


ctg 


gtg 


gae 


ctg 


ctg 


aag 


ctg 


aac 


gtg gcc 


ecc 


etc 


gee 


gtc 


161 


Arg 


He 


Leu 


Val 


Asp 


Leu 


Leu 


Lys 


Leu 


Asn 


Val Ala 


Pro 


Leu 


Ala 


Val 






-25 










-20 








-15 












ttc 


cag 


atg 


etc 


aag 


tec 


atg 


tgt 


gee 


ggg 


cag agg 


eta 


gcg 


age 


gag 


209 


Phe 


Gin 


Met 


Leu 


Lys 


Ser 


Met 


Cys 


Ala 


Gly 


Gin Arg 


Leu 


Ala 


Ser 


Glu 




-10 










-5 










1 






5 






ecc 


cag 


gae 


cct 


gcg 


gee 


gtg 


tct 


ctg 


ecc 


aeg teg 


age 


gtg 


ecc 


grg 


257 


Pro 


Gin 


Asp 


Pro 


Ala 


Ala 


Val 


Ser 


Leu 


Pro 


Thr Ser 


Ser 


Val 


Pro 


Xaa 










10 










15 








20 








acc 


cga 


ggg 


aga 


aac 


aaa 


ggc 


age 


get 


gee 


etc ggg 


gga 


gca 


ttg 


gee 


305 


Thr Arg 


Gly 


Arg 


Asn 


Lys 


Gly 


Ser 


Ala 


Ala 


Leu Gly 


Gly 


Ala 


Leu 


Ala 








25 










30 








35 










ctg 


gcg 


gaa 


esm 


rrc 


age 


cgc 


gaa 


gga 


tec 


age cag 


agg 


atg 


cca 


cgc 


353 


Leu 


Ala 


Glu 


Xaa 


Xaa 


Ser 


Arg 


Glu 


Gly 


Ser 


Ser Gin 


Arg 


Met 


Pro 


Arg 






40 










45 








50 












cag 


ecc 


age 


get 


ace 


agg 


ctg 


ecc 


aag 


ggg 


ggc ggg 


cct 


ggg 


aag 


age 


401 


Gin 


Pro 


Ser 


Ala 


Thr 


Arg 


Leu 


Pro 


Lys 


Gly 


Gly Gly 


Pro 


Gly 


Lys 


Ser 




55 










60 










65 








70 





cct aca egg ggc age ace taggatgggg cagagacttg ttgcatettt 449 
Pro Thr Arg Gly Ser Thr 
75 

gtceeeagea aaggctaeat gttacetect teaattgata ataaaecttt etgagatgag 509 
agggteca 517 



wo 02/083898 



<210> 398 
<211> 507 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 78. .386 

<220> 

<221> sig_peptide 
<222> 78. .185 
<223> Von Heijne matrix 

score 3.59999990463257 

seq RALSTFLFGSIRG/AA 

<400> 398 

aagtgcgcag acgcaggggt cggcgccggg tgagagcgtg cggccgggta agggcgtgtg 60 
gccggattca ccacaac atg gca aat ctt ttt ata agg aaa atg gtg aac 110 
Met Ala Asn Leu Phe lie Arg Lys Met Val Asn 

-35 -30 



cct 


ctg 


etc 


tat 


etc 


agt 


cgt 


cac 


acg 


gtg 


aag 


cct 


cga 


gee 


etc 


tec 


158 


Pro 


Leu 


Leu 


Tyr 


Leu 


Ser 


Arg 


His 


Thr 


Val 


Lys 


Pro 


Arg 


Ala 


Leu 


Ser 




-25 










-20 










-15 










-10 




aca 


ttt 


eta 


ttt 


gga 


tec 


att 


ega 


ggt 


gca 


gee 


ccc 


gtg 


get 


gtg 


gaa 


206 


Thr 


Phe 


Leu 


Phe 


Gly 
-5 


Ser 


lie 


Arg 


Gly 


Ala 
1 


Ala 


Pro 


Val 


Ala 
5 


Val 


Glu 




ccc 


ggg 


gca 


gca 


gtg 


cge 


tea 


ctt 


etc 


tea 


ccc 


ggc 


etc 


ctg 


ccc 


eat 


254 


Pro Gly 


Ala 


Ala 


Val 


Arg 


Ser 


Leu 


Leu 


Ser 


Pro 


Gly 


Leu 


Leu 


Pro 


His 








10 










15 










20 










ctg 


ctg 


cct 


gcg 


ctg 


ggg 


tte 


aaa 


aac 


aag 


act 


gtc 


ctt 


aag 


aag 


cge 


302 


Leu 


Leu 


Pro 


Ala 


Leu 


Gly 


Phe 


Lys 


Asn 


Lys 


Thr 


Val 


Leu 


Lys 


Lys 


Arg 






25 










30 










35 












tgc 


aag 


gac 


tgt 


tac 


ctg 


gtg 


aag 


agg 


egg 


ggt 


egg 


tgg 


tac 


gtc 


tac 


350 


Cys 


Lys 


Asp 


Cys 


Tyr 


Leu 


Val 


Lys 


Arg 


Arg 


Gly 


Arg 


Trp 


Tyr 


Val 


Tyr 




40 










45 










50 










55 




tgt 


aaa 


acc 


cat 


ccg 


agg 


cac 


aag 


cag 


aga 


cag 


atg 


tagacccttt 




396 


Cys 


Lys 


Thr 


His 


Pro 


Arg 


His 


Lys 


Gin 


Arg 


Gin 


Met 




















60 










65 

















ccctccagag tcacgcacat actcgtcatc gcatcacttg ggagaatggt tgtatcttat 456 
ggaaggaatt atcacatcaa ggagtcaggg gamagtgact gggaagcaaa c 507 



<210> 399 
<211> 493 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 58. .315 

<220> 

<221> sigjpeptide 

<222> 58.. 165 

<223> Von Heijne matrix 

score 3.59999990463257 

seq SVSLSGFTVGTLS/ET 

<400> 399 

ctgtgggaga ggecaggatg etgcattagg cacaggataa cctgggaacc caggcac 57 

atg ggt cct get etc cga agt ctg caa gtc aag aag gga aca gag cac 105 

Met Gly Pro Ala Leu Arg Ser Leu Gin Val Lys Lys Gly Thr Glu His 

-35 -30 -25 



PCT/IBOl/00914 

229 
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230 



gcc 


gac 


cct 


etc 


cct 


ttc 


ccc tct gtc tct 


ctt 


agt 


ggc 


ttt 


aca 


gtg 


153 


Ala 


Asp 


Pro 


Leu 


Pro 


Phe 


Pro Ser Val Ser 


Leu 


Ser Gly 


Phe 


Thr 


Val 




-20 










-15 




-10 










-5 




ggt 


acc 


ctg 


tea 


gaa 


acc 


age act ggg ggc 


cct 


gcc 


acc 


ccc 


aca 


tgg 


201 


Gly Thr 


Leu 


Ser 


Glu 


Thr Ser Thr Gly Gly 


Pro 


Ala 


Thr 


Pro 


Thr 


Trp 












1 




5 








10 








aag 


gag 


tgt 


cct 


ate 


tgt 


aag gag cgc ttt 


cct 


get 


gag 


agt 


gac 


aag 


249 


Lys 


Glu 


Cys 


Pro 


He 


Cys 


Lys Glu Arg Phe 


Pro 


Ala 


Glu 


Ser Asp 


Lys 








15 








20 






25 










gat 


gcc 


ctg 


gag 


gac 


cac 


atg gat gga cac 


ttc 


ttt 


ttc 


age 


acc 


cag 


297 


Asp Ala 


Leu 


Glu 


Asp 


His Met Asp Gly His 


Phe 


Phe 


Phe 


Ser 


Thr 


Gin 






30 










35 




40 












gac 


ccc 


ttc 


acc 


ttt 


gag 


tgatcttact ccctcgtaca tgcacaaata 




345 


Asp 


Pro 


Phe 


Thr 


Phe 


Glu 


















45 










50 



















cacactcatg cacacacaca ctcacacaca tgcatacact taggtttcat gcccattttc 405 
tatcacactg ggctccatga tattctgttc cctaagaact gcttctgtgt gccctgtttt 4 65 
catcccaaga tttctcactt catcctct 493 

<210> 400 
<211> 769 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 506.. 661 

<220> 

<221> sig_pepticie 
<222> 506.. 595 
<223> Von Heijne matrix 
score 3.5 

seq VKLLLKMVSSATT/ER 

<400> 400 

cttggctctt aaagccaggt tgagggttat gtgtttttat tcttttawat ctccttttca 60 
cttaggcttg gcaactagtt cattgaaaca gagccaacaa gaatgagtga gtcactcagt 120 
gatatgaagc aagctgtggt catttatcta ctaccttcca tacctacttc ctaggtttga 180 
taagtagtgt ttgtagtgta gttgtcagta acttttctca gcttacataa gtttatgtgg 240 
aattcattta caaatgtgtg attattacga tagatgttat tttataaatt tcaaggttct 300 
ttaatacagc atggacatgt ttctaaatca ataaatatag cttamccaca gactttttat 360 
gggatgtaaa gtatgtatta caacaggtaa atccatttct ctgttttggg aaagcaggtt 420 
aatgttttgt ttgttttgtt gctgcatata atgttgcact atatatatat kctaatgcag 480 
ttggtctcct aatyctacag gataa atg tct age tgt ggc att gtt gga tea 532 

Met Ser Ser Cys Gly He Val Gly Ser 

-30 -25 
tea gtt teg ttt cag tta gat get gtg aaa ttg etc ttg aaa atg gtg 580 
Ser Val Ser Phe Gin Leu Asp Ala Val Lys Leu Leu Leu Lys Met Val 

-20 -15 -10 

tec tct gcc acc aca gaa egg tgt tgt aat gga agt gcc aat ttt cat 628 
Ser Ser Ala Thr Thr Glu Arg Cys Cys Asn Gly Ser Ala Asn Phe His 
-5 15 10 

aaa aac ttg tgt gca aca ggt att aag aat ttt tgaggtcggg cgcagtagct 681 
Lys Asn Leu Cys Ala Thr Gly He Lys Asn Phe 

15 20 
eacgectgta atcccagcac tttgggagge etaggeaggt ggatcacctg aggtcaggag 741 
ttccagacea geetgaceaa catggtga 769 

<210> 401 
<211> 646 
<212> DNA 
<213> Homo sapiens 
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231 



<220> 
<221> CDS 
<222> 76. .621 



<400> 401 

attacgagat tggcttggat tctgtcggat ggacttgggg ctagctgcgg cggggctgga 60 
ggaggccaga taacc atg tea gcc aca gtt gta gat gca gtt aat get gca 111 
Met Ser Ala Thr Val Val Asp Ala Val Asn Ala Ala 
15 10 



ccc eta 


tea 


aaa 


tec 


aaa 


gaa 


atg agt ttg 


qaa 


qaa 


cea 


aag 


aag 


atg 


159 


Pro Leu 


Ser 


Gly 


Ser 


Lys 


Glu 


Met 


Ser Leu 


Glu 


Glu 


Pro 


Lys 


Lys 


Met 






15 










20 








25 










ace aga 


aaa 


aac 

'k^ vt w 


tgg 


aga 


aag 


aag 


aag gag 


eta 


gaa 


gaa 


cag 


cga 


aaa 


207 


Thr Arg 


Glu 


ASD 


Trp Arg 


Lys 


Lys 


Lys Glu 


Leu 


Glu 


Glu 


Gin 


Arg 


Lys 




30 










35 








40 












ttcr oac 


aat 


get 


ect 


gca 


gaa 


gtt 


gat gaa 


gaa 


aaa 


aaa 


aac 


ate 


aac 


255 


Leu Gly 


Asn 


Ala 


Pro 


Ala 


Glu 


Val 


Asp Glu 


Glu 


Gly 


Lvs 


Asp 


He 


Asn 




45 








50 








55 










60 




ccc cat 


att 


cet 


cag 


tat 


att 


tct 


tea gtg 


cea 


too 


tat 


att 


aat 


ect 


303 


Pro His 


He 


Pro 


Gin 


Tyr 


He 


Ser 


Ser Val 


Pro 


Trp 


Tyr 


He 


Asp 


Pro 










65 








70 










75 






tea aaa 


aga 


cet 


act 


tta 


aaa 


eac 


cag aga 


cea 


caa 


eca 


gaa 


aaa 


caa 


351 


Ser Lys 


Arg 


Pro 


Thr 


Leu 


Lys 


His Gin Arg 


Pro 


Gin 


Pro 


Glu 


Lys 


Gin 








80 










85 








90 








aag cag 


ttc 


age 


tea 


tct 


gga 


gaa 


tgg tac 


aag 


agg 


ggt 


gta 


aaa 


gag 


399 


Lys Gin 


Phe 


Ser 


Ser 


Ser 


Gly Glu 


Trp Tyr 


Lys 


Arg 


Gly 


Val 


Lys 


Glu 






95 










100 








105 










aat tec 


ata 


att 


act 


aag 


tac 


cge 


aaa gga 


gca 


tgt 


gaa 


aat 


tgt 


ggg 


447 


Asn Ser 


He 


He 


Thr 


Lys 


Tyr Arg Lys Gly Ala 


Cys 


Glu 


Asn 


Cys 


Gly 




110 










115 








120 












gcc atg 


aca 


eac 


aaa 


aag 


aaa 


gae 


tge ttt 


gag 


aga 


cet 


agg 


cga 


gtt 


4 95 


Ala Met 


Thr 


His 


Lys 


Lys 


Lys 


Asp Cys Phe 


Glu 


Arg 


Pro 


Arg 


Arg 


Val 




125 








130 








135 










140 




gga gee 


aaa 


ttt 


aca 


ggt 


act 


aat 


ata get 


cea 


gat 


gaa 


cat 


gtc 


cag 


543 


Gly Ala 


Lys 


Phe 


Thr 


Gly 


Thr 


Asn 


He Ala 


Pro 


Asp 


Glu 


His 


Val 


Gin 










145 








150 










155 






cet caa 


ctg 


atg 


ttt 


gae 


tat 


gat 


ggg aag 


agg 


gat 


egg 


tgg 


aat 


ggc 


591 


Pro Gin 


Leu 


Met 


Phe 


Asp 


Tyr Asp 


Gly Lys 


Arg 


Asp 


Arg 


Trp 


Asn 


Gly 





160 165 170 



tac aat cea gaa gaa eac atg aaa att gtt tgaagagtat gcaaagtgat 641 

Tyr Asn Pro Glu Glu His Met Lys He Val 

175 180 
ttggc 646 



<210> 402 
<211> 484 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 29, .469 



<400> 402 

aettggcgag tgagacgctg atgggagg atg gae gta ctg gtg tct gag tge 52 

Met Asp Val Leu Val Ser Glu Cys 
1 5 

tec gcg egg etg ctg cag cag gaa gaa gag att aaa tct ctg act get 100 
Ser Ala Arg Leu Leu Gin Gin Glu Glu Glu He Lys Ser Leu Thr Ala 

10 15 20 

gaa att gae egg ttg aaa aac tgt ggc tgt tta gga get tct cea aat 148 
Glu He Asp Arg Leu Lys Asn Cys Gly Cys Leu Gly Ala Ser Pro Asn 



wo 02/083898 

25 

ttg gag cag tta caa 
Leu Glu Gin Leu Gin 
45 

att ctt cga aag agt 
lie Leu Arg Lys Ser 
60 

atg att aac att att 
Met He Asn He He 
75 

aag get gca tat cca 
Lys Ala Ala Tyr Pro 

90 

agt cag cag gcc aag 
Ser Gin Gin Ala Lys 
105 

att tot cag gtg atg 
He Ser Gin Val Met 
125 

aag tat tat tat cat 
Lys Tyr Tyr Tyr His 
140 



30 

gaa gaa aat tta 
Glu Glu Asn Leu 

ctt cag gca gaa 
Leu Gin Ala Glu 
65 

age cgc eta caa 
Ser Arg Leu Gin 
80 

gat ttg gaa aat 
Asp Leu Glu Asn 

95 

ttt ggg gac tat 
Phe Gly Asp Tyr 
110 

tat tgt cat gac 
Tyr Cys His Asp 

cat tge cat tta 
His Cys His Leu 
145 



232 





35 








aaa 


tta 


aag 


tat 


cga 


Lys 


Leu 


Lys 


Tyr 


Arg 


50 












aac 


aaa 


cca 


act 


Arg 


Asn 


Lys 


Pro 


Thr 










70 


gag 


gtc 


ttt 


ggt 


eat 


Glu 


Val 


Phe 


Gly 


His 








85 




cct 


cct 


ctg 


eta 


gtg 


Pro 


Pro 


Leu 


Leu 


Val 






100 






cag 


tgt 


aat 


agt 


get 


Gin 


Cys 


Asn 


Ser 


Ala 




115 








tct 


tgg 


ctg 


ttt 


gat 


Ser 


Trp 


Leu 


Phe 


Asp 



130 

cag aaa taatactatt 
Gin Lys 



PCT/IBOl/00914 

40 

ctg aat 196 

Leu Asn 

55 

aaa aat 244 
Lys Asn 

gca att 292 
Ala He 

aca cca 340 
Thr Pro 

atg ggt 388 
Met Gly 
120 

ttt ttt 436 

Phe Phe 

135 

icaag 484 



<210> 403 
<211> 345 
<212> DNA 
<213> Homo sapiens 



<400> 403 

ctttetccag etcaagcgte tggtatttgg tgtagggaea gegettttte egggtggagc 60 

gagegtggat ecagttcgcg geggggttgt ttgggtcaag ttgctgctgc tgctgcggct 120 

gcgaatgctg cttctcctce tccttcagcg aacagcctgg gatcgggtgg tcgctgcaag 180 

ctgagggctc cgacgagccg cccgtcccag tcgcggcccc gatccctgcc ccaggcccgg 240 

ttccecccgt egcegctgec gecttetcet geaggaaega gttgcacatg ttactgcttc 300 

etactaataa atgetgaeet gateaaatgg agcecaaaar aaaaa 345 



<210> 404 
<211> 432 
<212> DNA 
<213> Homo sapiens 



<400> 404 

aceaggactc 

aactggggga 

agcgtttcaa 

catctcgggg 

caagcatctc 

ecceacccca 

atcctttcat 

aaaaaaaaaa 



caaaatggcg 
getgecaage 
agaggttaet 
attaeeatgg 
aagcacgagc 
cgaccttggc 
atectaatag 
ag 



tcagttgtac 
tggatettga 
aecggtacta 
tgctggcatg 
ggctccgcaa 
cegagcccct 
gaattaacct 



<210> 405 
<211> 428 
<212> DNA 
<213> Homo sapiens 



cagtgaagga 
tgcgggactt 
eaacaagtae 
ctacgtgctc 
ataccactga 
ccgtgagaac 
ccaaataaaa 



caagaaactt 
cagtctagtg 
atcaatgtga 
tttagctact 
agaggacaca 
acaatcteaa 
catgactggt 



ctggaggtea 
gcattttcgg 
agaaggggag 
ccttttccta 
ctctgcaccc 
tcgttgctga 
aegyraaaaa 



60 
120 
180 
240 
300 
360 
420 
432 



<400> 405 

actcegcett ccacgtgcac ccactgcctc ttcecttete gettgggaae tetagtctcg 60 

cctcgggttg eaatggaccc caactgctcc tgtgccgetg caggtgtctc ctgeacctgc 120 

gccagctect gcaagtgcaa agagtgcaaa tgcacctcct gcaagaagag ctgctgctcc 180 

tgctgccctg tgggctgtgc caagtgtgcc cagggctgca tctgcaaagg ggcatcggag 240 

aagtgcagct gctgcgcctg atgtcgggac agccctgctc ccaagtacaa atagagtgac 300 

ccgtaaaatc caggattttt tgttttttgc tacaatcttg acccctttgc tacattcctt 360 
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233 

tttttctgtg aaatatgtga ataataatta aacacttaga caaaaaaaaa aaaaaaaggt 420 
caaaagct 428 



<210> 406 
<211> 391 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 196.. 390 



<220> 

<221> sig_peptide 

<222> 196, .279 

<223> Von Heijne matrix 

score 5.90000009536743 

seq IILLLLKIWKSRS/CA 

<400> 406 

ctcttcccgg ctccagctcc gccgccagct ccagcctttg ctccccctcc caaagtcccc 60 
tccccggagc ggagcgcacc tagggtccct cttccgtccc cccagcccag ctacccgttc 120 
agaccagcag cctcgggggg cacccccccg ccagcctgcc tccctcccgc tcagccctgc 180 
cagggttccc cagcc atg aat etc ttc cga ttc ctg gga gac etc tec cac 231 
Met Asn Leu Phe Arg Phe Leu Giy Asp Leu Ser His 
-25 -20 
ttg eta ctg etc aaa ate tgg aag tec egc teg 279 
Leu Leu Leu Leu Lys lie Trp Lys Ser Arg Ser 

-10 -5 
cag ett ect etc tec ttc tgt ctg tet gte tgt 327 
Gin Leu Pro Leu Ser Phe Cys Leu Ser Val Cys 

10 15 
tet etc tet gte tet etc tet etc tct ttc tet 375 
Ser Leu Ser Val Ser Leu Ser Leu Ser Phe Ser 

25 30 
t 391 

I Lys Lys 

35 



etc 


etc 


gee 


ate 


ate 


Leu 


Leu 


Ala 


He 


He 




-15 








tgc 


gee 


gee 


cac 


ecc 


Cys 


Ala 


Ala 


His 


Pro 


1 








5 


ctg 


tct 


gte 


tet 


etc 


Leu 


Ser 


Val 


Ser 


Leu 








20 




gte 


tea 


aaa 


aaa 


aaa 


Val 


Ser 


Lys 


Lys 


Lys 



<210> 407 
<211> 500 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 81. .500 



<220> 

<221> sig_peptide 

<222> 81. .200 

<223> Von Heijne matrix 

score 5.19999980926514 
seq LLMRALHLLKSGC/SP 



<400> 407 

aaccccaccg cctaactctc ccaactctge agtccgtgae cttcaggtgc ccttcaaggt 60 
cagaaaaact cggcaagaat atg gtt tct agt ttt agg gtt tet gaa eta caa 113 

Met Val Ser Ser Phe Arg Val Ser Glu Leu Gin 
-40 -35 -30 

gta tta eta gge ttt get gga egg aat aaa agt gga cgc aag cat gac 161 
Val Leu Leu Gly Phe Ala Gly Arg Asn Lys Ser Gly Arg Lys His Asp 
-25 -20 -15 
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etc 


ctg 


atg 


agg 


gcg 


ctg 


cat 


tta 


ttg 


Leu 


Leu 


Met 


Arg Ala 


Leu 


His 


Leu 


Leu 








-10 










-5 


gtt 


cag 


att 


aaa 


ate 


cga 


gaa 


ttg 


tat 


Val 


Gin 


He Lys 


He Arg Glu 


Leu Tyr 




5 










10 






ctt 


gaa 


gga 


ctt 


tct 


gat 


tta 


tec 


aca 


Leu 


Glu 


Gly Leu Ser Asp Leu 


Ser 


Thr 


20 










25 








ttg 


gat 


ggt 


ggc 


tea 


tea 


cct 


gta 


gaa 


Leu 


Asp 


Gly Gly Ser Ser 
A n 


Pro 


Val 


Glu 


ate 


cac 


teg 


ttg 


cct 


tec 


act 


tea 


gtt 


He 


His 


Ser 


Leu 


Pro 


Ser 


Thr 


Ser 


Val 








55 










60 


cct 


gtt 


ggt 


tct 


gtg 


ctg 


ctt 


eaa 


gat 


Pro 


Val 


Gly 


Ser 


Val 


Leu 


Leu 


Gin 


Asp 






70 










75 




cag 


cag 


cca 


tct 


ccc 


cca 


att 


cct 


cct 


Gin 


Gin 


Pro 


Ser 


Pro 


Pro 


He 


Pro 


Pro 



85 90 



aaa 
Lys 
100 
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aag age ggc tgc age cct gcg 209 
Lys Ser Gly Cys Ser Pro Ala 
1 

aga cgc cga tat cca cga act 257 
Arg Arg Arg Tyr Pro Arg Thr 
15 

ate aaa tea teg gtt ttc agt 305 
He Lys Ser Ser Val Phe Ser 

30 35 
cct gae ttg gee gtg get gga 353 
Pro Asp Leu Ala Val Ala Gly 
45 50 
aca cct cac tea cca tec tct 401 
Thr Pro His Ser Pro Ser Ser 
65 

act aag ccc aca ttt gag atg 449 
Thr Lys Pro Thr Phe Glu Met 
80 

gtc eat cct gat gtg cag tta 4 97 

Val His Pro Asp Val Gin Leu 
95 

500 



<210> 408 
<211> 497 
<212> DMA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 35.. 496 



<400> 408 

tcetaatcga aawctcttet gaggatecgg 



aag 


aag 


egg 


acc 


ttc 


cgc 


aag 


ttc 


ace 


Lys 


Lys 


Arg 


Thr 


Phe 


Arg 


Lys 


Phe 


Thr 






10 










15 




cag 


ctg 


ctg 


gae 


atg 


tee 


tac 


gag 


cag 


Gin 


Leu 


Leu 


Asp 


Met 


Ser 


Tyr 


Glu 


Gin 




25 










30 






cgc 


cag 


gcg 


gcg 


get 


gaa 


ccg 


ggg 


cct 


Arg 


Gin 


Ala 


Ala 


Ala 


Glu 


Pro 


Gly 


Pro 


40 










45 








get 


gaa 


gcg 


cct 


gcg 


eaa 


ggc 


caa 


gaa 


Ala 


Glu 


Ala 


Pro 


Ala 


Gin 


Gly 


Gin 


Glu 










60 










gee 


gga 


agt 


ggt 


gaa 


gae 


gea 


cct 


gcg 


Ala Gly 


Ser 


Gly 


Glu 


Asp 


Ala 


Pro 


Ala 








75 










80 


gat 


ggt 


ggg 


cag 


cat 


ggt 


ggg 


cgt 


eta 


Asp Gly 


Gly 


Gin 


His 


Gly 


Gly 


Arg 


Leu 






90 










95 




ggt 


gga 


gat 


eaa 


gee 


cga 


gat 


gat 


egg 


Gly Gly 


Asp 


Gin 


Ala 


Arg 


Asp 


Asp 


Arg 




105 










110 






eat 


cac 


eta 


eaa 


gee 


cgt 


aaa 


gea 


tgg 


His 


His 


Leu 


Gin 


Ala 


Arg 


Lys 


Ala 


Trp 


120 










125 








ema 


mte 


etc 


csg 


ctt 


cat 


ccm 


tct 


eaa 



atg gea gaa gta gag cag aag 55 
Met Ala Glu Val Glu Gin Lys 



1 








5 








tac 


cgc 


ggc 


gtg 


gae 


etc 


gae 


103 


Tyr 


Arg 


Gly 


Val 

20 


Asp 


Leu 


Asp 




ctg 


atg 


cag 


ctg 


tac 


agt 


gcg 


151 


Leu 


Met 


Gin 
35 


Leu 


Tyr 


Ser 


Ala 




gcg 


gcg 


gaa 


gea 


gea 


etc 


cct 


199 


Ala 


Ala 
50 


Glu 


Ala 


Ala 


Leu 


Pro 
55 




gga 


ggc 


gee 


gee 


cat 


gga 


gaa 


247 


Gly 


Gly 


Ala 


Ala 


His 


Gly 


Glu 




65 










70 






gga 


eat 


gat 


eat 


cct 


acc 


cga 


295 


Gly 


His 


Asp 


His 


Pro 
85 


Thr 


Arg 




caa 


egg 


caa 


gae 


ctt 


caa 


cca 


343 


Gin 


Arg 


Gin 


Asp 
100 


Leu 


Gin 


Pro 




cca 


eta 


cct 


ggg 


cga 


gtt 


etc 


391 


Pro 


Leu 


Pro 
115 


Gly 


Arg 


Val 


Leu 




ccg 


gee 


egg 


cat 


egg 


ggc 


mmc 


439 


Pro 


Ala 
130 


Arg 


His 


Arg 


Gly 


Xaa 
135 




gta 


atg 


get 


cag 


eta 


ata 


aag 


487 
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Xaa Xaa Leu Xaa Leu His Pro Ser Gin Val Met Ala Gin Leu He Lys 

140 145 150 

gcg cac atg a 4 97 

Ala His Met 

<210> 409 
<211> 304 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 110.. 304 

<220> 

<221> misc_f eature 
<222> 296. .297 
<223> n=a, g, c or t 

<400> 409 

acgtacggca gggcggggtc ggcccgcagt ggccagctgg gagctacgag cgcggasttg 60 
cgcagaagac ccccatcagg gtgcggggtg cagttgcggc tccagggcc atg gcg gag 118 

Met Ala Glu 

1 



gag 


cag 


ggc 


egg gaa egg gac teg gtt 


ccc aag 


ccg 


teg 


gtg 


ctg 


ttc 


166 


Glu 


Gin 


Gly 


Arg Glu Arg Asp Ser Val 


Pro Lys 


Pro 


Ser 


Val 


Leu 


Phe 






5 




10 




15 












etc 


cac 


cca 


gac ctg ggc gtg ggc ggc 


get gag 


egg 


ctg 


gtg 


etc 


ttg 


214 


Leu 


His 


Pro 


Asp Leu Gly Val Gly Gly Ala Glu Arg 


Leu 


Val 


Leu 


Leu 




20 






25 


30 










35 




cct 


ttt 


ccc 


act gag aga agg ctg etc 


ttt tgt 


act 


gee 


ccc 


cgc 


tea 


262 


Pro 


Phe 


Pro 


Thr Glu Arg Arg Leu Leu 


Phe Cys 


Thr 


Ala 


Pro 


Arg 


Ser 










40 


45 








50 






tta 


aac 


age 


etc eee eta aaa aaa aaa 


aaa att 


nna 


aaa 


age 






304 


Leu 


Asn 


Ser 


Leu Pro Leu Lys Lys Lys 


Lys He 


Xaa 


Lys 


Ser 









55 60 65 

<210> 410 
<211> 553 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 155,. 553 

<40P> 410 

aatagtgagc ggcagaeaga ggggtggeca gtttgttgtt ttacctaaaa ggtgttttcc 60 
acatggagtg ggactgagag gcacaggtgg ctgaggggac ccgcctggga cgtgaggcgc 120 
aggtcctcgg ccccttcate tgtcagctge agca atg gaa tac gtg ctg gaa gtg 175 

Met Glu Tyr Val Leu Glu Val 



aag 


aac 


tct ccg egg 


cac 


etc 


ctg aag 


1 

caa 


ttc 


aca 


gtg 


5 

tgt 


gac 


gtt 


223 


Lys 


Asn 


Ser Pro Arg 


His 


Leu 


Leu Lys 


Gin 


Phe 


Thr 


Val 


Cys 


Asp Val 








10 






15 








20 










cct 


ctg 


tat gac att 


tgt 


gac 


tac aac 


gtc 


tee 


aga 


gac 


cga 


tgc 


cag 


271 


Pro 


Leu 


Tyr Asp He 


Cys 


Asp 


Tyr Asn 


Val 


Ser 


Arg 


Asp 


Arg 


Cys 


Gin 






25 






30 








35 












gag 


etc 


ggg tgc tgc 


ttc 


tac 


gaa ggc 


gtc 


tgc 


tac 


aag 


aaa 


gcg 


gtt 


319 


Glu 


Leu 


Gly Cys Cys 


Phe 


Tyr Glu Gly 


Val 


Cys 


Tyr 


Lys 


Lys 


Ala 


Val 




40 






45 








50 










55 




ccc 


att 


tac ate cac 


gtg 


ttc 


tct gee 


ttg 


att 


gtg 


ate 


ata 


get 


ggg 


367 


Pro 


He 


Tyr He His 


Val 


Phe 


Ser Ala 


Leu 


He 


Val 


He 


He 


Ala 


Gly 
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60 










65 










70 






gcc 


ttc 


gtc ate 


acc 


ate 


ate 


tac 


aga 


gtc 


att 


cag 


gag 


age 


agg 


aaa 


415 


Ala 


Phe 


Val He 


Thr 


lie 


He 


Tyr 


Arg 


Val 


He 


Gin 


Glu 


Ser 


Arg 


Lys 








75 










80 










85 








gaa 


aag 


gcc ate 


ect 


gtg 


tat 


gtc 


gcg 


ctg 


eca 


cag 


aag 


tee 


age 


gaa 


463 


Glu 


Lys 


Ala He 


Pro 


Val 


Tyr 


Val 


Ala 


Leu 


Pro 


Gin 


Lys 


Ser 


Ser 


Glu 








90 








95 










100 










aag 


gcg 


gag ttg 


gcc 


tea 


tec 


age 


age 


aag 


tta 


ggg 


ctg 


aag 


ctg 


cga 


511 


Lys 


Ala 


Glu Leu 


Ala 


Ser 


Ser 


Ser 


Ser 


Lys 


Leu 


Gly 


Leu 


Lys 


Leu 


Arg 






105 








110 










115 












gtc 


ctg 


ggc etc 


aaa 


gtg 


ctg 


ggc 


etc 


gat 


gag 


agt 


gac 


gag 






553 


Val 


Leu 


Gly Leu Lys 


Val 


Leu 


Gly 


Leu 


Asp 


Glu 


Ser 


Asp 


Glu 









120 125 130 

<210> 411 
<211> 470 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 23.. 469 



<400> 411 

gggttcccgt teccegcgga sc atg egg tac aae gag aag gag ctg cag get 52 

Met Arg Tyr Asn Glu Lys Glu Leu Gin Ala 















1 








5 










10 




ctg 


tee 


egg 


cag 


ccg 


gee 


gag 


atg 


gcg 


gcc 


gag 


ctg 


ggc 


atg 


agg 


ggc 


100 


Leu 


Ser 


Arg Gin 


Pro 


Ala 


Glu 


Met 


Ala 


Ala 


Glu 


Leu 


Gly 


Met 


Arg 


Gly 












15 










20 










25 






ccc 


aag 


aag 


ggc 


age 


gtg 


ctg 


aag 


egg 


egg 


ctg 


gtg 


aag 


ctg 


gtg 


gtg 


148 


Pro 


Lys 


Lys 


Gly Ser 


Val 


Leu 


Lys 


Arg 


Arg 


Leu 


Val 


Lys 


Leu 


Val 


Val 










30 










35 










40 








aat 


ttc 


etc 


ttc 


tac 


ttt 


egg 


aca 


gac 


gag 


gee 


gag 


ccc 


gtc 


gga 


gcc 


196 


Asn 


Phe 


Leu 
45 


Phe 


Tyr 


Phe 


Arg 


Thr 
50 


Asp 


Glu 


Ala 


Glu 


Pro 
55 


Val 


Gly 


Ala 




ctg 


ctg 


ctg 


gag 


egc 


tge 


aga 


gtc 


gtc 


egg 


gaa 


gag 


ccc 


ggc 


ace 


ttc 


244 


Leu 


Leu 


Leu 


Glu Arg 


Cys Arg 


Val 


Val 


Arg 


Glu 


Glu 


Pro 


Gly 


Thr 


Phe 






60 










65 










70 












tec 


ate 


age 


ttc 


att 


gag 


gac 


ect 


gag 


agg 


aag 


tat 


cac. 


ttt 


gag 


tge 


292 


Ser 


He 


Ser 


Phe 


He 


Glu Asp 


Pro 


Glu 


Arg 


Lys 


Tyr 


His 


Phe 


Glu 


Cys 




75 










80 










85 










90 




age 


age 


gag 


gag 


cag 


tgt 


cag 


gag 


tgg 


atg 


gag 


get 


ctg 


cgt 


egg 


gee 


340 


Ser 


Ser 


Glu 


Glu 


Gin 
95 


Cys 


Gin 


Glu 


Trp 


Met 
100 


Glu 


Ala 


Leu 


Arg 


Arg 
105 


Ala 




age 


tac 


gag 


ttc 


atg 


egg 


aga 


age 


etc 


ate 


ttc 


tac 


agg 


aae 


gaa 


ate 


388 


Ser 


Tyr 


Glu 


Phe 


Met 


Arg Arg 


Ser 


Leu 


He 


Phe 


Tyr 


Arg 


Asn 


Glu 


He 










110 










115 










120 








egg 


aag 


gtg 


acg 


ggc 


aag 


gac 


ccc 


ctg 


gaa 


cag 


ttc 


ggc 


ata 


tec 


gag 


436 


Arg 


Lys 


Val 


Thr Gly 


Lys 


Asp 


Pro 


Leu 


Glu 


Gin 


Phe 


Gly 


He 


Ser 


Glu 








125 










130 










135 










gag 


gcc 


agg 


ttc 


cag 


ctg 


agt 


ggc 


ttg 


cag 


gcg 


t 










470 


Glu 


Ala 
140 


Arg 


Phe 


Gin 


Leu 


Ser 
145 


Gly 


Leu 


Gin 


Ala 















<210> 412 
<211> 486 
<212> DNA 
<213> Homo sapiens 

<22Q> 
<221> CDS 
<222> 211. .486 



wo 02/083898 
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<400> 412 

agattcttaa ctacgtgtct tacagcaaag ttgcccatgg gaacaaaggc tgtagggtgt 60 
cattgatctt gtaatacaac acatatttga tgggtgaatc ttattagcac tcctttgaaa 120 
atgttccagt atggagtcag ttgctgataa agaggaagtt cacagctagg aatccgaagt 180 
taagatttct caattgccgc ctgggcactc atg cct gta gtc cca gca ctt ggg 234 

Met Pro Val Val Pro Ala Leu Gly 

1 5 



agg cca 


agg 


tgg 


gca 


gat 


cac 


ctg 


agg 


teg gga gtt 


egg 


gac 


cag 


cct 


282 


Arg Pro 


Arg 


Trp 


Ala Asp 


His 


Leu Arg Ser Gly Val Arg Asp Gin Pro 




10 










15 






20 












ggc caa 


cct 


ggt 


gaa 


gcc 


ccc 


cca 


tct 


eta eta aaa 


ata 


caa 


aaa 


ttg 


330 


Gly Gin 


Pro 


Gly 


Glu 


Ala 


Pro 


Pro 


Ser 


Leu Leu Lys 


He 


Gin 


Lys 


Leu 




25 








30 








35 








40 




gcc ggg 


tat 


ggt 


ggt 


ggc 


tgc 


ctg 


tgg 


tec cag eta 


etc 


ggg 


agg 


ctg 


378 


Ala Gly 


Tyr 


Gly 


Gly Gly 


Cys 


Leu 


Trp 


Ser Gin Leu 


Leu Gly Arg 


Leu 










45 










50 






55 






agg egg 


gag 


aat 


cac 


ttg 


age 


ccg 


gga 


ggc gga ggt 


tgc 


agt 


gag 


ccg 


426 


Arg Arg 


Glu 


Asn 


His 


Leu 


Ser 


Pro Gly Gly Gly Gly Cys 


Ser 


Glu 


Pro 








60 










65 






70 








aga ttg 


tgc 


cac 


tgc 


act 


cca 


gcc 


tgg 


gtg aca gag 


caa 


gac 


tec 


ate 


474 


Arg Leu 


Cys 
75 


His 


Cys 


Thr 


Pro 


Ala 
80 


Trp 


Val Thr Glu 


Gin 
85 


Asp 


Ser 


He 




tea aaa 


ata 


gaa 






















486 


Ser Lys 


He 


Glu 

























90 



<210> 413 
<211> 335 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 73. .252 



<220> 

<221> sig_peptide 

<222> 73.. 135 

<223> Von Heijne matrix 

score 4.30000019073486 
seq AGVSCTCAXPASA/KS 



<400> 413 

actccgcctt ccaegtgeac ecactgeete ttecettcte gettgggaac tctagtctcg 60 
ectcgggttg ca atg gac cce aac tgc tec tgt gee get ggt gtc tee tgc 111 
Met Asp Pro Asn Cys Ser Cys Ala Ala Gly Val Ser Cys 
-20 -15 -10 

acc tgc gcc ast cct gca agt gca aag agt gca aat gca cot cct gca 159 
Thr Cys Ala Xaa Pro Ala Ser Ala Lys Ser Ala Asn Ala Pro Pro Ala 

'5 1 5 

aga aga get get get cct ggt gca ctg gee cca ccc teg tgg aca cct 207 
Arg Arg Ala Ala Ala Pro Gly Ala Leu Ala Pro Pro Ser Trp Thr Pro 

10 15 20 

gcc etg cce tgc cac ctg tct gtc tgt ccc aaa gaa gtt ctg gta 252 
Ala Leu Pro Cys His Leu Ser Val Cys Pro Lys Glu Val Leu Val 
25 30 35 

tgaaettgag gaeaeatgte eagtgggagg tgagaceace teteaatatt caataaaget 312 
gctgagaatc tagcetegaa aaa 335 



<210> 414 
<211> 494 
<212> DNA 



wo 02/083898 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 63.-422 

<220> 

<22l> sig_peptide 
<222> 63.. 167 
<223> Von Heijne matrix 

score 4.09999990463257 

seq LSQSLAMSPQTHS/QT 

<400> 414 

cacggatctc acaggtggag gtggggcgag cttctttcta gtgtggacca gaggtggggg 60 
ca atg agt aaa cat ctt cac tgt aag gtt ttg gga ctg ccc etc agg 107 



Met Ser Lys His Leu His Cys Lys Val Leu Gly Leu Pro Leu Arg 





■35 










•30 










-25 












ctt 


gag 


gtc 


teg 


gca 


teg 


tgt 


etc 


agt 


cag 


tec 


ttg 


gcg 


atg 


tec 


cct 


155 


Leu 


Glu 


Val 


Ser 


Ala 


Ser 


Cys 


Leu 


Ser 


Gin 


Ser 


Leu 


Ala 


Met 


Ser 


Pro 




-20 










-15 










-10 










-5 




cag 


acg 


cac 


tea 


cag 


act 


tgc 


ata 


cgt 


aat 


tta 


gta 


aca 


tgc 


ate 


aac 


203 


Gin 


Thr 


His 


Ser 


Gin 
1 


Thr 


Cys 


He 


Arg 
5 


Asn 


Leu 


Val 


Thr 


Cys 
10 


He 


Asn 




tat 


cct 


aga 


aca 


tct 


aca 


ggg 


tgc 


aaa 


gga 


acc 


ace 


act 


cag 


aga 


ate 


251 


Tyr 


Pro 


Arg 
15 


Thr 


Ser 


Thr 


Gly 


Cys 
20 


Lys 


Gly 


Thr 


Thr 


Thr 

25 


Gin 


Arg 


He 




atg 


gag 


cca 


gtg 


gag 


tta 


gaa 


gtg 


gaa 


ggg 


aca 


gaa 


caa 


gac 


aat 


get 


299 


Met 


Glu 
30 


Pro 


Val 


Glu 


Leu 


Glu 
35 


Val 


Glu 


Gly 


Thr 


Glu 
40 


Gin 


Asp 


Asn 


Ala 




aaa 


acc 


tgt 


ggt 


tea 


eta 


gga 


agg 


ggg 


aat 


gaa 


aac 


acc 


atg 


etc 


ega 


347 


Lys 


Thr 


Cys 


Gly 


Ser 


Leu 


Gly 


Arg 


Gly 


Asn 


Glu 


Asn 


Thr 


Met 


Leu Arg 




45 










50 










55 










60 




ggt 


gga 


ttc 


age 


atg 


aac 


aca 


act 


gtg 


ggg 


caa 


gga 


att 


tec 


aag 


caa 


395 


Gly 


Gly 


Phe 


Ser 


Met 

65 


Asn 


Thr 


Thr 


Val 


Gly 

70 


Gin 


Gly 


He 


Ser 


Lys 
75 


Gin 




aca 


cac 


cac 


act 


agt 


acc 


act 


tct 


tec 


taaacagaaa agggaaagag 




442 


Thr 


His 


His 


Thr 


Ser 


Thr 


Thr 


Ser 


Ser 



















80 85 
eatecttega cttcccccte ceteeateag caecgggttt tcctceaaaa aa 4 94 



<210> 415 
<211> 366 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 112. .306 

<220> 

<221> sig peptide 

<222> 1127.153 

<223> Von Heijne matrix 

score 3.70000004768372 

seq LCWLKLVSITTVA/II 

<400> 415 

cctttctcgt tecccggcea tcttatcggc tgctgttggt tgggggccgt cccgetccta 60 
aggcaggaag atggtggccg caaagaagac ggaaatctga aatagagtae t atg eta 117 

Met Leu 

tgt tgg eta aaa ctg gtg tec ate act aca gtg gca ata ata ttg aac 165 
Cys Trp Leu Lys Leu Val Ser He Thr Thr Val Ala He He Leu Asn 
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1 

213 



261 



306 

{ Pro 

40 45 50 

taaacctgca aaattttcct ttaataaaat ttgcttgttc taaaaaaaaa aaaaaaaaag 366 

<210> 416 
<211> 472 
<212> DNA 
<213> Homo sapiens 







-10 










-5 










1 








tgg 


gca 


cag 


cat 


gcg 


gaa 


aat 


act 


aca 


gag 


tgt 


gea 


eac 


tgg 


eta 


tea 


Trp Ala 


Gin 


His 


Ala 


Glu 


Asn 


Thr 


Thr 


Glu 


Cys 


Ala 


His 


Trp 


Leu 


Ser 


5 










10 










15 










20 


ttg 


ate 


cag gtg 


act 


ctg 


aca 


tea 


tta 


gaa 


gea 


tge 


cag 


aac 


aga 


ctg 


Leu 


lie 


Gin 


Val 


Thr 


Leu 


Thr 


Ser 


Leu 


Glu 


Ala 


Cys 


Gin Asn Arg 


Leu 










25 










30 










35 




gtg 


aaa 


agt 


aaa 


cct 


ttt 


cac 


eta 


caa 


aat 


ttc 


ace 


tgc 


aaa 


cct 




Val 


Lys 


Ser 


Lys 


Pro 


Phe 


His 


Leu 


Gin 


Asn 


Phe 


Thr 


Cys 


Lys 


Pro 





<220> 
<221> CDS 
<222> 115.. 411 



<400> 416 

aagaaagggg tgaggcctaa gggacaatea ggatgttttt cagagagaag tgtggatgct 60 
ggacaggaag aaceaeagat accagatacg ggtactgttg taactetgtt etcc atg 117 

Met 
1 



aaa 


aaa 


aag 


gaa 


gaa 


aca 


aca 


ctt 


tea 


gag 


atg 


gag 


cct 


gtt 


gag 


cea 


165 


Lys 


Lys 


Lys 


Glu 
5 


Glu 


Thr 


Thr 


Leu 


Ser 
10 


Glu 


Met 


Glu 


Pro 


Val 
15 


Glu 


Pro 




cag 


tac 


caa 


eta 


gtc 


aat 


get 


gaa 


teg 


act 


tet 


ecc 


ttt 


eta 


eat 


tge 


213 


Gin 


Tyr 


Gin 
20 


Leu 


Val 


Asn 


Ala 


Glu 
25 


Ser 


Thr 


Ser 


Pro 


Phe 
30 


Leu 


His 


Cys 




ctg 


aga 


gaa 


gtc 


att 


ggg 


gaa 


tac 


tet 


gta 


eac 


gaa 


ttt 


tea 


ctg 


ttg 


261 


Leu Arg 


Glu 


Val 


He 


Gly 


Glu 


Tyr 


Ser 


Val 


His 


Glu 


Phe 


Ser 


Leu 


Leu 






35 










40 










45 












ggg 


aaa 


aca 


gag 


agt 


caa 


ggg 


att 


gga 


ttg 


tgg 


att 


gea 


ttg 


gtg 


gtt 


309 


Gly 


Lys 


Thr 


Glu 


Ser 


Gin 


Gly 


He 


Gly 


Leu 


Trp 


He 


Ala 


Leu 


Val 


Val 




50 










55 










60 










65 




ttc 


etc 


agt 


ttc 


etc 


ate 


ttc 


tec 


aca 


agt 


ttc 


tac 


ata 


teg 


aat 


gea 


357 


Phe 


Leu 


Ser 


Phe 


Leu 
70 


He 


Phe 


Ser 


Thr 


Ser 
75 


Phe 


Tyr 


He 


Ser 


Asn 
80 


Ala 




gag 


cag 


ecc 


ttc 


ttc 


aaa 


gaa 


cct 


cct 


a eg 


gaa 


get 


get 


aag 


gaa 


etc 


405 


Glu 


Gin 


Pro 


Phe 


Phe 


Lys 


Glu 


Pro 


Pro 


Thr 


Glu 


Ala 


Ala 


Lys 


Glu 


Leu 





85 90 95 



agt ctg tagctctgcg tggagccatg tgtaaaeact gaaetgagac etgccaectc 461 
Ser Leu 

etactaecta a 472 



<210> 417 
<211> 532 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 125.. 508 



<400> 417 

agctrcctgg gtaaggceca agatggctgt ettcgectta gtaetcgtgt gaagttggcg 60 

gggacggttc ctgteatett ettgggetta tttggtgtge tgttgaaggg gggagaetag 120 

agaa atg gca ggg aac etc tta tee ggg gea ggt agg egc ctg tgg gae 169 

Met Ala Gly Asn Leu Leu Ser Gly Ala Gly Arg Arg Leu Trp Asp 
15 10 15 
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tgg 


gtg 


cct 


ctg 


gcg 


tgc 


aga 


age 


ttc 


tet 


ctt 


ggt 


gtg 


cct 


aga 


ttg 


217 


Trp 


Val 


Pro 


Leu 


Ala 
20 


Cys 


Arg 


Ser 


Phe 


Ser 
25 


Leu 


Gly 


Val 


Pro 


Arg 
30 


Leu 




ate 


ggt 


ata 


agg 


etc 


act 


etc 


ccg 


ccc 


ccc 


aaa 


gtg 


gtt 


gat 


cgt 


tgg 


265 


He 


Gly 


He 


Arg 

35 


Leu 


Thr 


Leu 


Pro 


Pro 
40 


Pro 


Lys 


Val 


Val 


Asp 

45 


Arg 


Trp 




aac 


gag 


aaa 


agg 


gcc 


atg 


ttc 


gga 


gtg 


tat 


gae 


aac 


ate 


ggg 


ate 


ctg 


313 


Asn 


Glu 


Lys 
50 


Arg 


Ala 


Met 


Phe 


Gly 
55 


Val 


Tyr 


Asp 


Asn 


He 
60 


Gly 


He 


Leu 




gga 


aac 


ttt 


gaa 


aag 


cac 


mcc 


aaa 


gaa 


ctg 


ate 


agg 


ggg 


ccc 


ata 


tgg 


361 


Gly Asn 


Phe 


Glu 


Lys 


His 


Xaa 


Lys 


Glu 


Leu 


He 


Arg 


Gly 


Pro 


He 


Trp 






65 










70 










75 












ctt 


cga 


ggt 


tgg 


aaa 


ggg 


aat 


gaa 


ttg 


caa 


cgt 


tgt 


ate 


cga 


aag 


agg 


409 


Leu 


Arg 


Gly 


Trp 


Lys 


Gly 


Asn 


Glu 


Leu 


Gin 


Arg 


Cys 


He 


Arg 


Lys 


Arg 




80 










85 










90 










95 




aaa 


atg 


gtt 


gga 


agt 


aga 


atg 


ttc 


get 


gat 


gac 


ctg 


cac 


aac 


ctt 


aat 


457 


Lys 


Met 


Val 


Gly 


Ser 
100 


Arg 


Met 


Phe 


Ala 


Asp 
105 


Asp 


Leu 


His 


Asn 


Leu 
110 


Asn 




aaa 


cgc 


ate 


cgc 


tat 


etc 


tac 


aaa 


cac 


ttt 


aac 


cga 


eat 


ggg 


aag 


ttt 


505 


Lys 


Arg 


He 


Arg 
115 


Tyr 


Leu 


Tyr 


Lys 


His 
120 


Phe 


Asn 


Arg 


His 


Gly 
125 


Lys 


Phe 





cga tagaagagaa agctgagaae tteg 532 

Arg 



<210> 418 
<211> 470 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 50. .205 



<400> 418 

actctttect tggctgctca ggtcataggt gctgtgatct agagaeaaa atg ttc ctt 58 

Met Phe Leu 

106 



154 



aca 


ctg 


gca 


gat act tgc 


aaa 


eta agg gga 


atg 


age 


ttc 


1 

ctt 


ctg 


aat 


Thr Leu Ala Asp Thr Cys 


Lys 


Leu Arg Gly Met 


Ser 


Phe 


Leu 


Leu 


Asn 




5 






10 






15 










gtt 


tat 


gaa 


gga gag gcc 


act 


gtg tea tet 


gtc 


tta 


gag 


eta 


ttg 


gaa 


Val 


Tyr 


Glu 


Gly Glu Ala 


Thr 


Val Ser Ser 


Val 


Leu 


Glu 


Leu 


Leu 


Glu 


20 






25 






30 










35 


tec 


tgg 


ate 


att gtg gga 


aat 


gaa aga tac 


ttc 


gat 


gga 


ate 


age 


age 


Ser 


Trp 


He 


He Val Gly Asn Glu Arg Tyr 


Phe 


Asp Gly 


He 


Ser 


Ser 








40 




45 










50 





202 



cat tgatceaatg ceaactccaa gaetggaacg tegeaatgat agttccaagg 255 

His 

cggaaatttg aegtaattet tttegaeaea gttttaeagg tetggatgee cattttaatt 315 

cttctgaaag catgcctyct ccttctggct tcaggactcc atctscagcc tcttgatcta 375 

aaaataatec ceaaaccaaa aaaaaattag atactattte cteaaaatta ggtattttaa 435 

tcaaaaacat cttaacataa atacattatt atcac 470 



<210> 419 
<211> 559 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 175.. 426 



<400> 419 
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actcggcggg cgctgttgag ggagtcgggc cgcgactgtg gtcgttttta taccttcccg 60 

cgcggacgcc ggcgctgcca acggaagggc gggtagggcg agacggagtt tcgtcatgtt 120 

ggccaggccc atttgagatc tttgaagata tcctcaacgt gaggctctgc tgcc atg 177 

Met 
1 



aag 


gtg 


aag 


att 


aag 


tgc 


tgg 


aac 


ggc 


gtg 


gcc 


act 


tgg 


etc 


tgg 


gtg 


225 


Lys 


Val 


Lys 


He 


Lys 


Cys 


Trp 


Asn 


Gly 


Val 


Ala 


Thr 


Trp 


Leu 


Trp 


Val 










5 










10 










15 








gcc 


aac 


gat 


gag 


aac 


tgt 


ggc 


ate 


tgc 


agg 


atg 


gca 


ttt 


aac 


gga 


tgc 


273 


Ala 


Asn 


Asp 


Glu 


Asn 


Cys 


Gly 


He 


Cys 


Arg 


Met 


Ala 


Phe 


Asn 


Gly 


Cys 








20 










25 










30 










tgc 


cct 


gac 


tgc 


aag 


gtg 


ccc 


ggc 


gac 


gac 


tgc 


ccg 


ctg 


gtg 


tgg 


ggc 


321 


Cys 


Pro 


Asp 


Cys 


Lys 


Val 


Pro 


Gly 


Asp 


Asp 


Cys 


Pro 


Leu 


val 


Trp 


Gly 






35 










40 










45 












cag 


tgc 


tec 


cac 


tgc 


ttc 


cac 


atg 


cat 


tgc 


ate 


etc 


aag 


tgg 


ctg 


cac 


369 


Gin 


Cys 


Ser 


His 


Cys 


Phe 


His 


Met 


His 


Cys 


He 


Leu 


Lys 


Trp 


Leu 


His 




50 










55 










60 










65 




gca 


cag 


cag 


gtg 


cag 


cag 


cac 


tgc 


ccc 


atg 


tgc 


cgc 


cag 


gaa 


tgg 


aag 


417 


Ala 


Gin 


Gin 


Val 


Gin 


Gin 


His 


Cys 


Pro 


Met 


Cys 


Arg 


Gin 


Glu 


Trp 


Lys 





70 75 80 



ttc aag gag tgaggcccga cctggctctc gctggagggg catcetgaga 466 
Phe Lys Glu 

ctccttcctc atgctggcgc cgatggctgc tggggacagc gcccctgagc tgcaacaagg 526 

tggawacaag ggctggagct gcgtttgttt tgc 559 

<210> 420 

<211> 462 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 29. .268 

<400> 420 

atccagcagc cccctgctcc ggcccagc atg gcg ace ccg acc cag acc ccc 52 

Met Ala Thr Pro Thr Gin Thr Pro 



aca 


aag 


get 


cct 


gag 


gaa 


cct 


gac 


1 

cca 


ttt 


tac 


5 

tat gac 


tac 


aac 


acg 


100 


Thr 


Lys 
10 


Ala 


Pro 


Glu 


Glu 


Pro 

15 


Asp 


Pro 


Phe 


Tyr 


Tyr Asp 
20 


Tyr 


Asn 


Thr 




gtg 


cag 


act 


gtg 


ggc 


atg 


act 


ctg 


gca 


acc 


ate 


ttg ttc 


ctg 


ctg 


ggt 


148 


Val 


Gin 


Thr 


Val 


Gly 


Met 


Thr 


Leu 


Ala 


Thr 


He 


Leu Phe 


Leu 


Leu 


Gly 




25 










30 










35 








40 




ate 


etc 


ate 


gte 


ate 


age 


aag 


aag 


gtg 


aag 


tgc 


agg aag 


gcg 


gac 


tec 


196 


He 


Leu 


He 


Val 


He 
45 


Ser 


Lys 


Lys 


Val 


Lys 
50 


Cys 


Arg Lys 


Ala 


Asp 
55 


Ser 




agg 


tct 


gag 


age 


cca 


acc 


tgc 


aaa 


tec 


tgt 


aag 


tct gag 


ctt 


ccc 


tct 


244 


Arg 


Ser 


Glu 


Ser 

60 


Pro 


Thr 


Cys 


Lys 


Ser 
65 


Cys 


Lys 


Ser Glu 


Leu 
70 


Pro 


Ser 




tea 


gcc 


cct 


ggt 


ggc 


ggc 


ggc 


gtg 


taacaccttc ccgaggaaac tccgctgccg 


298 


Ser 


Ala 


Pro 

75 


Gly 


Gly 


Gly 


Gly 


Val 
80 



















accctgcctg agcgcgggag cctgaggacc gggtggaggc ggtggggacc cagccgcgcg 358 
ccgggagcgc tccccggaat gagcgcccca cccaccccaa ggctggagcc gctgcaccct 418 
getgtccctc tccaggcctt ggcaatgacg atcccccaaa gage 4 62 



<210> 421 
<211> 547 
<212> DNA 
<213> Homo sapiens 



<220> 
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<22l> CDS 
<222> 99. .479 



<400> 421 

tttcgaaggg aagggagccc cctataaaac agcctacagt ggacagtctg gtcggcagag 60 
ccgcaggtca gtcgtgaaga gggagctcta ttgccacc atg agt ttc tec ggc aag 116 

Met Ser Phe Ser Gly Lys 



tac caa 


eta 


cao 


aac 
^y ^ 


caa 

way 


aaa 

yoo 


aac 


ttt 


aaa 


1 

ace 


ttc 


atq 


aaq 


5 

gea 


ate 


164 


Tyr Gin 


Leu 


Gin 
10 


Ser 


Gin 


Glu 


Asn 


Phe 
15 


Glu 


Ala 


Phe 


Met 


Lvs 
20 


Ala 


He 




crat eta 


ecg 


gaa 


y ay 


etc 


ate 


cag 


aag 


aaa 
y y y 


aag 


aat 

y» 


ate 


aaa 

a«sy 


ggg 


gtg 


212 


Gly Leu 


Pro 


Glu 


Glu 


Leu 


He 


Gin 


Lvs 


Gly 


Lvs 


Asp 


He 


Lys 


Gly Val 






25 










30 










35 










teg gaa 


ate 


crtcr 


cag 


aat 


noes 
y y y 


aag 


cac 


ttc 


aag 


ttc 


ace 


ate 


ace 


get 


260 


Ser Glu 


lie 


Val 


Gin 


Asn 


Glv 


Lvs 


His 


Phe 


Lys 


Phe 


Thr 


He 


Thr 


Ala 




40 










45 










50 












aaa tec 


aaa 


y 


ate 


caa 


aac 


gaa 


ttc 


acg 


gtg 


ggg 


gag 


gaa 


tgt 


gag 


308 


Gly Ser 


Lys 


Val 


He 


Gin 


Asn 


Glu 


Phe 


Thr 


Val 


Gly 


Glu 


Glu 


Cys 


Glu 




55 








60 










65 










70 




ctg gag 


aca 


atg 


aca 


ggg 


gag 


aaa 


gtc 


aag 


aca 


gtg 


gtt 


cag 


ttg 


gaa 


356 


Leu Glu 


Thr 


Met 


Thr 
75 


Gly 


Glu 


Lys 


Val 


Lys 
80 


Thr 


Val 


Val 


Gin 


Leu 
85 


Glu 




ggt gac 


aat 


aaa 


ctg 


gtg 


aca 


act 


ttc 


aaa 


aac 


ate 


aag 


tct 


gtg 


ace 


404 


Gly Asp 


Asn 


Lys 
90 


Leu 


Val 


Thr 


Thr 


Phe 
95 


Lys 


Asn 


He 


Lys 


Ser 
100 


Val 


Thr 




gaa etc 


aac 


ggc 


gac 


ata 


ate 


acc 


aat 


ace 


atg 


aca 


ttg 


ggt 


gac 


att 


452 


Glu Leu 


Asn 
105 


Gly 


Asp 


He 


He 


Thr 
110 


Asn 


Thr 


Met 


Thr 


Leu 
115 


Gly 


Asp 


He 




gtc ttc 


aag 


aga 


ate 


age 


aag 


aga 


att 


taaacaagtc tgeatttcat 




499 


Val Phe 


Lys 


Arg 


He 


Ser 


Lys 


Arg 


He 


















120 










125 























attattttag tgtgtaaaat taatgtaata aagtgaactt tgttttaa 547 



<210> 422 
<211> 669 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 29.. 259 



<400> 422 

gacgeakage cgtggggaag ttttcgea atg gee gee gga acg teg ecg gee 52 

Met Ala Ala Gly Thr Ser Pro Ala 

1 5 



gat 


get 


etc 


tge 


gac 


gga 


aag 


aaa 


gee 


gee 


gga 


ecc act 


tec 


ggt 


gcg 


100 


Asp 


Ala 


Leu 


Cys 


Asp 


Gly 


Lys 


Lys 


Ala 


Ala 


Gly 


Pro Thr 


Ser 


Gly 


Ala 






10 










15 










20 










ttt 


tac 


aat 


gaa 


etc 


egg 


gea 


etc 


gag 


teg 


etc 


ecc gga 


aac 


tat 


agt 


148 


Phe 


Tyr 


Asn 


Glu 


Leu 


Arg 


Ala 


Leu 


Glu 


Ser 


Leu 


Pro Gly 


Asn 


Tyr 


Ser 




25 










30 










35 








40 




tct 


get 


tec 


acc 


gtt 


gtc 


eta 


ctt 


tgg 


ata 


gea 


tea aga 


caa 


aaa 


tct 


196 


Ser 


Ala 


Ser 


Thr 


Val 


Val 


Leu 


Leu Trp 


He 


Ala 


Ser Arg 


Gin 


Lys 


Ser 












45 










50 








55 






cag 


tct 


teg 


agg 


aac 


tea 


cag 


tgt 


age 


aga 


aaa 


gac aaa 


gga 


ggt 


aaa 


244 


Gin 


Ser 


Ser 


Arg 


Asn 


Ser 


Gin 


Cys 


Ser 


Arg 


Lys 


Asp Lys 


Gly 


Gly 


Lys 





60 65 70 



cag gag gag cat gaa taattcatct tgatgtaatc atgcaagact cctaagaaga 299 
Gin Glu Glu His Glu 
75 

tgttaggaga ccctgtcatc attcttttgg ectctaatgt ecgttatett catgtttget 359 
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ggcttccttt 


tactactatg 


gaaatacaac 


atatgcaaaa 


gtagtgtttt 


cgctgtttaa 


419 


aaacatcgtc 


agcagcatca 


tcatcattaa 


ctgcaccagg 


aagccatgac 


tcctttctgg 


479 


acaaggtgca 


cttgcctgtc 


atccttccat 


ctctttcttc 


ctccttgaag 


ttcacaatac 


539 


ccagtccagg 


agcattgcag 


agaatgctag 


gtgtgtctgc 


agacttgact 


ttaaaagaaa 


599 


acaagttcct 


tcaagtgagt 


tcatcatgat 


ggggatgata 


tttgtttgag 


agaatgtgtg 


659 


tctttggcaa 












669 


<210> 423 














<211> 546 














<212> DNA 














<213> Homo 


sapiens 













<220> 
<221> CDS 
<222> 78. .302 



<400> 423 

ctygccagct gccggtcttt cgggggctcc gtaactttct atccgtccgc gtcascyttg 60 
ccaccctcat ctccaat atg cct ggt ccg acc ccc agt ggc act aac gtg 110 
Met Pro Gly Pro Thr Pro Ser Gly Thr Asn Val 

158 



206 











1 








5 








10 




gga 


tec 


tea 


ggg 


cgc tct 


ccc 


age 


aaa 


gca 


gtg 


gcc 


gcc 


egg geg 


egg 


Gly Ser 


Ser Gly 


Arg Ser 


Pro 


Ser 


Lys 


Ala 


Val 


Ala 


Ala 


Arg Ala 


Arg 








15 








20 










25 




gat 


oca 


ctg 


tec 


ggc aga 


gga 


aaa 


atg 


cca 


get 


gtg 


gga 


caa gga 


gtg 


Asp 


Pro 


Leu 


Ser 


Gly Arg 


Gly 


Lys 


Met 


Pro 


Ala 


Val 


Gly 


Gin Gly 


Val 






30 








35 










40 






cag 


gcc 


gca 


caa 


cct egg 


cag 


gca 


ccg 


ggg 


gga 


tgt 


ggc 


gat tct 


aca 


Gin 


Ala 


Ala 


Gin 


Pro Arg 


Gin 


Ala 


Pro Gly 


Gly 


Cys 


Gly 


Asp Ser 


Thr 




45 








50 










55 








cag 


aag 


att 


cac 


ctg grc 


tea 


aag 


ttg 


gee 


ctg 


ttc 


cag 


tat tgg 


tta 


Gin 


Lys 


He 


His 


Leu Xaa 


Ser 


Lys 


Leu 


Ala 


Leu 


Phe 


Gin 


Tyr Trp 


Leu 


60 








65 










70 








75 



254 



302 



tgagtcttet gttcatcgct tctgtattta tgttgeacat ttggggcaag tacactcgtt 362 

cgtagattea gttacatcca tetgteatct gaagaakgag gaaawaaccc aacatttctt 422 

ggacceaaaa gtatagtgac tatctgttea tgagagaaat tttctgtaag ettgctgttt 482 

ttaamcaggg gatttatcaa ttaattgatt tgaggaatea gtttttttet atggctaata 542 

aact 546 



<210> 424 
<211> 499 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 16. .183 



<400> 424 

tttattttgc cttct atg tea tct tgg age tta atg tgc atg caa gat atg 51 
Met Ser Ser Trp Ser Leu Met Cys Met Gin Asp Met 
15 10 

gga aat act aaa gca aga eta etc tat gaa gee aat ctt cca gag aac 99 

Gly Asn Thr Lys Ala Arg Leu Leu Tyr Glu Ala Asn Leu Pro Glu Asn 
15 20 25 

ttt ega aga cca cag aca gat caa gca gtg gaa ttt ttc ate aga gat 147 

Phe Arg Arg Pro Gin Thr Asp Gin Ala Val Glu Phe Phe He Arg Asp 
30 35 40 

aaa tat gaa aag aag aaa tac tac gaa aaa atg cca tagctattae 193 

Lys Tyr Glu Lys Lys Lys Tyr Tyr Glu Lys Met Pro 

45 50 55 

aaatatttec tcctetgatg etcctcttca gectttggta teetctcctt ctetgeaage 253 
tgctgttgac aaaaataaat tggagaaaga aaaggaaaaa aaaaaggaag agaaaaagag 313 
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agaaaaggag ccagaaaagc cggcaaaacc acttacagct gaaaagctgc agaagaaaga 373 

tcagcaactg gagcctaaaa aaagtaccag ccctaaaaaa gctgcggasc cactgtggat 4 33 

cttttaggac ttgtttggag aaagattgag aaggaaaaga gtgaaaggca gttagttgaa 4 93 

gtggaa 4 99 

<210> 425 
<211> 512 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 122. .487 



<400> 425 

aaacacatcc aagcttaaga cggtgaggtc agcttcacat tctcaggaac tctccttctt 60 
tgggtctggc tgaagttgag gatctcttac tctctaggcc acggaattaa cccgagcagg 120 
c atg gag gcc tct get etc acc tea tea gca gtg ace agt gtg gcc aaa 169 
Met Glu Ala Ser Ala Leu Thr Ser Ser Ala Val Thr Ser Val Ala Lys 
15 10 15 

gtg gtc agg gtg gcc tct ggc tct gcc gta gtt ttg ccc ctg gcc agg 217 
Val Val Arg Val Ala Ser Gly Ser Ala Val Val Leu Pro Leu Ala Arg 

20 25 30 

att get aca gtt gtg att gga gga gtt gtg gcc atg gcg get gtg ccc 265 
He Ala Thr Val Val He Gly Gly Val Val Ala Met Ala Ala Val Pro 

35 40 45 

atg gtg etc agt gcc atg ggc ttc act gcg gcg gga ate gcc teg tec 313 
Met Val Leu Ser Ala Met Gly Phe Thr Ala Ala Gly He Ala Ser Ser 

50 55 60 

tec ata gca gee aag atg atg tec gcg gcg gee att gee aat ggg ggt 361 
Ser He Ala Ala Lys Met Met Ser Ala Ala Ala He Ala Asn Gly Gly 
65 70 75 80 

gga gtt gcc teg ggc age ctt gtg get act ctg cag tea ctg gga gca 409 
Gly Val Ala Ser Gly Ser Leu Val Ala Thr Leu Gin Ser Leu Gly Ala 

85 90 95 

act gga etc tec gga ttg acc aag ttc ate ctg ggc tec att ggg tct 457 
Thr Gly Leu Ser Gly Leu Thr Lys Phe He Leu Gly Ser He Gly Ser 

100 105 110 

gcc att gcg get gtc att gcg agg ttc tac tagctcectg eeetcgeetg 507 
Ala He Ala Ala Val He Ala Arg Phe Tyr 

115 120 
cagag 512 



<210> 426 
<211> 502 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 57. .278 



<400> 426 

attcaeeggc gaegcgatae ggttectcca ecgaggceea tgegaagctt tceaet atg 59 

Met 
1 

get tec age act gtc ccg gtg age get get ggc teg get aat gaa act 107 
Ala Ser Ser Thr Val Pro Val Ser Ala Ala Gly Ser Ala Asn Glu Thr 

5 10 15 

ccc gaa ata ccg gae aac gtg gga gat tgg ctt egg ggc gtc tac cgc 155 
Pro Glu He Pro Asp Asn Val Gly Asp Trp Leu Arg Gly Val Tyr Arg 

20 25 30 

ttt gcc act gat agg aat gae ttc egg agg aac ttg ata eta aat ttg 203 
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245 

Phe Ala Thr Asp Arg Asn Asp Phe Arg Arg Asn Leu lie Leu Asn Leu 

35 40 45 

gga etc ttt get gcg gga gtt tgg ctg gee agg aac ttg agt gac att 251 
Gly Leu Phe Ala Ala Gly Val Trp Leu Ala Arg Asn Leu Ser Asp lie 
50 55 60 65 

gac etc atg gca cct cag cca ggg gtg tagccaagta gttctaatgc 298 
Asp Leu Met Ala Pro Gin Pro Gly Val 
70 

cacctgtcgt cttatcatct gattgcagac aaatggaatc ctgtgctgaa cccgaatctt 358 
ccaaaaaaca gcctacaatc tgtgaccacc acaagatgtg ccctgatggc agctgaagtt 418 
tgattcagat gggcactttt cttccccttc cctgcctagt ttccttttgt tccttgagtc 478 
acgcagaatt ccattctctg gtca 502 

<210> 427 
<211> 575 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 50. .340 



<400> 427 

gaggagaaga ggtgaccccg ctaaagataa taaatcatca ttgatcaag atg tct tea 58 

Met Ser Ser 



gca 


cct 


gag 


cct 


cca 


aca 


ttc 


aaa 


aag 


gaa 


cca 


cec 


aaa 


1 

gar 


aaa 


gag 


106 


Ala 


Pro 
5 


Glu 


Pro 


Pro 


Thr 


Phe 
10 


Lys 


Lys 


Glu 


Pro 


Pro 
15 


Lys 


Glu 


Lys 


Glu 




ttt 


caa 


age 


cca 


ggg 


etc 


aga 


ggg 


gtg 


cgc 


aca 


aca 


ace 


tta 


ttt 


cgt 


154 


Phe 


Gin 


Ser 


Pro 


Gly 


Leu 


Arg 


Gly 


Val 


Arg 


Thr 


Thr 


Thr 


Leu 


Phe 


Arg 




20 










25 










30 










35 




get 


gtg 


aat 


cca 


gag 


etc 


ttc 


att 


aaa 


cct 


aac 


aaa 


cct 


gta 


atg 


get 


202 


Ala 


Val 


Asn 


Pro 


Glu 


Leu 


Phe 


He 


Lys 


Pro 


Asn Lys 


Pro 


Val 


Met 


Ala 












40 










45 










50 






ttc 


gga 


ttg 


gta 


act 


ett 


tea 


ett 


tgc 


gtg 


gca 


tat 


att 


ggt 


tat 


eta 


250 


Phe 


Gly 


Leu 


Val 
55 


Thr 


Leu 


Ser 


Leu 


Cys 
60 


Val 


Ala 


Tyr 


He 


Gly 
65 


Tyr 


Leu 




cat 


gca 


ata 


caa 


gag 


aat 


aaa 


aag 


gac 


etc 


tat 


gaa 


get 


att 


gat 


agt 


298 


His 


Ala 


lie 


Gin 


Glu 


Asn 


Lys 


Lys 


Asp 


Leu 


Tyr Glu 


Ala 


He 


Asp 


Ser 








70 










75 










80 










gag 


ggg 


cac 


agt 


tat 


atg 


agg 


egg 


aaa 


aca 


tec 


aaa 


tgg 


gat 






340 


Glu 


Gly 


His 


Ser 


Tyr 


Met 


Arg 


Arg 


Lys 


Thr 


Ser 


Lys 


Trp 


Asp 









85 90 95 

tagtagtget ggttagtgca ratggacett tattaaaggt tetgaaatct teaaatgaaa 400 

gaccttgtga gtgtaeagta tcatgtttct tgttctagaa catgctaatg aagagagaag 460 

atagcagttg caaccagaca actgtcgtaa attttgtcct ttcacagctg cagccattat 520 

ctcattcttt ttccacagag tgagcgtcat aatattttct ttccttactc ttata 575 



<210> 428 
<211> 493 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 177.. 470 



<400> 428 

atcgaccaeg gaaatttgac acctccggge ttggaagcag etctctcctc cttccccgct 60 
gettataaac etcagccctg aggctccagc tcactctacc ccatctcctt gccgggtcag 120 
ccctgacaaa ggtcagctag ccccttgagg acateagctt tggcctcagg gtccta atg 179 

Met 
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246 



gca 


gca 


gaa 


cca 


ctg 


aca 


gag 


eta 


gag gag tec 


att 


gag 


aac 


gtg 


gtc 


227 


Ala 


Ala 


Glu 


Pro 


Leu 


Thr 


Glu 


Leu 


Glu Glu Ser 


He 


Glu 


Asn 


Val 


Val 










5 










10 






15 








acc 


acc 


ttc 


ttc 


acc 


ttt 


gca 


agg 


cag gag ggc 


egg 


aag 


gat 


age 


etc 


275 


Thr 


Thr 


Phe 


Phe 


Thr 


Phe 


Ala 


Arg 


Gin Glu Gly 


Arg 


Lys 


Asp 


Ser 


Leu 








20 










25 






30 










age 


gtc 


aac 


gag 


ttc 


aaa 


gag 


ctg 


gtt acc cag 


cag 


ttg 


ccc 


cat 


ctg 


323 


Ser 


Val 


Asn 


Glu 


Phe 


Lys 


Glu 


Leu 


Val Thr Gin 


Gin 


Leu 


Pro 


His 


Leu 






35 










40 






45 












etc 


aag 


gat 


gtg 


ggc 


tct 


ctt 


gat 


gag aag atg 


aag 


age 


ttg 


gat 


gtg 


371 


Leu 


Lys 


Asp 


Val 


Gly 


Ser 


Leu 


Asp 


Glu Lys Met 


Lys 


Ser 


Leu Asp 


Val 




50 










55 






60 










65 




aat 


cag 


gae 


teg 


gag 


etc 


aag 


ttc 


aat gag tac 


tgg 


aga 


ttg 


att 


ggg 


419 


Asn 


Gin 


Asp 


Ser 


Glu 


Leu 


Lys 


Phe 


Asn Glu Tyr 


Trp Arg 


Leu 


He Gly 












70 








75 








80 






gag 


ctg 


gcc 


aag 


gaa 


ate 


agg 


aag 


aag aaa gac 


ctk 


aag 


ate 


agg 


aag 


467 


Glu 


Leu 


Ala 


Lys 


Glu 


He 


Arg 


Lys 


Lys Lys Asp 


Leu 


Lys 


He 


Arg 


Lys 





85 90 95 

aag taaagecgee tggetgagat ggg 493 

Lys 



<210> 429 
<211> 505 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 114. .458 



<400> 429 

agttccggct ctgcstgaag agctttgcat tgtgggaagt ytttytttct cgttccccgg 60 
ecatcttage ggctgctgtt ggttgggggc egtcecgetc ctaaggcagg aag atg 116 

Met 
1 



gtg 


gcc 


gca 


aag 


aag 


aeg 


aaa 


aag 


teg 


ctg 


gag 


teg 


ate 


aac 


tct 


agg 


164 


Val 


Ala 


Ala 


Lys 


Lys 


Thr 


Lys 


Lys 


Ser 


Leu 


Glu 


Ser 


He 


Asn 


Ser 


Arg 










5 










10 










15 








etc 


caa 


etc 


gtt 


atg 


aaa 


agt 


ggg 


aag 


tac 


gtc 


ctg 


ggg 


tac 


aag 


cag 


212 


Leu 


Gin 


Leu 


Val 


Met 


Lys 


Ser 


Gly 


Lys 


Tyr 


Val 


Leu 


Gly 


Tyr 


Lys 


Gin 








20 










25 










30 










act 


ctg 


aag 


atg 


ate 


aga 


caa 


ggc 


aaa 


gcg 


aaa 


ttg 


gtc 


att 


etc 


get 


260 


Thr 


Leu 


Lys 


Met 


He 


Arg 


Gin 


Gly 


Lys 


Ala 


Lys 


Leu 


Val 


He 


Leu 


Ala 






35 










40 










45 












aac 


aac 


tge 


cca 


get 


ttg 


agg 


aaa 


tct 


gaa 


ata 


gag 


tac 


tat 


get 


atg 


308 


Asn 


Asn 


Cys 


Pro 


Ala 


Leu 


Arg 


Lys 


Ser 


Glu 


He 


Glu 


Tyr 


Tyr 


Ala 


Met 




50 










55 










60 










65 




ttg 


get 


aaa 


act 


ggt 


gtc 


cat 


cac 


tac 


agt 


ggc 


aat 


aat 


att 


gaa 


ctg 


356 


Leu 


Ala 


Lys 


Thr 


Gly 


Val 


His 


His 


Tyr 


Ser 


Gly 


Asn 


Asn 


He 


Glu 


Leu 












70 










75 










80 






ggc 


aca 


gca 


tge 


gga 


aaa 


tac 


tac 


aga 


gtg 


tge 


aca 


ctg 


get 


ate 


att 


404 


Gly 


Thr 


Ala 


Cys 


Gly 


Lys 


Tyr 


Tyr 


Arg 


Val 


Cys 


Thr 


Leu 


Ala 


He 


He 










85 










90 










95 








gat 


cca 


ggt 


gae 


tct 


gac 


ate 


att 


aga 


age 


atg 


cca 


gaa 


cag 


act 


ggt 


452 


Asp 


Pro 


Gly 


Asp 


Ser 


Asp 


He 


He 


Arg 


Ser 


Met 


Pro 


Glu 


Gin 


Thr 


Gly 





100 105 110 



gaa aag taaacctttt cacctaeaaa atttcacetg caaaaaaaaa aaaaaav 505 
Glu Lys 
115 



<210> 430 
<211> 772 
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<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 324. .629 



<400> 430 

ataatcgcgt catttccggg aggggacgaa ggggtagttc tttcacctcg gctgggcgcc 60 

tagaaaagcc tagaaacagc tccttttttc ttccgcctcc gagtcttcgc gtcagcgtcc 120 

tgcgcagggc ccttggggcg aatcgcggtg cgcgtcgggg cgaccgccct ccctccctgg 180 

gaggggcgag ggggctagcg gcgacsgctg gggcgagcgc gcctgcgcgc tgggtgattt 240 

tttcacgtgt cgccagggcc ggactgcgag tctctttgcg gcgctacact agagcagagt 300 

acgagtctga ggcggaggga gta atg gca gga caa gcg ttt aga aag ttt ctt 353 

Met Ala Gly Gin Ala Phe Arg Lys Phe Leu 















1 








5 










10 




cca 


etc 


ttt 


gac 


cga 


gta 


ttg 


gtt 


gaa 


agg 


agt 


get 


get 


gaa 


act 


gta 


401 


Pro 


Leu 


Phe 


Asp 


Arg 
15 


Val 


Leu 


Val 


Glu 


Arg 
20 


Ser 


Ala 


Ala 


Glu 


Thr 
25 


Val 




acc 


aaa 


gga 


ggc 


att 


atg 


ctt 


cca 


gaa 


aaa 


tct 


caa 


gga 


aaa 


gta 


ttg 


449 


Thr 


Lys 


Gly 


Gly 
30 


lie 


Met 


Leu 


Pro 


Glu 
35 


Lys 


Ser 


Gin 


Gly 


Lys 
40 


Val 


Leu 




caa 


gca 


aca 


gta 


gtc 


get 


gtt 


gga 


teg 


ggt 


tct 


aaa 


gga 


aag 


ggt 


gga 


497 


Gin 


Ala 


Thr 
45 


Val 


Val 


Ala 


Val 


Gly 
50 


Ser 


Gly 


Ser 


Lys 


Gly 
55 


Lys 


Gly 


Gly 




gag 


att 


caa 


cca 


gtt 


age 


gtg 


aaa 


gtt 


gga 


gat 


aaa 


gtt 


ctt 


etc 


cca 


545 


Glu 


lie 
60 


Gin 


Pro 


Val 


Ser 


Val 
65 


Lys 


Val 


Gly 


Asp 


Lys 
70 


Val 


Leu 


Leu 


Pro 




gaa 


tat 


gga 


ggc 


acc 


aaa 


gta 


gtt 


eta 


gat 


rac 


aag 


gat 


tat 


ttc 


eta 


593 


Glu 


Tyr 


Gly 


Gly 


Thr 


Lys 


Val 


Val 


Leu 


Asp 


Xaa 


Lys 


Asp 


Tyr 


Phe 


Leu 




75 










80 










85 










90 




ttt 


aga 


gat 


ggt 


gac 


att 


ctt 


gga 


aag 


tac 


gta 


gac 


tgaaataagt 




639 


Phe 


Arg 


Asp 


Gly 


Asp 
95 


lie 


Leu 


Gly 


Lys 


Tyr 
100 


Val 


Asp 












cactattgaa atggcatcaa catgatgctg 


r cccattccac 


tgaagttctg i 


aaatctttcg 


699 


tcatgtaaat aatttccata tttctctttt 


ataataaact 


aatgataact < 


aatgaccaaa 


759 



aaaaaaaaaa aaa 772 



<210> 431 
<211> 672 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 224. .529 



<400> 431 

cataagaaca gggaggttag aagtagggtc ttggtgacaa aatatgttgt gtagagttca 60 

ggggagagtg cgtcatatgt tgttctagga agattgtagt ggtgagggtg tttattataa 120 

taatgtttgt gtattegget atgaagaata ggceagggee ggactgcgag tctctttgcg 180 



gcgctacact agagcagagt acgagtctga ggcggaggga 


gta 


atg 


gca 


gga 


caa 


235 












Met 
1 


Ala 


Gly 


Gin 




gcg ttt aga aag ttt 


ctt 


cca 


etc ttt gac cga 


gta 


ttg 


gtt 


gaa 


agg 


283 


Ala Phe Arg Lys Phe 


Leu 


Pro 


Leu Phe Asp Arg 


Val 


Leu 


Val 


Glu 


Arg 




5 


10 




15 










20 




agt get get gaa act 


gta 


acc 


aaa gga ggc att 


atg 


ctt 


cca 


gaa 


aaa 


331 


Ser Ala Ala Glu Thr 


Val 


Thr 


Lys Gly Gly He 


Met 


Leu 


Pro 


Glu 


Lys 




25 






30 








35 






tct caa gga aaa gta 


ttg 


caa 


gca aca gta gtc 


get 


gtt 


gga 


teg 


ggt 


379 


Ser Gin Gly Lys Val 


Leu 


Gin 


Ala Thr Val Val 


Ala 


Val 


Gly Ser 


Gly 




40 






45 






50 
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tct aaa gga aag ggt gga gag att caa cca gtt age gtg aaa gtt gga 427 
Ser Lys Gly Lys Gly Gly Glu lie Gin Pro Val Ser Val Lys Val Gly 

55 60 65 

gat aaa gtt ctt etc cca gaa tat gga ggc acc aaa gta gtt eta gat 475 
Asp Lys Val Leu Leu Pro Glu Tyr Gly Gly Thr Lys Val Val Leu Asp 

70 75 80 

gac aag gat tat ttc eta ttt aga gat ggt gac att ett ggr aag tac 523 
Asp Lys Asp Tyr Phe Leu Phe Arg Asp Gly Asp lie Leu Gly Lys Tyr 
85 90 95 100 

gta gac tgaaataagt cactattgaa atggeatcaa catgatgctg cccattccac 579 
Val Asp 

tgaagttetg aaatctttcg tcatgtaaat aatttccata tttetctttt ataataaact 639 
aatgataact aatgaccaaa aaaaaaaaaa aaa 672 



<210> 432 
<211> 517 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 97.. 444 



<400> 432 

atctttccgt cccgggcagc cagcgccagt cggagccagc gegagccgcc gccgccatca 60 
etgccgctgc caagtcctec acccgctgcc eccgcc atg tct get acc get gee 114 























Met 
1 


Ser 


Ala 


Thr 


Ala 
5 


Ala 




acg 


gcc 


GCC 


cct 


get 


gcc 


ceg 


get 


ggg 


gag 


ggt 


ggt 


cec 


cct 


gea 


ecc 


162 


Thr 


Ala 


Pro 


Pro 


Ala 


Ala 


Pro 


Ala 


Gly 


Glu 


Gly Gly 


Pro 


Pro 


Ala 


Pro 










10 










15 










20 








cct 


cca 


aac 


etc 


acc 


agt 


aac 


agg 


aga 


ctg 


cag 


cag 


acc 


cag 


gcc 


cag 


210 


Pro 


Pro 


Asn 


Leu 


Thr 


Ser 


Asn 


Arg 


Arg 


Leu 


Gin 


Gin 


Thr 


Gin 


Ala 


Gin 








25 










30 










35 










gtg 


gat 


gag 


gtg 


gtg 


gac 


ate 


atg 


agg 


gtg 


aac 


gtg 


gac 


aag 


gtc 


ctg 


258 


Val 


Asp 


Glu 


Val 


Val 


Asp 


He 


Met 


Arg 


Val 


Asn 


Val 


Asp 


Lys 


Val 


Leu 






40 










45 










50 












gag 


cga 


gac 


cag 


aag 


ctg 


teg 


gag 


ctg 


gac 


gac 


cgt 


gea 


gat 


gea 


etc 


306 


Glu Arg 


Asp 


Gin 


Lys 


Leu 


Ser 


Glu 


Leu 


Asp 


Asp Arg 


Ala 


Asp 


Ala 


Leu 




55 










60 










65 










70 




cag 


gcg 


ggg 


gcc 


tec 


cag 


ttt 


gaa 


aca 


age 


gea 


gcc 


aag 


etc 


aag 


cgc 


354 


Gin 


Ala 


Gly 


Ala 


Ser 


Gin 


Phe 


Glu 


Thr 


Ser 


Ala 


Ala 


Lys 


Leu 


Lys 


Arg 












75 










80 










85 






aaa 


tac 


tgg 


tgg 


aaa 


aac 


etc 


aag 


atg 


atg 


ate 


ate 


ttg 


gga 


gtg 


att 


402 


Lys 


Tyr 


Trp 


Trp 


Lys 


Asn 


Leu 


Lys 


Met 


Met 


He 


He 


Leu 


Gly 


Val 


He 










90 










95 










100 








tgc 


gcc 


ate 


ate 


etc 


ate 


ate 


ate 


ata 


gtt 


tac 


ttc 


age 


act 






444 


Cys 


Ala 


lie 


He 


Leu 


He 


He 


He 


He 


Val 


Tyr 


Phe 


Ser 


Thr 












105 










110 










115 











taaatccccg aggagtctge cetgectaga gaagggcetc teccccaace ctcagccgtt 504 
cetecacetc tct 517 



<210> 433 
<211> 485 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 21.. 311 



<400> 433 

tttgttttct aatagggeaa atg tct eta age ttg gtg ttt aga get get tea 53 
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Met Ser Leu Ser Leu Val Phe Arg Ala Ala Ser 
15 10 
tat ttt aaa eta gtt cca ttc cac agt tct agt tea aac cag ttt tta 101 
Tyr Phe Lys Leu Val Pro Phe His Ser Ser Ser Ser Asn Gin Phe Leu 

15 20 25 

cag cct cct ggg tgg gtc gtc ttg acc caa act ctt gtg ttg tta cat 149 
Gin Pro Pro Gly Trp Val Val Leu Thr Gin Thr Leu Val Leu Leu His 

30 35 40 

ttt gag agg ttt tea tac eag aat gta cca raa agt gea caa ggt aaa 197 
Phe Glu Arg Phe Ser Tyr Gin Asn Val Pro Xaa Ser Ala Gin Gly Lys 

45 50 55 

ggt aat tta cag cca gaa aca aat ata cat ttg ttt cat ttc ttg act 245 
Gly Asn Leu Gin Pro Glu Thr Asn lie His Leu Phe His Phe Leu Thr 
60 65 70 75 

ttc cct aag cag ata age aga aac ttg ttt aac tea tta ctt tgt ttg 293 
Phe Pro Lys Gin lie Ser Arg Asn Leu Phe Asn Ser Leu Leu Cys Leu 

80 85 90 

atg tgt ctt aca tat ttt tgactaaaag ttatagaatg ttatt octet 341 
Met Cys Leu Thr Tyr Phe 
95 

gggggaatct tttacaggtg gaggaatggg gatagcagta ttgcctcaga ttcaaactgg 401 
catcaccata aaccctgtag gccagggtgg aatgaagtca gctccttttt atagttgaaa 4 61 
tacaattttt tattcaccat tgtc 485 

<210> 434 
<211> 511 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 31.. 369 

<400> 434 

acttgctctg cccgetcaaa cagttgcagg atg teg atg aca gae ttg ctg aac 54 

Met Ser Met Thr Asp Leu Leu Asn 
1 5 

get gag gac ate aag aag gcg gtg gga gcc ttt age get acc gae tec 102 
Ala Glu Asp lie Lys Lys Ala Val Gly Ala Phe Ser Ala Thr Asp Ser 

10 15 20 

ttc gac cac aaa aag ttc ttc caa atg gtc gge ctg aag aaa aag agt 150 
Phe Asp His Lys Lys Phe Phe Gin Met Val Gly Leu Lys Lys Lys Ser 
25 30 35 40 

gcg gat gat gtg aag aag gtg ttt cac atg ctg gac aag gac aaa agt 198 
Ala Asp Asp Val Lys Lys Val Phe His Met Leu Asp Lys Asp Lys Ser 

45 50 55 

gge ttc ate gag gag gat gag ctg gga ttc ate eta aaa gge ttc tec 246 
Gly Phe lie Glu Glu Asp Glu Leu Gly Phe lie Leu Lys Gly Phe Ser 

60 65 70 

cca gat gcc aga gae ctg tct get aaa gaa acc aag atg ctg atg get 294 
Pro Asp Ala Arg Asp Leu Ser Ala Lys Glu Thr Lys Met Leu Met Ala 

75 80 85 

get gga gac aaa gat ggg gac gge aaa att ggg gtt gac gaa ttc tec 342 
Ala Gly Asp Lys Asp Gly Asp Gly Lys lie Gly Val Asp Glu Phe Ser 

90 95 100 

act ctg gtg get gaa ast aag aag cac tgaetgecce tggtctteca 389 
Thr Leu Val Ala Glu Xaa Lys Lys His 
105 110 

ectctetgee etgaacaccc aatcteggec cetctegeca eeeteetgca tttctgttca 449 
gttcgtttat gttatttttt actcccceat cccctgtggc cetctaatga caccattctt 509 
ct 511 



<210> 435 
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<211> 511 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 82. .417 



<400> 435 

cccttctgcg cggtcacgcg agccagcgcc tgggcctgga accgggccgt agcccccagt 60 

ttcgcccacc acctccctac c atg gac ccc cgc aaa gtg aac gag ctt egg 111 

Met Asp Pro Arg Lys Val Asn Glu Leu Arg 
15 10 

gcc ttt gtg aaa atg tgt aag cag gat ccg age gtt ctg cac acc gag 159 

Ala Phe Val Lys Met Cys Lys Gin Asp Pro Ser Val Leu His Thr Glu 

15 20 25 

gaa atg cgc ttc ctg agg gag tgg gtg gag age atg ggt ggt aaa gta 207 

Glu Met Arg Phe Leu Arg Glu Trp Val Glu Ser Met Gly Gly Lys Val 

30 35 40 

cca act get act caa aaa get aaa tea gaa gaa aat ace aag gaa gaa 255 

Pro Pro Ala Thr Gin Lys Ala Lys Ser Glu Glu Asn Thr Lys Glu Glu 

45 50 55 

aaa cet gat agt aag aag gtg gag gaa gac tta aag gca gac gaa cca 303 

Lys Pro Asp Ser Lys Lys Val Glu Glu Asp Leu Lys Ala Asp Glu Pro 

60 65 70 

tea agt gag gaa agt gat eta gaa att gat aaa gaa ggt gtg att gaa 351 

Ser Ser Glu Glu Ser Asp Leu Glu lie Asp Lys Glu Gly Val lie Glu 
75 80 85 90 

cca gac act gat get eet caa gaa atg gga gat gaa aat geg gag ata 399 

Pro Asp Thr Asp Ala Pro Gin Glu Met Gly Asp Glu Asn Ala Glu lie 

95 100 105 

acg get caa cat ttt ttg tagccacagg cttccacgga cttcacgtca 4 47 

Thr Ala Gin His Phe Leu 
110 

ttattggetc aacttteete actatctget teatccgeca actaatattt cactttacat 507 
ceca 511 



<210> 436 
<211> 475 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 22.-375 



<400> 436 



agtcgagcea gcgtcgccgc g 


atg 


gtg 


ttg 


ttg 


gag 


age 


gag 


cag 


ttc 


ctg 


51 














Met 
1 


Val 


Leu 


Leu 


Glu 
5 


Ser 


Glu 


Gin 


Phe 


Leu 
10 




acg 


gag 


ctg 


ace 


aga 


ctt 


ttc 


cag 


aag 


tge 


egg 


acg 


teg 


gge 


age 


gtc 


99 


Thr 


Glu 


Leu 


Thr 


Arg 


Leu 


Phe 


Gin 


Lys 


Cys 


Arg 


Thr 


Ser 


Gly 


Ser 


Val 












15 










20 










25 






tat 


ate 


acc 


ttg 


aag 


aag 


tat 


gac 


ggt 


cga 


acc 


aaa 


ccc 


att 


cca 


aag 


147 


Tyr 


lie 


Thr 


Leu 


Lys 


Lys 


Tyr 


Asp 


Gly 


Arg 


Thr 


Lys 


Pro 


He 


Pro 


Lys 










30 










35 










40 








aaa 


ggt 


act 


gtg 


gag 


gge 


ttt 


gag 


ccc 


gca 


gac 


aac 


aag 


tgt 


ctg 


tta 


195 


Lys 


Gly 


Thr 


Val 


Glu 


Gly 


Phe 


Glu 


Pro 


Ala 


Asp 


Asn 


Lys 


Cys 


Leu 


Leu 








45 










50 










55 










aga 


get 


acc 


gat 


ggg 


aag 


aag 


aag 


ate 


age 


act 


gtg 


gtg 


age 


tec 


aag 


243 


Arg Ala 


Thr 


Asp 


Gly 


Lys 


Lys 


Lys 


lie 


Ser 


Thr 


Val 


Val 


Ser 


Ser 


Lys 






60 










65 










70 












gaa 


gtg 


aat 


aag 


ttt 


cag 


atg 


get 


tat 


tea 


aac 


etc 


ctt 


aga 


get 


aac 


291 
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Glu Val Asn Lys Phe Gin Met Ala Tyr Ser Asn Leu Leu Arg Ala Asn 



75 



90 



80 85 
atg gat ggg ttg aag aag aga gac aaa aag aac gaa act aag aag acc 
Met Asp Gly Leu Lys Lys Arg Asp Lys Lys Asn Glu Thr Lys Lys Thr 

95 100 
aaa gca gca gca gca gca gca gca aca gca gca 
Lys Ala Ala Ala Ala Ala Ala Ala Thr Ala Ala 

110 115 
acatttcctg ctttcaccaa ttaaccactg aattgctatt ttttcctttt ggccagatag 
ctaggtttct ggttccccca cagtaggtgt 



cag 
Gin 



105 
taaagggcat 



339 



385 



445 
475 



<210> 437 
<211> 896 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 351. .677 

<220> 

<221> misc_feature 
<222> 234 

<223> n«a, g, c or t 



<400> 437 

gcacgagacc 

taagtgaaaa 

ctgtacatgt 

atactgtgct 

atgattctgg 

ctccattcca 



attacataaa 
tgacttagaa 
ttctgatttc 
ttttccagca 
tctgataaca 
tactcctcct 



aacaaacagg 
gaactaaggg 
agtacagata 
aaggatctca 
ataaacagtt 
gaggaaccca 



aatgcattct 
tagatcacta 
atagtggatc 
aggaaaagga 
cacaagagca 
atgaatgtga 



agt tta 


cct 


tct 


ggt 


aaa 


ata 


crt 


cga 


aaa 


Ser Leu 


Pro 


Ser 


Gly 


Lys 


He 


Xaa 


Arg 


Lys 




5 










10 






cga aat 


aga 


aaa 


gaa 


aat 


ctg 


gaa 


cca 


aat 


Arg Asn 


Arg 


Lys 


Glu 


Asn 


Leu 


Glu 


Pro 


Asn 



20 



25 



act gaa ttt wtt aca era gaa gaa 
Thr Glu Phe Xaa Thr Xaa Glu Glu 



35 



40 



cag tct tta eta gac ttg ttt cag 
Gin Ser Leu Leu Asp Leu Phe Gin 
55 

ttg ggt ttc aca age tac aca gaa 
Leu Gly Phe Thr Ser Tyr Thr Glu 
70 

gat att tgg gaa gag gaa aat tea 
Asp He Trp Glu Glu Glu Asn Ser 

85 90 
teg tec cct tea act tct aca ttt 
Ser Ser Pro Ser Thr Ser Thr Phe 



100 



105 



cttttcagaa gtgataagga teatattctt 

taggattttt ttaccagctt tgtttacaga 

gcaattttct acagaattga atacctgtta 

gtaaaaaaaa aaaaaaatg 



aac aga att 
Asn Arg He 

45 

act agt gaa 
Thr Ser Glu 
60 

aag agt ggt 
Lys Ser Gly 
75 

gat aat ctg 
Asp Asn Leu 

act ggc ttt 
Thr Gly Phe 

gaaattttta 
cccaaatgta 
aagaaaaatt 



tgacatttcc gaacacacat 
taaatgtaac atacaggcat 
tcaaccaaaa cagaagtcag 
ccttcattca atanttactc 
cctaactgtt caggcaaagg 
cttcaagaat atg gat 

Met Asp 

1 

aaa ata ata tta gga 
Lys He He Leu Gly 
15 

gaa ttt gat aaa aga 
Glu Phe Asp Lys Arg 
30 

tgt agt tea ceg gta 
Cys Ser Ser Pro Val 

50 

gag aaa tea gaa ttt 
Glu Lys Ser Glu Phe 
65 

ata tgc aat gtt tta 
He Cys Asn Val Leu 
80 

tta aca gcg ttw ttc 
Leu Thr Ala Xaa Phe 
95 

tagaatttaa aaaatgeata 



taaatatgta tggaaattct 
aatattaaaa ataaatattt 
acagaataaa cttgtgactg 



60 
120 
180 
240 
300 
356 



404 



452 



500 



548 



596 



644 



697 



757 
817 
877 
896 



<210> 438 
<211> 576 
<212> DNA 
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<213> Homo sapiens 

<220> 
<221> CDS 
<222> 162. .470 

<400> 438 

cattttaatt catatytgag tgggcggtgg cgattggtgt tggcggtctg gctcagctgg 60 
gcagggggta actttactga tttgggggtg gtttttagtt taatttttct tttctagctt 120 
cccatcgacg gtcagtgcgc acgttgtaat cagctgaggc c atg tea gga gac gga 176 

Met Ser Gly Asp Gly 



gcc 


acg 


gag 


cag 


gca 


get 


gag 


tat 


gtc 


cca 


gag 


1 

aag 


gtg 


aag 


aaa 


5 

gcg 


224 


Ala 


Thr 


Glu 


Gin 


Ala 


Ala 


Glu 


Tyr 


Val 


Pro 


Glu 


Lys 


Val 


Lys 


Lys 


Ala 












10 










15 










20 






gaa 


aag 


aaa 


tta 


gaa 


gag 


aat 


cea 


tat 


gac 


ctt 


gat 


get 


tgg 


age 


att 


272 


Glu 


Lys 


Lys 


Leu 


Glu 


Glu 


Asn 


Pro 


Tyr 


Asp 


Leu 


Asp 


Ala 


Trp 


Ser 


He 










25 










30 










35 








etc 


att 


cga 


gag 


gca 


cag 


aat 


caa 


cct 


ata 


gac 


aaa 


gca 


egg 


aag 


act 


320 


Leu 


lie 


Arg 


Glu 


Ala 


Gin 


Asn 


Gin 


Pro 


He 


Asp 


Lys 


Ala 


Arg 


Lys 


Thr 








40 










45 










50 










tat 


gaa 


cgc 


ctt 


gtt 


gcc 


cag 


ttc 


ccc 


agt 


tct 


ggc 


aga 


ttc 


tgg 


aaa 


368 


Tyr 


Glu 


Arg 


Leu 


Val 


Ala 


Gin 


Phe 


Pro 


Ser 


Ser 


Gly 


Arg 


Phe 


Trp 


Lys 






55 










60 










65 












ctg 


tac 


att 


gaa 


gca 


gag 


gtt 


act 


att 


tta 


ttt 


tat 


ttt 


ttc 


tta 


tat 


416 


Leu 


Tyr 


He 


Glu 


Ala 


Glu 


Val 


Thr 


He 


Leu 


Phe 


Tyr 


Phe 


Phe 


Leu 


Tyr 




70 










75 










80 










85 




cag 


tat 


tgc 


age 


att 


cac 


tgt 


agt 


gat 


aga 


aaa 


caa 


gtt 


agg 


aac 


ata 


464 


Gin 


Tyr 


Cys 


Ser 


He 


His 


Cys 


Ser 


Asp 


Arg 


Lys 


Gin 


Val 


Arg 


Asn 


He 





90 95 100 



gcc aat taggacaagg aggatttaaa tgtgtcttac ctttattttg taaaataggt 520 
Ala Asn 

ataaaggagt aattaaaatg aatttttgaa atttgggtct tttacaagct gatgat 57 6 

<210> 439 
<211> 484 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 32. .349 

<400> 439 

actaactage aaacggggac tagaaatagg g atg ctg aaa age aac ggg gag 52 

Met Leu Lys Ser Asn Gly Glu 
1 5 



aga 


cgc 


agt 


cgt 


aac 


gca 


ctt 


ccg 


gcg 


gtc 


tac 


gcg 


agg 


aag 


atg 


get 


100 


Arg 


Arg 


Ser 


Arg 


Asn 


Ala 


Leu 


Pro 


Ala 


Val 


Tyr 


Ala 


Arg 


Lys 


Met 


Ala 








10 










15 










20 










gca 


tec 


cag 


cag 


caa 


get 


tea 


gcg 


get 


tee 


tea 


get 


get 


ggt 


gta 


teg 


148 


Ala 


Ser 


Gin 


Gin 


Gin 


Ala 


Ser 


Ala 


Ala 


Ser 


Ser 


Ala 


Ala 


Gly 


val 


Ser 






25 










30 










35 












ggt 


cct 


agt 


teg 


get 


ggc 


ggc 


ccg 


ggt 


ccc 


cag 


cag 


cag 


ccg 


caa 


ccg 


196 


Gly 


Pro 


Ser 


Ser 


Ala 


Gly 


Gly 


Pro 


Gly 


Pro 


Gin 


Gin 


Gin 


Pro 


Gin 


Pro 




40 










45 










50 










55 




cca 


gca 


caa 


ctg 


gtg 


ggc 


cct 


gcc 


cag 


age 


ggc 


etc 


ctg 


cag 


caa 


cag 


244 


Pro 


Ala 


Gin 


Leu 


Val 


Gly 


Pro 


Ala 


Gin 


Ser 


Gly 


Leu 


Leu 


Gin 


Gin 


Gin 












60 










65 










70 






caa 


cag 


gac 


ttc 


gat 


cct 


gtg 


cag 


cgt 


tat 


aag 


atg 


etc 


ate 


ccg 


cag 


292 


Gin 


Gin 


Asp 


Phe 


Asp 


Pro 


Val 


Gin 


Arg 


Tyr 


Lys 


Met 


Leu 


He 


Pro 


Gin 










75 










80 










85 








ctg 


aag 


gag 


agt 


eta 


cag 


gtg 


att 


ggc 


ctt 


aag 


cag 


cga 


gaa 


gca 


aac 


340 
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Leu Lys Glu Ser Leu Gin Val lie Gly Leu Lys Gin Arg Glu Ala Asn 

90 95 100 

tgg att tgg tagtctagag gggcaggccg rgggccaggt tagtcggaga 389 
Trp He Trp 
105 

gagcgctaag cagaaagagg aatagaacct gctgttttgg cctggctggt gcacgcctgt 449 
gttccagcac tttgggaggc cgaggcgkgc ggatc 484 

<210> 440 
<211> 523 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 114 . .371 

<400> 440 

ctctctcagc ttccggctgg tagtagttcc gcttcctgtc cgactgtggt gtctttgctg 60 
agggtcacat tgagctgcag gttgaatccg gggtgccttt aggattcagc acc atg 116 

Met 

1 



gcg 


gaa 


gac 


atg 


gag 


acc 


aaa 


ate 


aag 


aac 


tac aag 


acc 


gee 


ect 


ttt 


164 


Ala 


Glu 


Asp 


Met 


Glu 


Thr 


Lys 


He 


Lys 


Asn 


Tyr Lys 


Thr 


Ala 


Pro 


Phe 










5 










10 








15 








gac 


age 


cgc 


ttc 


ccc 


aac 


cag 


aac 


cag 


act 


aga aac 


tgc 


tgg 


cag 


aac 


212 


Asp 


Ser 


Arg 


Phe 


Pro 


Asn 


Gin 


Asn 


Gin 


Thr 


Arg Asn 


Cys 


Trp 


Gin 


Asn 








20 










25 








30 










tac 


ctg 


gac 


ttc 


cac 


cgc 


tgt 


cag 


aag 


gca 


atg acc 


get 


aaa 


gga 


ggc 


260 


Tyr 


Leu 


Asp 


Phe 


His 


Arg 


Cys 


Gin 


Lys 


Ala 


Met Thr 


Ala 


Lys 


Gly 


Gly 






35 










40 








45 












gat 


ate 


tct 


gtg 


tgc 


gaa 


tgg 


tac 


cag 


cgt 


gtg tac 


cag 


tec 


etc 


tgc 


308 


Asp 


He 


Ser 


Val 


Cys 


Glu 


Trp 


Tyr 


Gin 


Arg 


Val Tyr 


Gin 


Ser 


Leu 


Cys 




50 










55 










60 








65 




ccc 


aca 


tec 


tgg 


gtc 


aca 


gac 


tgg 


gat 


gag 


caa egg 


get 


gaa 


ggc 


acg 


356 


Pro 


Thr 


Ser 


Trp 


Val 


Thr 


Asp 


Trp 


Asp 


Glu 


Gin Arg 


Ala 


Glu 


Gly 


Thr 





70 75 80 



ttt ccc ggg aag ate tgaactggct gcatctccet ttcctctgtc ctecatcett 411 
Phe Pro Gly Lys He 
85 

etcccaggat ggtgaagggg gacctggtac ceagtgatcc ccaccccagg atcctaaatc 471 
atgacttace tgctaataaa aactcattgg aaaagyraaa aaaaaaaaaa aa 523 

<210> 441 
<211> 499 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 95. .409 

<400> 441 

ttcgaagtgg ttgtstctgc actatttact gaaaagttca tcctttcccc agtgatttgt 60 
aacactgcet ctttcataaa ttaaattttt gtgt atg tgt gtg ggt ctt ttg atg 115 

Met Cys Val Gly Leu Leu Met 

1 5 

gtt tct att ctg act gac ate aat ttg tct aat ctt gta gca gta cag 163 
Val Ser He Leu Thr Asp He Asn Leu Ser Asn Leu Val Ala Val Gin 

10 15 20 

tac agt ect gat tat tgc aac ttt agg aaa agg tct gat aaa aac caa 211 
Tyr Ser Pro Asp Tyr Cys Asn Phe Arg Lys Arg Ser Asp Lys Asn Gin 
25 30 35 
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259 



307 



gat 


QCC 


tec 


aca 


ttt 


tat 


cat 


aat 


tqt 


aac 


caa 


ttt 


cac 


etc 


tgt 


etc 


Asp 


Ala 


Ser 


Thr 


Phe 


Cys 


His 


Asn 


Cvs 


Asn 


Gin 


Phe 


His 


Leu 


Cys 


Leu 


40 










45 










50 










55 


cag 


tat 


cac 


cac 


aaa 


att 


att 


ctt 


cct 


taa 


aat 


gtc 


ttg 


get 


att 


ctt 


Gin 


Tvr 


His 


His 


Lys 


He 


Val 


Leu 


Pro 


TrD 


Ser 


Val 


Leu 


Ala 


He 


Leu 










60 










65 










70 




age 


caa 


ctg 


ttc 


etc 


cat 


att 


act 


tct 


aga 


att 


aga 


cca 


aca 


att 


tat 


Ser 


Gin 


Leu 


Phe 


Leu 


His 


He 


Thr 


Ser 


Arg 


He 


Arg 


Pro 


Thr 


He 


Tyr 








75 










80 










85 






aaa 


tea 


aac 


aaa 


ccc 


aag 


age 


att 


gaa 


att 


ttg 


att 


gga 


ttt 


gta 


ttg 


Lys 


Ser 


Asa 


Lys 


Pro 


Lys 


Ser 


He 


Glu 


He 


Leu 


He 


Gly 


Phe 


Val 


Leu 






90 










95 










100 









355 



403 



aat tta tagattaate tggtaaacca tgteatcttt aeaatgttgt ettceaatac 4 59 

Asn Leu 
105 

atgaatatgg tacagctett catttactta ggcctttgaa 4 99 



<210> 442 
<211> 460 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 75. .377 



<400> 442 

ccctcgcgga gcttactgag cgcggccccg agcccagctc cgcegecgag cgcctgtgcc 60 



ggcaeggmwa cacc 


atg 


gag 


ege 


eeg 


gat 


aag 


gcg 


gcg 


ctg 


aac 


gea 


ctg 


110 










Met 


Glu 


Arg 


Pro 


Asp 


Lys 


Ala 


Ala 


Leu 


Asn 


Ala 


Leu 












1 








5 










10 








cag 


cct 


cct 


gag 


ttc 


aga 


aat 


gaa 


age 


tea 


tta 


gea 


tct 


aca 


ctg 


aag 


. 158 


Gin 


Pro 


Pro 
15 


Glu 


Phe 


Arg 


Asn 


Glu 

20 


Ser 


Ser 


Leu 


Ala 


Ser 

25 


Thr 


Leu 


Lys 




acg 


etc 


ctg 


ttc 


ttc 


aca 


get 


tta 


atg 


ate 


act 


gtt 


cct 


att 


ggg 


tta 


206 


Thr 


Leu 
30 


Leu 


Phe 


Phe 


Thr 


Ala 
35 


Leu 


Met 


He 


Thr 


Val 
40 


Pro 


He 


Gly 


Leu 




tat 


ttc 


aca 


act 


aaa 


tct 


tae 


ata 


ttt 


gaa 


ggc 


gee 


ctt 


ggg 


atg 


tee 


254 


Tyr 


Phe 


Thr 


Thr 


Lys 


Ser 


Tyr 


He 


Phe 


Glu 


Gly 


Ala 


Leu 


Gly 


Met 


Ser 




45 










50 










55 










60 




aat 


agg 


gac 


age 


tat 


ttt 


tac 


get 


get 


att 


gtt 


gea 


gtg 


gtc 


gee 


gtc 


302 


Asn 


Arg 


Asp 


Ser 


Tyr 
65 


Phe 


Tyr 


Ala 


Ala 


He 
70 


Val 


Ala 


Val 


Val 


Ala 

75 


Val 




cat 


gtg 


gtg 


ctg 


gee 


etc 


ttt 


gtg 


tat 


ktg 


gee 


tgg 


aat 


gaa 


ggc 


tea 


350 


His 


Val 


Val 


Leu 
80 


Ala 


Leu 


Phe 


Val 


Tyr 
85 


Xaa 


Ala 


Trp 


Asn 


Glu 
90 


Gly 


Ser 




ega 


cag 


tgk 


cgt 


gaa 


ggc 


aaa 


cag 


gat 


taaagtgaac atcacctttt 




397 


Arg 


Gin 


Xaa 
95 


Arg 


Glu 


Gly 


Lys 


Gin 
100 


Asp 



















tatagcatta aattcatttt ttaaaatgat aaatgctgga gggggccatc tgatttgaat 457 



<210> 443 
<211> 470 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 30. .305 



<400> 443 

agaatgtegt acteteagga ggctccace atg ttc agg gat ttc gga cgc ega 



53 
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Met 

1 


Phe 


Arg 


Asp 


Phe 
5 


Gly 


Arg 


Arg 




ctg 


cag 


agg 


gat 


ttg 


aag 


aga 


gtg 


gtg 


gat 


get 


agg 


ctg 


agg 


etc 


age 


101 


Leu 


Gin 


Arg 


Asp 


Leu 


Lys 


Arg 


Val 


Val 


Asp 


Ala 


Arg 


Leu 


Arg 


Leu 


Ser 






10 










15 










20 












gag 


gag 


etc 


age 


ggc 


ggg 


agg 


ate 


aag 


ceg 


aag 


cct 


gtg 


gag 


gtc 


cag 


149 


Glu 


Glu 


Leu 


Ser 


Gly 


Gly 


Arg 


lie 


Lys 


Pro 


Lys 


Pro 


Val 


Glu 


Val 


Gin 




25 










30 










35 










40 




gtg 


gtc 


acg 


cat 


cac 


atg 


cag 


ege 


tac 


gee 


gtg 


tgg 


ttc 


gga 


ggc 


tec 


197 


Val 


Val 


Thr 


His 


His 


Met 


Gin 


Arg 


Tyr 


Ala 


Val 


Trp 


Phe 


Gly 


Gly 


Ser 












45 










50 










55 






atg 


ctg 


gcc 


teg 


act 


ccc 


gag 


ttc 


ttt 


cag 


gtc 


tgc 


cac 


acc 


aag 


aag 


245 


Met 


Leu 


Ala 


Ser 


Thr 


Pro 


Glu 


Phe 


Phe 


Gin 


Val 


Cys 


His 


Thr 


Lys 


Lys 










60 










65 










70 








gac 


tat 


gaa 


gag 


tac 


ggg 


ccc 


age 


ate 


tgc 


ege 


cae 


aac 


eec 


gtc 


ttt 


293 


Asp 


Tyr 


Glu 


Glu 


Tyr 


Gly 


Pro 


Ser 


He 


Cys 


Arg 


His 


Asn 


Pro 


Val 


Phe 





75 80 85 

gga gtc atg tec tagtgtctgc ctgaacgcgt egttcgatgg tgtcacgttg 345 
Gly Val Met Ser 



90 

gggaacaagt gtecttcaga aeceagagaa ggcegecgtt ctgtaaatag egacgtcggt 405 

gttgetgccc agcagcgtgc ttgcattgcc ggtgcatgag gcgcggcgcg ggccttcagt 465 

aaaag 470 



<210> 444 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 162. .455 



<400> 444 

agaataegta caccaatgeg gaeaagctcc tagaageage agagcagttg getcagaegg 60 
gggaatgtga ceeegaggag atctaeaagg eagctegaea eetggaggtg cgeatceaag 120 
acttcgtgcg cagggtggag cageggaage ttctcctgga c atg tct gtt tec ttc 176 

Met Ser Val Ser Phe 



cac 


aca 


cac 


acc 


aaa 


gag 


ttg 


tgg 


aca 


tgg 


atg 


1 

gaa 


gac 


ett 


cag 


5 

aag 


224 


His 


Thr 


His 


Thr 


Lys 


Glu 


Leu 


Trp 


Thr 


Trp 


Met 


Glu 


Asp 


Leu 


Gin 


Lys 












10 










15 










20 






gag 


atg 


ttg 


gag 


gat 


gtc 


tgt 


gca 


gat 


tct 


gtg 


gat 


gca 


gtc 


cag 


gaa 


272 


Glu 


Met 


Leu 


Glu 


Asp 


Val 


Cys 


Ala 


Asp 


Ser 


Val 


Asp 


Ala 


Val 


Gin 


Glu 










25 










30 










35 








ctg 


ate 


aag 


cag 


ttc 


cag 


cag 


cag 


cag 


acc 


gcc 


act 


eta 


kat 


gee 


aca 


320 


Leu 


He 


Lys 


Gin 


Phe 


Gin 


Gin 


Gin 


Gin 


Thr 


Ala 


Thr 


Leu 


Xaa 


Ala 


Thr 








40 










45 










50 










etc 


aat 


gtc 


ate 


aag 


gaa 


ggc 


gaa 


gac 


ctt 


ate 


cag 


cag 


etc 


agg 


tea 


368 


Leu 


Asn 


Val 


He 


Lys 


Glu 


Gly 


Glu 


Asp 


Leu 


He 


Gin 


Gin 


Leu 


Arg 


Ser 






55 










60 










65 












gcg 


ect 


ccc 


tec 


etc 


ggg 


gag 


ccc 


age 


gag 


gee 


agg 


tea 


gca 


tgg 


gca 


416 


Ala 


Pro 


Pro 


Ser 


Leu 


Gly 


Glu 


Pro 


Ser 


Glu 


Ala 


Arg 


Ser 


Ala 


Trp 


Ala 




70 










75 










80 










85 




Icag 


ett 


tec 


agt 


ggg 


aaa 


tgc 


etc 


ggg 


eta 


gac 


gtg 


aga 


taagtcctgt 


4 65 


Xaa 


Leu 


Ser 


Ser 


Gly 


Lys 


Cys 


Leu 


Gly 


Leu 


Asp 


Val 


Arg 











90 95 
cccagtcttg acteagcaaa atettgettt gtggtcttgt gtgagcette ggttte 521 



<210> 445 
<211> 625 
<212> DNA 
<213> Homo sapiens 



wo 02/083898 
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PCT/IBOl/00914 



<220> 
<221> CDS 
<222> 281. .547 



<400> 445 

atgctcgcca gaatacccct cgttaaaggc aaggcggctt ctggctcttc cgcagctcag 60 
ttatgggttc ctgtgtgctt agtcatcggg tccgacacag gctggactga tctggggagc 120 
cgcgaagggc ctgccttcac aaagggacgt aacgcaagta ctgcgggcag tgtttgaata 180 
tggccctgaa caatgtgtcc ctgtcctccg gtgatcagag gagcagggtg gcctaccgct 240 
cttcccatgg cgacctcaga ccgcgggcgt casgttggcg atg gtc tec gga gac 295 

Met Val Ser Gly Asp 
1 5 

ggc ttc etc gtt tec agg cct gaa gcg att cat eta gga ect egg cag 343 
Gly Phe Leu Val Ser Arg Pro Glu Ala lie His Leu Gly Pro Arg Gin 

10 15 20 

gcg gtg ega eca age gtt egg gee gag age egt ega gtg gat ggt ggc 391 
Ala Val Arg Pro Ser Val Arg Ala Glu Ser Arg Arg Val Asp Gly Gly 

25 30. 35 

ggc egg age eca agg gaa eca gat ggc egg ggc egg age egc caa gcs 4 39 

Gly Arg Ser Pro Arg Glu Pro Asp Gly Arg Gly Arg Ser Arg Gin Ala 

40 45 50 

aga ttc tea cct tac eca ate cct gee gtt gaa cec gat etc eta aga 4 87 

Arg Phe Ser Pro Tyr Pro He Pro Ala Val Glu Pro Asp Leu Leu Arg 

55 60 65 

agt gtg ctg caa cag egt ttg att gca tta gga ggt gtt ate gea get 535 
Ser Val Leu Gin Gin Arg Leu He Ala Leu Gly Gly Val He Ala Ala 
70 75 80 85 

ega att tea gtt taaacgaaca cctttcctct ggccctcact tagcttgtga 587 
Arg He Ser Val 

acaggccttt ttaaaatcct ttettggtgt agcagcaa 625 

<210> 446 

<211> 534 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 8. .523 



<400> 446 



agtcaag 


atg 


gcg 


gga 


gea 


get 


ace 


cag 


get 


tee 


ctg 


gag 


teg 


gee 


eca 


49 






Met 


Ala 


Gly 


Ala 


Ala 


Thr 


Gin 


Ala 


Ser 


Leu 


Glu 


Ser 


Ala 


Pro 








1 






5 










10 










97 


egg 


ate 


atg 


egg 


ctg 


gtg 


gee 


gaa 


tgc 


age 


egc 


tec 


agg 


gee 


egg 


gca 


Arg 


He 


Met 


Arg 


Leu 


Val 


Ala 


Glu 


Cys 


Ser 


Arg Ser Arg Ala Arg Ala 




15 










20 










25 










30 




ggc 


gag 


ctg 


tgg 


ctg 


ccg 


cat 


ggg 


aca 


gtg 


gee 


act 


cct 


gtg 


ttc 


atg 


145 


Gly 


Glu 


Leu 


Trp 


Leu 


Pro 


His 


Gly 


Thr 


Val 


Ala 


Thr 


Pro 


Val 


Phe 


Met 










35 










40 










45 






eca 


gtg 


ggc 


a eg 


cag 


gee 


ace 


atg 


aag 


ggc 


ate 


acg 


ace 


gaa 


cag 


ctg 


193 


Pro 


Val 


Gly 


Thr 


Gin 


Ala 


Thr 


Met 


Lys 


Gly 


He 


Thr 


Thr 


Glu 


Gin 


Leu 








50 










55 










60 








gac 


get 


ctg 


ggt 


tge 


egc 


ate 


tgc 


ctg 


ggc 


aat 


ace 


tac 


cat 


ctg 


ggt 


241 


Asp 


Ala 


Leu 
65 


Gly 


Cys 


Arg 


He 


Cys 
70 


Leu 


Gly 


Asn 


Thr 


Tyr 
75 


His 


Leu 


Gly 




eta 


agg 


ccg 


gga 


cec 


gag 


ctg 


ate 


cag 


aaa 


gee 


aac 


ggt 


etc 


eac 


ggc 


289 


Leu 


Arg 
80 


Pro 


Gly 


Pro 


Glu 


Leu 
85 


He 


Gin 


Lys 


Ala 


Asn 
90 


Gly 


Leu 


His 


Gly 




ttc 


atg 


aat 


tgg 


cct 


cat 


aat 


ctg 


eta 


acg 


gac 


age 


ggc 


ggt 


ttc 


cag 


337 


Phe 


Met 


Asn 


Trp 


Pro 


His 


Asn 


Leu 


Leu 


Thr 


Asp 


Ser 


Gly Gly 


Phe 


Gin 




95 










100 










105 










110 





wo 02/083898 

atg gtg teg ctg 
Met Val Ser Leu 

ttc cgc tec ccc 
Phe Arg Ser Pro 
130 

tec gtg cag ate 
Ser Val Gin He 
145 

gac gac gtg gtt 
Asp Asp Val Val 
160 

taggccatgt a 
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gtg tct ctg tec 
Val Ser Leu Ser 
115 

tac gac ggc aat 
Tyr Asp Gly Asn 

cag aat gcg ctg 
Gin Asn Ala Leu 
150 

age agt act gtg 
Ser Ser Thr Val 
165 



257 

gag gtg acg 
Glu Val Thr 

120 
gag ace ctg 
Glu Thr Leu 
135 

ggc teg gac 
Gly Ser Asp 

act ggg cca 
Thr Gly Pro 



gag gag 
Glu Glu 

ctg age 
Leu Ser 

ate ate 
He He 
155 
cgt gtg 
Arg Val 
170 



ggc gtc cgc 
Gly Val Arg 

125 
ceg gag aaa 
Pro Glu Lys 
140 

atg cag ctg 
Met Gin Leu 

gag 
Glu 



<210> 447 
<211> 111 
<212> DNA 
<213> Homo sapiens 

<400> 447 

agagctgage tgaagcggga cecggagccc gagcagccgc cgecatggca atcaaatttc 
tggaagtcat caagcectte tgtgteatec tgcaactaac atctgtgaag c 

<210> 448 
<211> 150 
<212> DNA 
<213> Homo sapiens 

<400> 448 

ttttcteaac aattcctcac egcagagcet ctgaagctec caccaggcea getctcctcc 

acccaccgct aactctcagc cecagtcace ctettggagc ttcectgctt tgaattaaag 

aceactcatg cgeaaaaaaa aaaaaaaaaa 

<210> 449 
<211> 465 
<212> DNA 
<213> Homo sapiens 



385 



433 



481 



523 



534 



60 
111 



60 
120 
150 



<400> 449 

aaccaaagca 

tcagcettea 

tggagattga 

tcaaggagct 

acaagatccg 

aggagaacgg 

taccggggga 

aggggtctct 



tgggctcttt 
ettcctcctc 
cgtggatgag 
gctggttgac 
gggcatgcag 
tggctcccac 
ccetgcggcc 
teccctgeee 



<210> 450 
<211> 394 
<212> DNA 
<213> Homo sapiens 



gttteetett 
actcecgggg 
ctcctggaca 
tgttacaaac 
aagctgagca 
aggacaawtc 
aggctggtgc 
cctcagtttt 



tgecaggect 
agctgtgaca 
tggagagtga 
ccacagaggc 
caccccagaa 
gctgcccccc 
catgagcagg 
ceaettttgg 



gggtcettce 
ggaagaggag 
cgatgcccgg 
cttcatttct 
gaagtgaggg 
aacctegtag 
getcetcgtg 
ggtyt 



eagtgaeecc 
ateeeagaac 
getgecaggg 
ggcctgctgg 
tccccgaccc 
caaeakgcaa 
cecctggcee 



60 
120 
180 
240 
300 
360 
420 
4 65 



<400> 450 

agtctgaaga 

cagccggttg 

cactttgcac 

tttcacagag 

caacaggaaa 

cttccgcaaa 

ttaaeataac 



tggcggcctc 
cttttgtgag 
agttcggcca 
gtttgggttg 
atcatattat 
catetgatga 
tgacaaaaaa 



ageagegega 
aagaattcct 
tgtcagaagg 
ggttcagttt 
agatggagta 
tgaaaagaaa 
aaaaaaaaaa 



ggtgctgcgg 
tggactgcgg 
tgcattttac 
tcttcagaag 
aaggtccagg 
gatttttgag 
arse 



cgctgcgtag 
cgtcgagtea 
cttttgacaa 
aaggacttcg 
ttcacactag 
actgeagcet 



aagtatcaat 
gctgaaagaa 
ggagactggc 
gaatgcacta 
aaggccaaaa 
attaataaag 



60 
120 
180 
240 
300 
360 
394 



<210> 451 
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<211> 391 
<212> DNA 
<213> Homo sapiens 

<400> 451 

aagtctgaag atggcggcct cagcagcgcg aggtgctgcg gcgctgcgta gaagtatcaa 60 

tcagccggtt gcttttgtga gaagaattcc ttggactccg gcgtcgagtc agctgaaaga 120 

acactttgca cagttcggcc atgtcagaag gtgcatttta ccttttgaca aggagactgg 180 

ctttcacaga ggtttgggtt gggttcagtt ttcttcagaa gaaggacttc ggaatgcact 240 

acaacaggaa aatcatatta tagatggagt aaaggtccag gttcacacta gaaggccaaa 300 

acttccgcaa acatctgatg atgaaaagaa agatttttga gactgcagcc tattaataaa 360 

gttaacataa ccaaaaaaaa aaaaaaaaag s 391 

<210> 452 
<211> 427 
<212> DNA 
<213> Homo sapiens 

<400> 452 

cccartcaga gggctgaaca tgtccgagtt tgtctgtaac ctgtcagcaa ggccttatgt 60 

gtacgacctc attgccgtgt ccaatcatta tggagccatg ggggttggcc actactaaag 120 

cagcttatgt gctattttac caacgtcgag atgatgaatt ttataagaca ccttcactta 180 

gcagttctgg ttcctctgat ggagggacac gaccaagcag ctctcagcag ggctttgggg 240 

atgatgaggc ttgcagcatg gacaccaact aatgctgact ccacgatcct gccaccctgt 300 

agcgccagtg taatccccca ggagaacatc tttgacactc tgcagactgc tagtgttctg 360 

tctaaaaacc agacaaggaa atacccttct tttatgagca gaaggaaaca aaaaaaaaaa 420 

aaaaagg 427 

<210> 453 
<2L1> 438 
<212> DNA 
<213> Homo sapiens 

<400> 453 

agatgttagg ctttgccaaa aactggatct ttagtatcac ttgagcactc attggacaca 60 

agaggtcaac tgttttgtca tttgtgtagt aagaacaaaa taaagtgtgt ggcattaaag 120 

aaataacact gtctggccgg gatcggtggc tcacgcctgt aatcccagca ctttgggagg 180 

ccgaggcggg cagatcacga ggtcaggagt tcgagaccag cctgaccaac atgatgaaac 240 

cctgtctcta ctaaaaatac aaaaattagc caggcatggt ggtgcacacc tgtagtccca 300 

gctgctcagg aggctgaggc aggagaatcg ctcgaacccg ggaggtggag gttgcagtga 360 

gctgagatgg cgccactgca ttccaggcct ggggggtgac agagcaagac tcagtctcag 420 

aaaaaaaaaa aaaaaaag 438 

<210> 454 
<211> 394 
<212> DNA 
<213> Homo sapiens 

<400> 454 

agtctgaaga tggcggcctc agcagcgcga ggtgctgcgg cgctgcgtag aagtatcaat 60 

cagccggttg cttttgtgag aagaattcct tggactgcgg cgtcgagtca gctgaaagaa 120 

cactttgcac agttcggcca tgtcagaagg tgcattttac cttttgacaa ggagactggc 180 

tttcacagag gtttgggttg ggttcagttt tcttcagaag aaggacttcg gaatgcacta 240 

caacaggaaa atcatattat agatggagta aaggtccagg ttcacactag aaggccaaaa 300 

cttccgcaaa catctgatga tgaaaagaaa gatttttgag actgcagcct attaataaag 360 

ttaacataac tgamaaaaaa aaaaaaaaaa aggc 394 

<210> 455 
<211> 713 
<212> DNA 
<2l3> Homo sapiens 



<400> 455 
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atataggcgt ggctgaatac gggcctcctg cgtacgtgcg tggtgtacgt acgtgcgtga 60 

ttacgcgcam. acgtacgttc ctcatgaaag ggacgacggg agctgcatga aagccgaagt 120 

tatggaccgc tagcatctgt cactggccac cggtttccgg gagtaagcgg cagctacctt 180 

acagccctga cacgagccgg gtgctctctc ttctcaccgc ggcccacgtc tcctcgctgg 240 

ctccggtggc ctcgctgggt cgcgaggagg cggaggactg tactctgagg ccaaaagcca 300 

gagtcggccc tgaacgccca cgactctcag ggtccagagg ccgtgagacc ggccggctga 360 

aaggtaaaga aaccaagtgg aagagtgttt cctcctctgg ccgtaaagca ggtactctct 420 

gcagcaccag ctgtccccgc cctactccgg accgccmcaa agactccatg ggatggacct 480 

gagtcagccg aatcctagcc ccttcccttg ggcctgctgt ggtgctcgac atcagtgaca 540 

gacggaagca gcagaccatc aaggctacgg gaggcccggg gcgcttgcga agatgaagtt 600 

tggctgcctc tccttccggc agccttatgc tggctttgtc ttaaatggaa tcaagactgt 660 

ggagacgcgc tggcgtcctc tgctgagcag ccagcggaac tgtaccatcg ccg 713 

<210> 456 
<211> 128 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -34 . .-1 



<400> 456 

Met Ser Leu Arg 

Leu His Ala Leu 
-15 

Leu Ala Ser Ser 
1 

Lys Glu Glu Ser 
15 

Cys lie Lys Thr 

Glu Val He Gly 
50 

Thr Leu Lys Asp 
65 

He Lys Lys He 
80 



Leu Asp Thr Thr 
-30 

Gin Val Leu Leu 

Thr Lys Gly Gin 

5 

Leu Asp Ser Asp 
20 

Thr Ser Gly He 
35 

Lys Gly Thr His 

Gly Arg Lys lie 
70 

Val Gin Lys Lys 
85 



Pro Ser Cys Asn 
-25 

Leu Leu Ser Leu 
-10 

Thr Lys Arg Asn 
10 

Leu Tyr Ala Glu 
25 

His Pro Lys Asn 
40 

Cys Asn Gin Val 
55 

Cys Leu Asp Pro 

Leu Ala Gly Asp 
90 



Ser 


Ala 


Arg 


Pro 






-20 




Leu 


Leu 


Thr 


Ala 




-5 






Leu 


Ala 


Lys 


Gly 


Leu 


Arg 


Cys 


Met 








30 


He 


Gin 


Ser 


Leu 






45 




Glu 


Val 


He 


Ala 




60 






Asp 


Ala 


Pro Arg 


75 








Glu 


Ser Ala Asp 



<210> 457 

<211> 65 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -18.. -1 



<400> 457 












Met Lys Ala 


Ala 
-15 


Val 


Leu 


Thr 


Leu 


Gin Ala Arg 


His 


Phe 


Trp 


Gin 


Gin 


1 








5 




Asp Arg Val 


Lys 


Asp 


Leu 


Ala 


Thr 


15 






20 






Ser Gly Arg Asp 


Tyr 


Val 


Ser 


Gin 



35 

Ser 



Ala Val Leu Phe Leu Thr Gly Ser 
-10 -5 
Asp Glu Pro Pro Gin Ser Pro Trp 
10 

Val Tyr Val Asp Val Leu Lys Asp 

25 30 
Phe Glu Gly Ser Ala Trp Glu Asn 
40 45 



<210> 458 
<211> 80 
<212> PRT 
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<220> 

<221> SIGNAL 
<222> -26. .-1 

<220> 

<221> UNSURE 
<222> 31 

<223> Xaa = Ala,Val 
<400> 458 

Met Gly Arg Ala Met Val Ala Arg Leu Gly Leu Gly Leu Leu Leu Leu 

-25 -20 -15 

Ala Leu Leu Leu Pro Thr Gin lie Tyr Ser Ser Glu Thr Thr Thr Gly 
-10 -5 15 

Thr Ser Ser Asn Ser Ser Gin Ser Thr Ser Asn Ser Gly Leu Ala Pro 

10 15 20 

Asn Pro Thr Asn Ala Thr Thr Lys Xaa Ala Gly Gly Ala Leu Gin Ser 

25 30 35 

Thr Ala Ser Leu Phe Val Val Ser Leu Ser Leu Leu His Leu Tyr Ser 

40 45 50 

<210> 459 
<211> 101 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -27.. -1 

<400> 459 



Met 


Ser 


Ser 


Ala 


Ala 


Gly 


Phe 


Cys 


Ala 


Ser Arg 


Pro 


Gly Leu Leu 


Phe 






-25 










-20 








-15 








Leu 


Gly 
-10 


Leu 


Leu 


Leu 


Leu 


Pro 
-5 


Leu 


Val 


Val Ala 


Phe 
1 


Ala 


Ser 


Ala 


Glu 
5 


Ala 


Glu 


Glu 


Asp 


Gly Asp Leu Gin Cys Leu Cys Val 


Lys 


Thr 


Thr 


Ser 










10 










15 








20 




Gin 


Val 


Arg 


Pro 

25 


Arg 


His 


He 


Thr 


Ser 

30 


Leu Glu 


Val 


He 


Lys 

35 


Ala 


Gly 


Pro 


His 


Cys 


Pro 


Thr 


Ala 


Gin 


Leu 


He 


Ala Thr 


Leu 


Lys 


Asn 


Gly Arg 






40 










45 








50 








Lys 


He 
55 


Cys 


Leu 


Asp 


Leu 


Gin 
60 


Ala 


Pro 


Leu Tyr 


Lys 
65 


Lys 


He 


He 


Lys 


Lys 


Leu 


Leu 


Glu 


Ser 























70 

<210> 460 
<211> 86 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -18. .-1 

<400> 460 

Met He Pro Ala Val Val Leu Leu Leu Leu Leu Leu Val Glu Gin Ala 

-15 -10 -5 

Ala Ala Leu Gly Glu Pro Gin Leu Cys Tyr He Leu Asp Ala He Leu 

1 5 10 

Phe Leu Tyr Gly He Val Leu Thr Leu Leu Tyr Cys Arg Leu Lys He 
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15 20 25 30 

Gin Val Arg Lys Ala Ala He Thr Ser Tyr Glu Lys Ser Asp Gly Val 

35 40 45 

Tyr Thr Gly Leu Ser Thr Arg Asn Gin Glu Thr Tyr Glu Thr Leu Lys. 

50 55 60 

His Glu Lys Pro Pro Gin 
65 

<210> 461 
<211> 92 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -24 . .-1 



<400> 461 










Met Val 


Met 


Gly Leu Gly 


Val Leu 








-20 






Gly Leu 


Thr 


Pro 

-5 


Pro 


Thr 


Leu Ala 


Phe Leu 


Thr 


Gin 


His 


Tyr 


Asp Ala 


10 










15 


Tyr Cys 


Glu 


Ser 


He 


Met 


Arg Arg 


25 








30 




Asp He 


Asn 


Thr 


Phe 


He 


His Gly 








45 






Val Lys 


Thr 


Arg 


Met 


Glu 


Thr Leu 



60 



Leu Leu Val Phe Val Leu Gly Leu 

-15 -10 
Gin Asp Asn Ser Arg Tyr Thr His 
1 5 
Lys Pro Gin Gly Arg Asp Asp Arg 
20 

Arg Gly Leu Thr Ser Pro Cys Lys 

35 40 
Asn Lys Arg Arg Ser Arg Pro Ser 

50 55 
Thr Glu Lys Thr 
65 



<210> 462 
<211> 111 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -16. .-1 

<220> 

<221> UNSURE 
<222> 50 

<223> Xaa = Asp, Asn 



<400> 462 



Met Leu 


Leu 


He 


Leu Leu 


Ser 


Val 


-15 








-10 




Gin Asp 


Leu 


Asp 


Glu Asp 


Val 


Ser 


1 






5 






Ser Asp 


Gly 


Gly 


Asp Ser 


Glu 


Gin 






20 








Pro Pro 


Leu 


Gly 


Gly Gin 


Gin 


Ser 




35 








40 


Gin Xaa 


Asp 


Gly 


Pro Gin 


Gin 


Gly 


50 








55 




Gin Gin 


Gly 


Pro 


Pro Pro 


Pro 


Gin 


65 






70 






Gin Gly 


Gly 


Arg 


Pro Gin 


Gly 


Pro 



85 



Ala Leu Leu Ala Phe Ser Ser Ala 
-5 

Gin Glu Asp Val Pro Leu Val He 

10 15 
Phe He Asp Glu Glu Arg Gin Gly 
25 30 
Gin Pro Ser Ala Gly Asp Gly Asn 
45 

Pro Pro Gin Gin Gly Gly Gin Gin 
60 

Gly Lys Pro Gin Gly Pro Pro Pro 

75 80 
Pro Gin Gly Gin Ser Pro Gin 
90 95 



<210> 463 
<211> 106 
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<212> PRT 






<213> Homo sapiens 






<220> 






<221> SIGNAL 






<222> -16. .-1 






<400> 463 






Met Leu Leu He Leu Leu 


Ser 


Val 


-15 


-10 




Gin Asp Leu Asp Glu Asp 


Val 


Ser 


1 5 






Ser Asp Gly Gly Asp Ser 


Glu 


Gin 


20 






Pro Pro Leu Gly Gly Gin 


Gin 


Ser 


35 




40 


Gin Asp Asp Gly Pro Gin 


Gin Gly 


50 


55 




Gin Gin Gly Pro Pro Pro 


Pro 


Gin 


65 70 






Gin Gly Gly Gin Ser Cys 


Cys 


Cys 



85 



<210> 464 
<211> 141 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -24. .-1 



<400> 464 



Met 


Pro 


Ser 


Ser 


Val 


Ser 


Trp Gly 










-20 








Cys 


Leu 


Val 


Pro 

_c 


Val 


Ser 


Leu 


Ala 


Gin 


Lys 


Thr Asp Thr Ser 


His 


His 




10 










15 




Lys 


He 


Thr 


Pro 


Asn 


Leu 


Ala 


Glu 


25 










30 






Leu 


Ala 


His 


Gin 


Ser 


Asn 


Ser 


Thr 










45 








He 


Ala 


Thr 


Ala 


Phe 


Ala 


Met 


Leu 








60 










His 


Asp 


Glu 


He 


Leu 


Glu 


Gly Leu 






75 










80 


Glu 


Ala 


Gin 


He 


His 


Glu 


Gly 


Phe 




90 










95 




Gin 


Pro 


Asp Ser Gin 


Leu 


Gin 


Leu 



105 110 

<210> 465 
<211> 124 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -30.. -1 

<400> 465 

Met Pro Ala Cys Arg Leu Gly Pro 



Ala 


Leu 


Leu 


Ala Phe Ser Ser Ala 
-5 


Gin 


Glu 


Asp 


Val Pro Leu Val He 




10 




15 


Phe 


He 


Asp 


Glu Glu Arg Gin Gly 


25 






30 


Gin 


Pro 


Ser 


Ala Gly Asp Gly Asn 








45 


Pro 


Pro 


Gin 


Gin Gly Gly Gin Gin 








60 


Gly Lys 


Pro 


Gin Gly Pro Pro Gin 






75 


80 


Asp 


Lys 







90 



He 


Leu 


Leu 


Leu Ala Gly Leu 


Cys 




-15 




-10 




Glu 


Asp 


Pro 


Gin Gly Asp Ala 


Ala 


1 






5 




Asp Gin 


Asp 


His Pro Thr Phe 


Asn 








20 




Phe 


Ala 


Phe 


Ser Leu Tyr Arg Gin 






35 




40 


Asn 


He 


Phe 


Phe Ser Pro Val 


Ser 




50 




55 




Ser 


Leu 


Gly 


Thr Lys Ala Asp 


Thr 


65 






70 




Asn 


Phe 


Asn 


Leu Thr Glu He 


Pro 








85 




Gin 


Glu 


Leu 


Leu Arg Thr Leu 


Asn 








100 




Thr 


Thr 


Gly 


Lys Asn 








115 







Leu Ala Ala Ala Leu Leu Leu Ser 



wo 02/083898 



-30 






-25 








Leu 


Leu 


Leu Phe 


Gly Phe 


Thr 


Leu 


Val 








-10 








Thr 


Gly 


Val Cys 


Pro Glu 


Leu 


Gin 


Ala 






5 






10 




Cys 


Val 


Ser Asp Ser Glu 


Cys Ala 


Asp 




20 






25 






Gly 


Cys 


Ala Thr 


Phe Cys 


Ser 


Leu 


Pro 


35 






40 








Pro 


Gin 


Val Asn 


lie Asn 


Phe 


Pro 


Gin 








55 








Cys 


Gin 


Val Asp 


Ser Gin 


Cys 


Pro 


Gly 






70 








75 


Gly 


Cys 


Gly Lys 


Val Ser 


Cys 


Val 


Thr 






85 






90 





<210> 466 
<211> 124 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -30.. -1 



<400> 466 



Met 


Pro 


Ala Cys Arg Leu 


Gly 


Pro 


Leu 


-30 




-25 








Leu 


Leu 


Leu Phe Gly Phe 


Thr 


Leu 


Val 






-10 








Thr 


Gly 


Val Cys Pro Glu 


Leu 


Gin 


Ala 






5 




10 




Cys 


Val 


Ser Asp Ser Glu 


Cys 


Ala 


Asp 




20 




25 






Gly 


Cys 


Ala Thr Phe Cys 


Ser 


Leu 


Pro 


35 




40 








Pro 


Gin 


Val Asn lie Asn 


Phe 


Pro 


Gin 






55 








Cys 


Gin 


Val Asp Ser Gin 


Cys 


Pro 


Gly 






70 






75 


Gly 


Cys 


Gly Lys Val Ser 


Cys 


Val 


Thr 






85 




90 





<210> 467 
<211> 69 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -17. .-1 

<400> 467 



Met 


Phe 


Pro 
-15 


Gly 


Phe 


Val 


Phe 


Leu 
-10 


Leu 


Ala 


Ala 


His 


Leu 


Trp Val 


Leu 


Ala 


Ala 




1 








5 








Lys 


Val 


Cys 


Ser 


Phe 


Thr 


Pro 


Glu 


Ala 








20 










Glu Gly Ala 


Asn 


Asn 


Ser 


Arg 


Arg 


Thr 








35 










40 


His 


His 


Lys 


Gly 


Leu 











50 
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-20 










-15 


Ser 


Gly 


Thr 


Gly Ala 


Glu 


Lys 


-5 










1 




Asp 


Gin 


Asn 


Cys 


Thr 


Gin 


Glu 








15 








Asn 


Leu 


Lys 


Cys 


Cys 


Ser 


Ala 






30 










Asn 


Asp 


Lys 


Glu Gly Ser Cys 




45 










50 


Leu 


Gly Leu 


Cys Arg Asp Gin 


60 










65 




Gin 


Met 


Lys 


Cys 


Cys 


Arg Asn 










80 






Pro 


Asn 


Phe 










Ala 


Ala 


Ala 


Leu 


Leu 


Leu 


Ser 




-20 










-15 


Ser 


Gly Thr 


Gly Ala 


Glu 


Lys 


-5 










1 




Asp 


Gin 


Asn 


Cys 


Thr 


Gin 


Glu 








15 








Asn 


lie 


Lys 


Cys 


Cys 


Ser 


Ala 






30 










Asn 


Asp 


Lys 


Glu 


Gly 


Ser 


Cys 




45 










50 


Leu 


Gly 


Leu 


Cys 


Arg 


Asp 


Gin 


60 










65 




Gin 


Met 


Lys 


Cys 


Cys 


Arg 


Asn 










80 






Pro 


Asn 


Phe 










Phe 


Val 


He 


Ser 


Leu 


Ala 


Ala 








-5 








Phe 


Met 


Gly 


Arg 


He 


Thr 


Val 




10 










15 


Ser 


Lys 


Thr 


Val 


Ser 


Pro 


Pro 


25 










30 




Ala 


Phe 


Lys 


Ser 


Cys 


Asn 


Thr 










45 
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<210> 468 
<211> 77 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -33. .-1 

<400> 468 



Met 


Gly 


Pro 


Val 


Lys 


Gin 


Leu 


Lys 


Arg Met 


Phe 


Glu 


Pro 


Thr Arg 


Leu 








-30 










-25 










-20 






He 


Ala 


Thr 
-15 


He 


Met 


Val 


Leu 


Leu 
-10 


Ser 


Phe 


Ala 


Leu 


Thr 
-5 


Leu 


Cys 


Ser 


Ala 


Phe 

1 


Trp 


Trp 


His 


Asn 
5 


Met 


Gly 


Leu 


Ala 


Leu 
10 


He 


Phe 


Cys 


He 


Leu 
15 


Gin 


Ser 


Leu 


Ala 


Leu 
20 


Thr 


Trp 


Tyr 


Ser 


Leu 
25 


Ser 


Phe 


He 


Pro 


Phe 
30 


Ala 


Arg Asp Ala 


Val 


Lys 


Lys 


Cys 


Phe 


Ala 


Val 


Cys 


Leu 


Ala 














35 










40 

















<210> 469 
<211> 146 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -26. .-1 

<400> 469 



Met 


Thr 


Met 


Arg 


Ser 


Leu 


Leu Arg Thr Pro Phe 


Leu Cys 


Gly Leu 


Leu 




-25 










-20 


-15 








Trp 


Ala 


Phe 


Cys 


Ala 


Pro 


Gly Ala Arg Ala Glu 


Glu 


Pro 


Ala Ala 


Ser 


-10 










-5 


1 






5 




Phe 


Ser 


Gin 


Pro 


Gly 


Ser 


Met Gly Leu Asp Lys 


Asn 


Thr 


Val His 


Asp 








10 






15 






20 




Gin 


Glu 


His 


He 


Met 


Glu 


His Leu Glu Gly Val 


He 


Asn 


Lys Pro 


Glu 






25 








30 




35 






Ala 


Glu 


Met 


Ser 


Pro 


Gin 


Glu Leu Gin Leu His 


Tyr 


Phe 


Lys Met 


His 




40 










45 


50 








Asp 


Tyr 


Asp 


Gly 


Asn 


Asn 


Leu Leu Asp Gly Leu 


Glu 


Leu 


Ser Thr 


Ala 


55 










60 


65 








70 


He 


Thr 


His 


Val 


His 


Lys Glu Glu Gly Ser Glu 


Gin 


Ala 


Pro Leu 


Met 










75 




80 






85 




Ser 


Glu 


Asp 


Glu 


Leu 


He 


Asn He He Asp Gly 


Val 


Leu Arg Asp Asp 








90 






95 






100 




Asp 


Lys 


Asn 


Asn 


Asp 


Gly 


Tyr He Asp Tyr Ala 


Glu 


Phe 


Ala Lys 


Ser 






105 








110 




115 







Leu Gin 
120 



<210> 470 
<211> 62 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -46.. -1 



<400> 470 



wo 02/083898 PCT/IBO 1/009 14 
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Met Asp Gin Leu Val Phe Lys Glu Thr He Trp Asn Asp Ala Phe Trp 

-45 -40 -35 

Gin Asn Pro Trp Asp Gin Gly Gly Leu Ala Val He He Leu Phe He 
-30 -25 -20 -15 

Thr Ala Val Leu Leu Leu He Leu Phe Ala He Val Phe Gly Leu Leu 

-10 -5 1 

Thr Ser Thr Glu Asn Thr Gin Cys Glu Ala Gly Glu Glu Glu 
5 10 15 



<210> 471 
<211> 162 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -47.. -1 



<400> 471 



Met Arg 


He 


Ala 




-45 




Gly Ala 


Gly 


Trp 


-30 






Thr Ser 


Leu 


Ala 


-15 






Met Gin 


Met 


Tyr 






5 


Gin Gly 


Gly 


Leu 




20 




Phe Asn 


Asn 


Leu 


35 






He Phe 


Pro 


Glu 


50 






Gly Leu 


He 


His 


Val Gly 


Leu 


Tyr 






85 


Ala Ala 


Pro 


Val 




100 




Arg Asn 






115 







Asn Arg Thr Arg 

-40' 

Thr His Gly Arg 
-25 

Leu Ser Ser Leu 
-10 

Ser Arg Gin Leu 

Leu Gly Ser Gly 
25 

Glu Asn Leu Val 
40 

He Leu Leu Cys 

55 

Arg Val Cys Val 
70 

Tyr He Asn Lys 

Leu Thr Pro Ala 
105 



Phe Ser Ser Pro 

Gly Met Met Val 
-20 

Leu Ser Leu Leu 
-5 

Ala Ser Thr Glu 

10 

Leu Phe Val Phe 

Phe Gly Lys Gly 
45 

Leu Leu Leu Ala 
60 

Thr Thr Cys Phe 
75 

He Ser Ser Thr 
90 

Lys Val Thr Gly 



Phe 


Leu Ala Arg 


-35 








Val 


Gly Thr Gly 


Leu 


Phe 


Ala 


Gly 








1 


Trp 


Leu 


Thr 


He 




15 






Ser 


Leu 


Thr 


Ala 


30 








Phe 


Gin 


Ala 


Lys 


Leu 


Phe 


Ala 


Ser 








65 


He 


Phe 


Ser 


Met 






80 




Leu 


Tyr 


Gin 


Ala 




95 






Lys 


Ser 


Lys 


Lys 


110 









<210> 472 
<211> 90 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -35. .-1 



<400> 472 










Met Glu Thr 


Gly 


Ala 


Ser 


Ala Ser 


-35 






-30 




Met Arg Arg 


He 


Trp 
-15 


Ser 


Leu Gly 


Ser Trp Ala 


Ala 
1 


Leu 


Phe 


His Asp 
5 


He Val Ser 


Asp 


Leu 


Ser 


Thr Val 


15 








20 


Phe Gin Leu 


Leu 


Asp Val 


Trp Thr 


30 






35 




Ser Met Cys 


Phe 


Val 


Phe 


Leu Leu 



He Pro Glu Leu He Cys Glu Ala 
-25 -20 
Leu Gly Leu Val Thr Leu Thr Ala 

-10 -5 
Gly Phe Ala Val Leu Gly Gly Asn 
10 

Arg Phe Val Ala Gin Gin Gin His 
25 

Arg Asn Phe Arg Lys Pro Leu Gly 
40 45 

Leu Pro 
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55 
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<210> 


473 












<211> 


61 












<212> 


PRT 












<213> 


Homo sapiens 












<220> 














<221> 


SIGNAL 












<222> 


-25. .-1 












<220> 














<221> 


UNSURE 












<222> 


27 












<223> 


Xaa = Ala, Val 












<400> 


473 












Met Arg Leu Leu Gin 


Leu Leu Phe Arg Ala 


Ser Pro 


Ala 


Thr Leu 


Leu 


-25 




-20 


-15 






-10 


Leu Val Leu Cys Leu 


Gin Leu Gly Ala Asn Lys Ala 


Gin 


Asp Asn 


Thr 




-5 


1 






5 




Arg Lys lie He He 


Lys Asn Phe Asp He 


Pro Lys 


Ser 


Val Arg 


Pro 




10 


15 




20 






Asn Asp Glu Xaa Leu 


Gin Cys Leu Gin Phe 


Lys Gin 


Asn 







25 30 35 



<210> 474 
<211> 120 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -32. .-1 

<400> 474 



Met 


Ala 


Ser 


Pro 


Lys 


Gly Phe Phe Asn Tyr 


Leu Thr 


Tyr 


Phe 


Leu 


Ala 






-30 






-25 




-20 








Ala 


Gly 


Ala 


Val 


Thr 


Leu Gly He Gly Phe 


Phe Ala 


Leu 


Ala 


Ser 


Ala 




-15 








-10 


-5 










Leu 


Trp 


Phe 


Leu 


He 


Cys Lys Arg Arg Glu 


He Phe 


Gin 


Asn 


Ser 


Lys 


1 








5 


10 








15 




Phe 


Lys 


Ala 


He Asp 


Glu Arg Cys Arg Gin Arg Pro 


Ser 


Met 


Ala 


Lys 








20 




25 






30 






He 


Lys 


Ser 


His 


Ser 


Gin Cys Val Phe He 


Ser Arg 


Asn 


Phe 


His 


Thr 






35 






40 




45 








Gly Arg 


Phe 


Gin 


Leu 


Gin Gin Leu Lys He 


He Leu 


Lys 


Met 


Asn 


Pro 




50 








55 


60 










Asn 


Leu 


Gin 


Gin 


Lys 


He Ser Phe Val He 


Pro Gin 


Arg 


Pro 


Ala 


Pro 


65 










70 


75 








80 


Gin 


Gin 


He 


Ala 


Ala 


Val Leu His 













85 



<210> 475 
<211> 78 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -19. .-1 



<400> 475 



wo 02/083898 



Met Lys Phe Phe 


Val 


Phe 


Ala 


Leu Val 




-15 








He Ser Ala Asp 


Ser 


His 


Glu Lys Arg 


1 








5 


Phe His Glu Lys 


His 


His 


Ser 


Tyr His 


15 






20 




Glu Glu Ser Ser 


Lys 


Ser 


Asn 


Ala Asn 


30 




35 






Tyr Thr Leu Cys 


Phe Arg 


He 


Leu Ala 




50 








<210> 476 










<211> 87 










<212> PRT 










<213> Homo sapiens 








<220> 










<221> SIGNAL 










<222> -20. .-1 










<400> 476 










Met Gin Lys Val 


Thr 


Leu 


Gly Leu Leu 


-20 




-15 






Val Leu Asp Ala 


Asn Asp 


Leu 


Glu Asp 




1 






5 


Asp Trp His Ser 


Leu 


Gin 


Val 


Gly Gly 


15 








20 


Cys Ala Met Gly 


He 


He 


He 


Val Met 


30 






35 




Phe Gly Gin Lys 


Ser 


Gly 


His 


His Pro 


45 




50 






Thr Pro Gly Ser 


Ala 


Gin 


Ser 






65 









<210> 477 
<211> 105 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -19. .-1 

<220> 

<221> UNSURE 
<222> 37 

<223> Xaa = * ,Lys 



<400> 477 



Met Arg 


Gly 


Ala 


Thr Arg 


Val Ser 


He 








-15 






Ser Asp 


Cys 


Ala 


Val He 


Thr Gly Ala 






1 




5 




Gly Ala 


Gly 


Thr 


Cys Cys 


Ala He 


Ser 


15 








20 




Met Cys 


Thr 


Pro 


Leu Gly 


Arg Xaa 


Gly 


30 






35 






His Lys 


He 


Pro 


Phe Phe 


Arg Lys 


Arg 








50 






Leu Pro 


Asn 


Leu 


Leu Cys 


Ser Arg 


Phe 






65 






70 


Ser Met 


Asp 


Leu 


Lys Asn 


He Asn 


Phe 




80 






85 
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Leu 


Ala 


Leu 


Met 


He 


Ser 


Met 


-10 










-5 




His 


His 


Gly Tyr Arg Arg Lys 








10 








He 


Thr 


Leu 


Leu 


Pro 


Leu 


Phe 






25 










Glu 


Lys 


His 


Tyr 


Asn 


Leu 


Leu 




40 










45 


Phe 


Ser 


He 


Val 


Thr 






55 















Val 


Phe 


Leu Ala Gly Phe 


Pro 




-10 








"5 


Lys 


Asn 


Ser 


Pro 


Phe Tyr 


Tyr 










10 




Leu 


He 


Cys 


Ala 


Gly Val 


Leu 








25 






Ser 


Ala 


Lys 


Cys 


Lys Cys 


Lys 






40 








Gly 


Glu 


Thr 


Pro 


Pro Leu 


He 




55 








60 



Met 


Leu 


Leu Leu 


Val Thr 


Val 


-10 






-5 




Cys 


Glu 


Arg Asp 


Val Gin 


Cys 






10 






Leu 


Trp 


Leu Arg Gly Leu Arg 






25 






Glu 


Glu 


Cys His 


Pro Gly 


Ser 




40 






45 


Lys 


His 


His Thr 


Cys Pro 


Cys 


55 






60 




Pro 


Asp 


Gly Arg 


Tyr Arg 


Cys 



75 



wo 02/083898 



<210> 478 
<211> 102 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -36. .-1 

<220> 

<221> UNSURE 

<222> 31 

<223> Xaa = His,Arg 

<400> 478 



Met 


Ser 


Asn 


Thr 


Thr 


Val Pro 


Asn Ala 




-35 








-30 




Met 


Val 


Gly 


Tyr 


Val 


Leu Gly 


Pro Phe 


-20 










-15 




Val 


Val 


Val 


Ala 


Val 


Val Met 


Tyr Val 










1 




5 


Arg 


Leu 


Arg 


His 


His 


Leu Leu 


Pro Met 






15 








20 


Glu 


Leu 


Xaa 


Glu 


Ala 


Glu Gin 


Glu Leu 




30 








35 




Lys 


Val 


Val 


His 


Gly 


Trp Gin 


Ser Gly 


45 










50 




Leu 


Leu 


Asp 


Val 


Lys 


Thr 





65 



<210> 479 

<211> 137 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -18. .-1 



<400> 479 



Met 


Lys 


Ala 


Leu 


He 


Val Leu Gly Leu 








-15 




-10 


Gin 


Gly 


Lys 
1 


Val 


Phe 


Glu Arg Cys Glu 
5 


Leu Gly 


Met 


Asp 


Gly 


Tyr Arg Gly He 


15 










20 


Leu 


Ala 


Lys 


Trp 


Glu 


Ser Gly Tyr Asn 










35 




Ala 


Gly 


Asp 


Arg 


Ser 


Thr Asp Tyr Gly 








50 




55 


Tyr 


Trp 


Cys 


Asn Asp 


Gly Lys Thr Pro 






65 






70 


Leu 


Ser 


Cys 


Ser Gly 


Trp His Gly Glu 




80 








85 


Ser 


Val 


Ser 


Met 


Phe 


Lys Val Val Glu 


95 










100 


Leu 


Gin 


Leu 


He 


Leu 


Ser Leu Ser His 










115 
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Pro 


Gin 


Ala 


Asn 


Ser Asp Ser 






-25 










Cr lie 


Leu 


He 


Thr 


Leu 


Val 


Glv 




-10 










-5 


Gin 


Lys 


Lys 


Lys 


Arg 


Val 


Asp 










10 






1 yi. 


Ser 


Tyr Asp 


Pro 


Ala 


Glu 








25 








Leu 


Ser Asp Met 


Gly Asp 


Pro 






40 










Tvr 


Gin 


His 


Lys 


Arg 


Met 


Pro 




55 










60 


Val 


Leu 


Leu 


Ser 


Val 


Thr 


Val 










-5 






Leu 


Ala Arg 


Thr 


Leu 


Lys 


Arg 






10 










Ser 


Leu 


Ala 


Asn 


Trp 


Met 


Cys 




25 










30 


Thr 


Arg Ala 


Thr 


Asn 


Tyr 


Asn 


40 










45 




He 


Phe 


Gin 


He 


Asn 


Ser 


Arg 










60 






Gly 


Ala 


Val 


Asn 


Ala 


Cys 


His 








75 








He 


Val 


Val 


Lys 


Thr 


Glu 


Met 






90 










Cys 


Asn 


Ser Arg 


He 


Phe 


Leu 




105 










110 



<210> 480 
<211> 101 
<212> PRT 



wo 02/083898 
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<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -29. .-1 

<400> 480 



Met Arg 


Glu 


Glu 


Lys 
-25 


Lys 


Pro 


Phe 


Val Cys 


Met 


Cys 
-10 


Val 


Phe 


Ser 


Thr 


Ala His 


Phe 


lie 


Asp 


Glu 


Lys 


Leu 


5 










10 




Arg Gly 


Thr 


Leu 


Asp 


Ser 


Leu 


Ser 


20 








25 






Ser Thr 


Ala 


Asp 


Pro 
40 


Thr 


Gin 


Ala 


Leu Pro 


Val 


Thr 


Ser Arg 


Ser 


Gin 






55 










Pro Ala 


Ala 
70 


Ala 


Leu 









<210> 481 
<211> 128 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -26. .-1 



<400> 481 



Met 


Cys 


Trp 


Leu Arg Ala 


Trp Gly 




-25 








-20 




Ser 


Leu 


Phe 


Leu He 


Gin 


Leu 


Leu 


-10 








-5 






He 


His 


Ser 


Pro Arg 


Asn 


Asn 


Gin 








10 








Glu 


Cys 


Ala 


Val Lys 


Lys 


Ser 


Cys 






25 








30 


Cys 


Val 


Trp 


Cys Ser 


Glu 


Glu 


Lys 




40 








45 




Tyr 


Phe 


Gly 


Cys Arg 


Phe 


Ser 


Ser 


55 








60 






Asp 


Met 


Phe 


Gly He 


Met 


Met 


Leu 








75 








Gly 


Phe 


Val 


Trp Tyr 


Cys 


Cys Ala 



90 



<210> 482 
<211> 142 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -24.. -1 

<220> 

<221> UNSURE 
<222> 80 

<223> Xaa = Leu, Arg, Trp 
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Glu 


Arg Glu Arg Glu Ser 


Val Cys 




-20 




-15 




Gin 


Gly Ala Leu Gly Glu 


Met 


Ala 


-5 




1 






Arg 


Pro 


Ser Glu Gly Asn 


Gly His 






15 






Ser 


Asp 


Gin Glu Ser Tyr 


He 


Pro 






30 






Gly 


Pro 


mn T.^n T.<ati H*i<; 




Asn 




45 




50 




Pro 


Leu 


Ptt) 5?^t* A<in T.^n 


Ala 


He 


60 




65 






Gin 


He 


Leu Leu Pro Val 


Phe 


Leu 






-15 






He 


Ser 


Phe Ser Glu Asn 


Gly 


Phe 






1 


5 




Lys 


Pro Arg Asp Gly Asn 


Glu 


Glu 


15 




20 






Gin 


Leu 


Cys Thr Glu Asp 


Lys 


Lys 






35 






Ala 


Cys 


Lys Lys Tyr Cys 


Phe 


Pro 






50 






He 


Tyr 


Trp Leu Asn Cys 


Lys 


Val 






65 




70 


Leu 


Leu 


He Ala Val Leu 


He 


Thr 




80 




85 




Tyr 


His 


Phe Tyr Leu Gin 


Asp 


He 


95 




100 







wo 02/083898 

<220> 

<221> UNSURE 
<222> 79 



<223> Xaa = Lys,Asn 










<400> 482 










Met Arg Leu Ser Trp 


Phe 


Arg 


Val 


Leu 


-20 










Ser Ala Val Ala Thr 


Ala 


Thr 


Gly 


Ala 


-5 








1 


He Gly Val Lys Lys 


Arg 


Val 


Asp 


His 


10 




15 






Gly Asp Val Leu His 


Met 


His 


Tyr 


Thr 


25 


30 








Glu Phe Asp Ser Ser 


Leu 


Pro 


Gin 


Asn 


45 










Gly Thr Gly Gin Val 


He 


Lys 


Gly 


Trp 


60 








65 


Cys Glu Gly Glu Lys 


Arg 


Xaa 


Xaa 


Val 


75 






80 




Gly Glu Arg Gly Ala 


Pro 


Pro 


Lys 


He 


90 




95 






Phe Glu Val Glu Leu 


Leu 


Lys 


He 


Glu 


105 


110 








<210> 483 










<211> 70 










<212> PRT 










<213> Homo sapiens 










<220> 










<221> SIGNAL 










<222> -24.. -1 










<400> 483 










Met Ala Ser Ser Thr 


Ser 


Val 


Ser 


Thr 


-20 










Pro Ala Cys Leu Gly 


Ser 


Trp 


Ala 


Glu 


-5 








1 


He Ser He Ala Ser 


Asp 


Phe 


Pro 


Thr 


10 




15 






Pro Pro Ala Pro Gly 


Met 


Val 


Pro 


He 


25 


30 








Trp Ala Phe Ser Pro 


Ala 








45 










<210> 484 










<211> 55 










<212> PRT 










<213> Homo sapiens 










<220> 










<221> SIGNAL 










<222> -23. .-1 










<400> 484 










Met Ala Gly Pro Ala 


Ala 


Ala 


Phe 


Arg 


-20 








-15 


Ala Ala Ala Leu Gly 


Phe 


Ala 


Ser 


Tyr 


-5 






1 




Gin Met Pro Thr Gly 


Arg 


Ser 


Cys 


Leu 


10 


15 








Ser Tyr Thr Ala Trp 


Pro 


Cys 
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Thr 


Val 


Leu 


Ser 


He 


Cys 


Leu 


-15 










-10 




Glu 


Gly 


Lys 


Arg 


Lys 


Leu 


Gin 








5 








Cys 


Pro 


He 


Lys 


Ser 


Arg 


Lys 






20 










Gly 


Lys 


Leu Glu Asp Gly Thr 




35 










40 


Gin 


Pro 


Phe 


Val 


Phe 


Ser 


Leu 


50 










55 




Asp 


Gin 


Gly Leu Leu Gly Met 










70 






He 


Pro 


Ser 


Glu 


Leu 


Gly 


Tyr 








85 








Pro 


Gly 


Gly Ala 


Thr 


Leu 


Val 






100 










Arg 


Arg 


Thr 


Glu 


Leu 








115 












Gly 


Gin 


He 


Leu 


Leu 


Thr 


Arg 


-15 










-10 




He 


Arg 


Ser 


Pro 


Val 


Arg 


Thr 








5 








Ala 


Arg 


Val 


Ser 


Leu 


Trp 


Val 






20 










Lys 


He 


Ser 


Gly Cys 


Ala 


Asn 




35 










40 


Arg 


Leu 


Gly Ala 


Leu 


Ser 


Gly 










-10 






Gly 


Ala 


His 


Gly Ala Asn 


Ser 






5 










Thr 


Arg 


Pro 


Thr 


Asn 


Thr 


Thr 




20 










25 



wo 02/083898 



30 



<210> 485 






<211> 55 






<212> PRT 






<213> Homo sapiens 




<220> 






<221> SIGNAL 






<222> -23.. -1 






<400> 485 






Met Ala Gly Pro 


Ala Ala Ala Phe 


Arg 


-20 




-15 


Ala Ala Ala Leu 


Gly Phe Ala Ser 


Tyr 


-5 


1 




Gin Met Pro Thr 


Gly Arg Ser Cys 


Leu 


10 


15 




Ser Tyr Thr Ala 


Trp Pro Cys 






30 





<210> 486 
<211> 54 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -17. .-1 

<400> 486 



Met 


Phe Arg 


Arg 


Leu Thr 


Phe Ala 


Gin 




-15 






-10 




Gly 


He Ala 


Gly 


Gly Val 


Tyr He 


Phe 




1 




5 






Ala 


Lys Asp 


Gin 


Lys Glu 


Leu Lys 


Glu 








20 






Ser 


Glu Glu 


Lys 


Lys Ser 










35 









<210> 487 
<211> 83 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -15.. -1 



<400> 487 



Met 


Arg 


Leu 


Phe 


Leu 


Ser 


Leu 


Pro 


Val 


-15 










-10 








Val 


Leu 


Glu 


Gly 
5 


Pro 


Ala 


Pro 


Ala 


Gin 
10 


Ala 


Leu Asp 


Lys 


Leu 


Lys 


Glu 


Phe 


Gly 






20 










25 




Arg 


Glu 


Leu 


He 


Ser Arg 


He 


Lys 


Gin 




35 










40 






Arg 


Glu 


Trp 


Phe 


Ser 


Glu 


Thr 


Phe 


Gin 



50 55 
He Asp Ser 
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Arg Leu Gly Ala Leu Ser Gly 
-10 

Gly Ala His Gly Ala Asn Ser 
5 

Thr Arg Pro Thr Asn Thr Thr 
20 25 



Leu Leu Phe Ala Thr Val Leu 
-5 

Gin Pro Val Phe Glu Gin Tyr 

10 15 
Lys Met Gin Leu Val Gin Glu 
25 30 



Leu Val Val Val Leu Ser lie 

-5 1 
Gly Thr Pro Asp Val Ser Ser 
15 

Asn Thr Leu Glu Asp Lys Ala 
30 

Ser Glu Leu Ser Ala Lys Met 
45 

Lys Val Lys Asp Lys Leu Lys 
60 65 



<210> 488 



wo 02/083898 

<211> 98 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -43. ,-1 

<400> 488 

Met Phe Arg lie Glu Gly Leu Ala Pro Lys Leu Asp Pro Glu Glu Met 

-40 -35 -30 

Lys Arg Lys Met Arg Glu Asp Val He Ser Ser He Arg Asn Phe Leu 

-25 -20 -15 

He Tyr Val Ala Leu Leu Arg Val Ser Glu Cys Leu Pro Gly Cys Asp 

-10 -5 1 5 

Cys Asp Thr Ser Gly Glu Leu Thr Asp Gly His Pro Leu Thr Leu Arg 

10 15 20 

Gly His Arg Gly Leu Arg Thr Glu Leu Asn Gly Ser Gly Glu Gin Gly 

25 30 35 

Gly Ser He Tyr Leu Lys Glu He Gly Gin His Met Lys Thr Gly His 
40 45 50 

His He 
55 

<210> 489 
<211> 82 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -43. .-1 

<400> 489 



Met 


Ala 


Lys 


Tyr 


Gin 


Gly Glu Val Gin 


Ser Leu 


Lys 


Leu 


Asp 


Asp 


Asp 








-40 




-35 








-30 






Ser 


Val 


He 


Glu Gly 


Val Ser Asp Gin 


Val Leu 


Val 


Ala 


Val 


Val 


Val 






-25 






-20 






-15 








Ser 


Phe 


Ala 


Leu 


He 


Ala Thr Leu Val Tyr Ala 


Leu 


Phe 


Arg 


Asn 


Val 




-10 








-5 




1 








5 


His 


Gin 


Asn 


He 


His 


Pro Glu Asn Gin 


Glu Leu 


Val 


Arg 


Val 


Leu 


Arg 










10 




15 








20 




Glu 


Gin 


Leu 


Gin 


Thr 


Glu Gin Asp Ala 


Pro Ala Asp 


Ser 


Thr 


Ala 


Val 








25 




30 








35 







Leu His 



<210> 490 
<211> 99 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -28. .-1 

<400> 490 

Met Thr Ser Ala Leu 
-25 

Gly Tyr Leu Val Leu 
-10 

Leu Glu Asn Asp Glu 

5 

Thr Ala Cys Gly Phe 



PCT/IBOl/00914 

272 



Thr Gin Gly Leu Glu 
-20 

Ser Glu Gly Ala Val 
-5 

Gin Ala Ala Ser Ala 

10 

Arg Leu His Arg Gly 



Arg He Pro Asp Gin Leu 
-15 

Leu Ala Ser Ser Gly Asp 
1 

He Ser Glu Leu Val Ser 
15 20 
Met Asn Val Pro Phe Lys 



wo 02/083898 



25 

Arg Leu Ser Val Val Phe 
40 

Gly Gin Arg Val Phe Val 
55 

lie Asp Val 
70 

<210> 491 
<211> 186 
<212> PRT 

<213> Homo sapiens ' 

<220> 

<221> SIGNAL 
<222> -15. .-1 

<220> 

<221> UNSURE 

<222> 148 

<223> Xaa Glu,Gln 
<400> 491 



Met 


His 


Val 


Met 


Ala Ala 


Ser Met Ala 


Arg Gly Gly Val Ser Ala 


Arg 


-15 








-10 




-5 


1 


Val 


Leu 


Leu 


Gin 


Ala Ala 


Arg Gly Thr 


Trp Trp Asn Arg Pro Gly Gly 








5 




10 


15 




Thr 


Ser 


Gly 


Ser 


Gly Glu 


Gly Val Ala 


Leu Gly Thr Thr Arg Lys 


Phe 






20 






25 


30 




Gin 


Ala 


Thr 


Gly 


Ser Arg 


Pro Ala Gly 


Glu Glu Asp Ala Gly Gly 


Pro 




35 








40 


45 




Glu Arg 


Pro 


Gly 


Asp Val 


Val Asn Val 


Val Phe Val Asp Arg Ser Gly 


50 








55 




60 


65 


Gin Arg 


He 


Pro 


Val Ser 


Gly Arg Val 


Gly Asp Asn Val Leu His 


Leu 










70 




75 80 




Ala 


Gin 


Arg 


His 


Gly Val 


Asp Leu Glu 


Gly Ala Cys Glu Ala Ser 


Leu 








85 




90 


95 




Ala 


Cys 


Ser 


Thr 


Cys His 


Val Tyr Val 


Ser Glu Asp His Leu Asp 


Leu 






100 






105 


110 




Leu 


Pro 


Pro 


Pro 


Glu Glu 


Arg Glu Asp 


Asp Met Leu Asp Met Ala 


Pro 




115 








120 


125 




Leu 


Leu 


Gin 


Glu 


Asn Ser 


Arg Leu Gly 


Cys Gin He Val Leu Thr 


Pro 


130 








135 




140 


145 


Glu 


Leu 


Xaa 


Gly 


Ala Glu 


Phe Thr Leu 


Pro Lys He Thr Arg Asn 


Phe 










150 




155 160 




Tyr 


Val 


Asp 


Gly 


His Val 


Pro Lys Pro 


His 










165 




170 







<210> 492 
<211> 123 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -48. .-1 

<400> 492 

Met Glu Ala Leu Gly Lys 
-45 

Leu Glu Asp Phe Arg Val 
-30 

Val Ser Gly Leu Leu Met 
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30 35 
Gly Glu His Thr Leu Leu Val Thr Val Ser 

45 50 
Val Lys Arg Gin Asn Arg Gly Arg Glu Pro 
60 65 



Leu Lys Gin Phe Asp Ala Tyr Pro Lys Thr 

-40 -35 
Lys Thr Cys Gly Gly Ala Thr Val Thr He 

-25 -20 
Leu Leu Leu Phe Leu Ser Glu Leu Gin Tyr 



wo 02/083898 



-15 -10 
Tyr Leu Thr Thr Glu Val His Pro Glu 
1 5 

Gly Asp Lys Leu Lys lie Asn lie Asp 

20 25 
Cys Ala Tyr Leu Ser He Asp Ala Met 
35 40 



Leu Asp Val Glu His Asn Leu Phe Lys 
50 55 



He Pro Val 


Ser Ser 


Glu 


Ala 


Glu 


Arg 


65 






70 






<210> 


493 












<211> 


77 












<212> 


PRT 












<213> 


Homo sapiens 










<220> 














<221> 


SIGNAL 










<222> 


-29. 


-1 










<220> 














<221> 


UNSURE 










<222> 


36 












<223> 


Xaa 


Gin, Arg 










<400> 


4 93 












Met Ala Ala 


Thr Asp 


Phe 


Val 


Gin 


Glu 






-25 










Leu Leu Leu 


Lys Leu 


Gin 


Arg 


Leu 


Pro 






-10 








-5 


Val Ala Phe 


Ser Val 


He 


He 


Leu 


Phe 


5 








10 






Leu Leu He 


Ala Cys 


Ser 


Cys 


Cys 


Cys 


20 






25 








Xaa Arg Gly 


Arg Lys 


Val 


Gin 


Val 


Gin 



40 



<210> 494 
<211> 107 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -27.. -1 



<400> 494 



Met Leu 


Phe 


Glu 


Glu Ala 


Leu 


Pro Leu 




-25 








-20 


Ser Thr 


Leu 


Ser 


Leu Val 


Gin 


Phe Ser 


-10 








-5 




Leu Leu 


Ser 


Pro 


Gly Val 


Glu 


Asn Leu 








10 






Leu Gly 


Leu 


Ala 


Cys Cys 


Leu 


Leu Thr 






25 






30 


Pro Asp 


Pro 


Pro 


Glu Gly 


Pro 


Gly Thr 




40 








45 


Leu Arg 


He 


Leu 


Gin Leu 


Pro Gly Ala 


55 








60 




Gin Pro 


Val 


Trp 


Ser Arg 


Thr 


Pro Asp 


70 






75 
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-5 






Leu Tyr 


Val 


Asp 


Lys Ser Arg 


10 






15 


Val Leu 


Phe 


Pro 


His Met Pro 








30 


Asp Val 


Ala 


Gly 


Glu Gin Gin 






45 




Gin Arg 


Leu 


Asp 


Lys Asp Gly 




60 






His Asp 









75 



Met 


Arg 


Ala 


Val 


Gly Glu Arg 


-20 








-15 




Gin 


Ala 


Glu 


Pro 


Val Glu 
1 


He 


Thr 


Ala 


Thr 


Val 


Leu Leu 


Leu 






15 








Thr 


His 


Cys 


Cys 


Cys Pro 


Glu 




30 








35 


Pro 


Thr 


Pro 


Pro 






45 













Ser Cys 


Ser 


Asp 


Pro 


Val Leu 






-15 






Pro Ser 


Gly 


Arg 


Thr 


Gin Asp 




1 






5 


Ser Val 


Leu 


Asp 


Val 


Ser Pro 


15 








20 


Leu Thr 


Met 


Ser 


Cys 


Pro Gly 








35 




Gin Arg 


Val 


Trp 


Gin Gly Ala 






50 






Pro Asp 


Gly 


Val 


Ser 


Pro Tyr 




65 








Leu Lys 











80 



wo 02/083898 



<210> 495 
<2H> 112 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -39. .-1 

<220> 

<221> UNSURE 
<222> -37 

<223> Xaa = * ,Glu,Lys,Gln 



<400> 495 



Met Leu 


Xaa 


His 


Leu 


Ser 


Ser 


Leu 


Pro Thr 


Gin 


Met 


Asp Tyr 


Lys 


Gly 








-35 








-30 








-25 




Gin Lys 


Leu 


Ala 


Glu 


Gin 


Met 


Phe 


Gin Gly He 


He 


Leu Phe 


Ser 


Ala 






-20 










-15 






-10 






He Val 


Gly 


Phe 


He 


Tyr 


Gly Tyr 


Val Ala 


Glu 


Gin 


Phe Gly 


Trp Thr 




-5 










1 






5 








Val Tyr 


He 


Val 


Met 


Ala 


Gly 


Phe 


Ala Phe 


Ser 


Cys 


Leu Leu 


Thr 


Leu 


10 








15 








20 








25 


Pro Pro 


Trp 


Pro 


He 


Tyr 


Arg Arg 


His Pro 


Leu 


Lys 


Trp Leu 


Pro 


Val 








30 








35 








40 




Gin Ala 


Gin 


Thr Thr Arg 


Asn 


Gin 


Gly Lys Glu Lys 


Leu Arg 


Gly Met 






45 










50 






55 






Leu Lys 


He 


He 


Glu 


Val 


Phe 


Met 


He Gin 


His 


Leu 


Leu Leu 


Phe 


Leu 




60 










65 








70 






<210> 496 
























<211> 103 
























<212> PRT 
























<213> Homo sapiens 




















<220> 


























<221> SIGNAL 






















<222> -21..- 


-1 






















<400> 496 
























Met Ala 


Leu Cys 


Ala 


Leu 


Thr Arg 


Ala Leu Arg Ser Leu Asn 


Leu 


Ala 


-20 










-15 








-10 








Pro Pro 


Thr 


Val 


Ala 


Ala 


Pro 


Ala 


Pro Ser 


Leu 


Phe 


Pro Ala 


Ala 


Gin 


-5 








1 






5 








10 




Met Met 


Asn 


Asn 


Gly Leu 


Leu 


Gin 


Gin Pro 


Ser 


Ala 


Leu Met 


Leu 


Leu 






15 










20 






25 






Pro Cys 


Arg 


Pro 


Val 


Leu 


Thr 


Ser 


Val Ala 


Leu 


Asn 


Ala Asn 


Phe 


Val 




30 










35 








40 






Ser Trp 


Lys 


Ser 


Arg 


Thr 


Lys 


Tyr 


Thr He 


Thr 


Pro 


Val Lys 


Met 


Arg 


45 










50 








55 








Lys Ser 


Gly Gly 


Arg 


Asp 


His 


Thr 


Gly Ala 


Gly Asn Val Arg 


Ser 


Asn 


60 








65 








70 








75 


Ser Arg 


Pro 


Ser 


He 


Gin 


Arg 

















80 



<210> 497 
<211> 92 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -15. .-1 
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<400> 497 



Met Ala Val Leu 


Ala Gly Ser Leu 


Leu Gly 


Pro Thr Ser Arg 


Ser Ala 


-15 




-10 






-5 


1 


Ala Leu Leu Gly 


Gly Arg Trp Leu 


Gin 


Pro 


Arg Ala Trp Leu Gly Phe 


5 






10 




15 




Pro Asp Ala Trp 


Gly Leu Pro Thr 


Pro 


Gin 


Gin Ala Arg Gly 


Lys Ala 


20 




25 






30 




Arg Gly Asn Glu 


Tyr Gin Pro Ser 


Asn 


He 


Lys Arg Lys Asn 


Lys His 


35 




40 






45 




Gly Trp Val Arg 


Arg 


Leu Ser Thr 


Pro 


Ala 


Gly Val Gin Val 


He Leu 


50 




55 






60 


65 


Arg Arg Met Leu 


Lys 


Gly Arg Lys 


Ser 


Leu 


Ser His 






70 






75 






<210> 498 














<211> 87 














<212> PRT 














<213> Homo sapiens 












<220> 














<221> SIGNAL 














<222> -39, .-1 














<400> 498 














Met Lys Val Leu 


Leu 


Leu Thr Gly 


Leu Gly 


Ala Leu Phe Phe 


Ala Tyr 




-35 






-30 




-25 


Tyr Trp Asp Asp 


Asn 


Phe Asp Pro 


Ala 


Ser 


Leu Gin Gly Ala Arg Val 


-20 






-15 




-10 




Leu Leu Thr Gly 


Ala 


Asn Ala Gly 


Val 


Gly 


Glu Glu Leu Ala 


Tyr His 


-5 




1 






5 




Tyr Ala Arg Leu 


Gly Ser His Leu 


Val 


Leu 


Thr Ala His Thr 


Glu Ala 


10 




15 






20 


25 


Leu Leu Gin Lys 


Ala Arg Trp Leu 


Thr 


Leu 


Val Val Ser Thr 


Leu Gly 




30 






35 




40 


Gly Arg Gly Lys 


Trp 


He Thr 










45 















<210> 499 
<211> 125 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -15. .-1 

<220> 

<221> UNSURE 
<222> 100 

<223> Xaa = Asn, Thr 
<400> 499 



Met 


Lys 


Leu 


Leu 


Thr 


His 


Asn 


Leu 


Leu 


Ser 


Ser His Val Arg Gly Val 


-15 










-10 










-5 






1 


Gly 


Ser Arg 


Gly Phe 


Pro 


Leu Arg 


Leu 


Gin 


Ala Thr 


Glu Val 


Arg 


He 








5 










10 






15 






Cys 


Pro 


Val 


Glu 


Phe 


Asn 


Pro 


Asn 


Phe 


Val 


Ala Arg 


Met He 


Pro 


Lys 






20 










25 








30 






Val 


Glu 


Trp 


Ser 


Ala 


Phe 


Leu 


Glu 


Ala 


Ala 


Asp Asn 


Leu Arg 


Leu 


He 




35 










40 








45 








Gin 


Val 


Pro 


Lys 


Gly 


Pro 


Val 


Glu 


Gly Tyr 


Glu Glu 


Asn Glu 


Glu 


Phe 


50 










55 










60 






65 
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Leu Arg Thr Met His His Leu Leu Leu Glu Val Glu Val lie Glu Gly 

70 75 80 

Thr Leu Gin Cys Pro Glu Ser Gly Arg Met Phe Pro lie Ser Arg Gly 

85 90 95 

lie Pro Xaa Met Leu Leu Ser Glu Glu Glu Thr Glu Ser 
100 105 110 



<210> 500 
<211> 52 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -22,. -1 



<220> 

<221> UNSURE 
<222> 26 

<223> Xaa » Ile,Asn 



<400> 500 

Met Lys He Ser Val Ala Ala He Pro Phe Phe Leu Leu He Thr lie 

-20 -15 -10 

Ala Leu Gly Thr Lys Thr Glu Ser Ser Ser Arg Gly Pro Tyr His Pro 

-5 15 10 

Ser Glu Cys Cys Phe Thr Tyr Thr Thr Tyr Lys He Pro Arg Gin Xaa 
15 20 25 

Tyr Gly Leu Leu 
30 



<210> 501 
<211> 72 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -42.. -1 



<400> 501 

Met Asn Arg Leu Ala Gly Val Gly 
-40 -35 
Ser Ser Leu Leu Gin Ser Val Glu 

-25 -20 
Glu Val Ala Ala Ala Pro Gly Thr 
-10 -5 
Leu Ser Ala Asp Lys Phe Gin Val 
10 

Gin Thr Leu Met Ser Ser Leu Gly 
25 30 



Trp Arg Val Asp Tyr Thr Leu Ser 
-30 

Glu Pro Met Val His Leu Arg Leu 
-15 

Pro Ala Gin Pro Val Ala Met Ser 

1 5 
Leu Leu Ala Glu Leu Lys Gin Ala 

15 20 



<210> 502 
<211> 84 
<212> PRT 

<213> Homo sapiens 
<22a> 

<221> SIGNAL 
<222> -20.. -1 



<400> 502 

Met Ala Ala Ala Ala Val Pro Ser Leu Leu Leu Ser Leu Pro Pro His 



wo 02/083898 PCT/IBO 1/009 14 

278 



-20 






-15 








-10 


-5 


Gin Glv Leu 


Thr 


Phe 
1 


Ser 


Asn 


Lys 


He 
5 


Gin Pro 


Phe Gly Ala Gin Gly 
10 


Val Leu His 


Pro 


Glu 


Pro 


Gly Leu 


Arg 


Asp Trp Leu Leu Pro Thr Cys 


15 










20 






25 


Ser Arg Gin 


Leu 


Arg Val 


Ala 


Leu 


Pro 


Glu Lys 


Gly Ser Glu Gly Ser 


30 








35 








40 


Leu Cys Gin 


Thr 


Gin 


Leu 


Pro 


Ala 


Thr 


Pro Cys 


Phe Leu Pro Ser Asn 


45 






50 








55 


60 


Thr Val Arg 


Thr 

















<210> 503 
<211> 54 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -19. .-1 

<400> 503 

Met Lys Phe Asn Phe Val Leu Leu lie Leu Ser Lys Val Leu Asp Asp 

-15 -10 -5 

Thr Phe Gin Ser Val Lys Lys Trp Leu Asn Tyr Phe Gin Phe Thr Leu 

15 10 

Arg Asn Gly Leu Met Trp Pro Gly Ala Val Ala His Ala Cys Asn Pro 

15 20 25 

Ser Thr Gly Ser Arg Leu 

30 35 

<210> 504 
<211> 91 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -19.. -1 

<220> 

<221> UNSURE 
<222> 29 

<223> Xaa = Met, Thr 
<400> 504 



Met 


Tyr 


Gly 


Lys 


He 
-15 


He 


Phe 


Val 


Leu 


Leu 

-10 


Leu 


Ser Glu 


He 


Val 
-5 


Ser 


He 


Ser 


Ala 


Leu 
1 


Ser 


Thr 


Thr 


Glu 
5 


Val 


Ala 


Met 


His Thr 
10 


Ser 


Thr 


Ser 


Ser 


Ser 


Val 


Thr Lys 


Ser 


Tyr 


He 


Ser Ser Gin Thr Asn Gly Glu 


Xaa 




15 










20 










25 








Gly Gin 


Leu 


Val 


His 


Arg 


Phe 


Thr 


Val 


Pro 


Ala 


Pro Val 


Val 


He 


He 


30 










35 










40 








45 


Leu 


He 


He 


Leu 


Cys 

50 


Val 


Met 


Ala 


Gly 


He 

55 


He 


Gly Thr 


He 


Leu 
60 


Leu 


He 


Ser 


Tyr 


Ser 


He Arg Arg 


Leu 


He 


Lys 


Ala 











65 70 



<210> 505 
<211> 114 
<212> PRT 

<213> Homo sapiens 
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<220> 






















<221> SIGNAL 






















<222> -32. .-1 






















<400> 505 






















Met Gin Met Tyr 


Ser 


Arg Gin 


Leu Ala Ser 


Thr 


Glu 


Trp 


Leu 


Thr 


He 










-25 






-20 








Gin Gly Gly Leu 


Leu 


Gly Ser 


Gly Leu Phe 


Val 


Phe 


Ser 


Leu 


Thr 


Ala 


-15 






-10 






-5 










Phe Asn Asn Leu 


Glu 


Asn 


Leu 


Val Phe Gly 


Lys Gly 


Phe 


Gin 


Ala 


Lys 


1 


5 






10 










15 




lie Phe Pro Glu 


He 


Leu 


Leu 


Cys Leu Leu 


Leu 


Ala 


Leu 


Phe 


Ala 


Ser 


20 








25 








30 






Gly Leu lie His 


Arg 


Val 


Cys 


Val Thr Thr 


Cys 


Phe 


He 


Phe 


Ser 


Met 


35 








40 






45 








Val Gly Leu Tyr 


Tyr 


He 


Asn 


Lys He Ser 


Ser 


Thr 


Leu 


Tyr 


Gin 


Ala 


50 






55 






60 










Ala Ala Pro Val 


Leu 


Thr 


Pro 


Ala Lys Val 


Thr Gly 


Lys 


Ser 


Lys 


Lys 


65 




70 






75 










80 


Arg Asn 






















<210> 506 






















<211> 53 






















<212> PRT 






















<213> Homo sapiens 




















<220> 






















<221> SIGNAL 






















<222> -13.. -1 






















<400> 506 






















Met Ala Val Ala 


Phe 


Val 


Leu 


Ser Leu Gly 


Val 


Ala 


Ala 


Leu 


Tyr 


Lys 


-10 








-5 








1 






Phe Arg Val Ala 


Asp 


Gin Arg 


Lys Lys Ala 


Tyr Ala 


Asp 


Phe 


Tyr Arg 


5 






10 






15 










Asn Tyr Asp Val 


Met 


Lys Asp Phe Glu Glu 


Met 


Arg 


Lys 


Ala 


Gly 


He 


20 




25 






30 










35 


Phe Gin Ser Val 


Lys 






















40 




















<210> 507 






















<211> 108 






















<212> PRT 






















<213> Homo sapiens 




















<220> 






















<221> SIGNAL 






















<222> -40. .-1 






















<400> 507 






















Met Gin Lys Phe 


Arg 


Lys 


Met 


Ser Glu Thr 


His 


His 


Ser 


Val 


He 


Ser 


-40 




-35 






-30 










-25 


Val Lys Ala Ser 


Ser 


Pro 


Trp 


Leu Ser Ser 


Ser 


Val 


Thr 


Ala 


Pro 


Ser 


-20 






-15 










-10 




Met Val Ala Pro 


Val 


Thr 


Phe 


Ala Ser He 


Val 


Glu 


Glu 


Glu 


Leu 


Gin 


-5 








1 






5 








Gin Glu Ala Ala 


Leu 


He 


Arg 


Ser Arg Glu 


Lys 


Pro 


Leu 


Ala 


Leu 


He 


10 






15 






20 










Gin He Glu Glu 


His 


Ala 


He 


Gin Asp Leu 


Leu 


Val 


Phe 


Tyr 


Glu 


Ala 


25 




30 






35 










40 


Phe Gly Asn Pro 


Glu 


Glu 


Phe 


Val He Val 


Glu 


Arg 


Thr 


Pro 


Gin 


Gly 




45 






50 










55 




Pro Leu Ala Val 


Pro 


Met 


Trp Asn Lys His 


Gly 


Cys 
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<210> 508 
<211> 127 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -47. .-1 

<220> 

<221> UNSURE 
<222> -22 

<223> Xaa = * ,Cys 
<220> 

<221> UNSURE 
<222> -39 

<223> Xaa = * /Tyr 

<220> 

<221> UNSURE 
<222> 57 

<223> Xaa =» He, Leu 

<220> 

<221> UNSURE 
<222> -43,-27,63 
<223> Xaa = Leu, Met 

<220> 

<221> UNSURE 
<222> 3 

<223> Xaa = Lys,Gln 
<220> 

<221> UNSURE 
<222> -42 

<223> Xaa = Lys,Thr 

<400> 508 



Met 


Phe 


Thr 


Pro Xaa Xaa Val Lys Xaa Ala Tyr Tyr Asp Thr Glu Arg 






-45 






-40 


-35 






He 


Gly 


Val 


Asp Xaa 


He Met Lys Thr Xaa Phe Ser 


Pro Asn Arg 


Val 




-30 








-25 -20 








He 


Gly 


Leu 


Ser 


Ser Asp Leu Gin Gin Val Gly Gly Ala 


Ser Ala 


Arg 


-15 










-10 -5 






1 


He 


Xaa 


Asp 


Ala 


Leu 


Ser Thr Val Leu Gin Tyr Ala 


Glu 


Asp Val 


Leu 








5 




10 




15 




Ser 


Gly 


Lys 


Val 


Ser 


Ala Asp Asn Thr Val Gly Arg 


Phe 


Leu Met 


Ser 






20 






25 


30 






Leu 


Val 


Asn 


Gin 


Val 


Pro Lys He Val Pro Asp Asp 


Phe 


Glu Thr 


Met 




35 








40 45 








Leu 


Asn 


Ser 


Asn 


He 


Asn Asp Xaa Leu Met Val Thr 


Tyr 


Xaa Ala 


Asn 


50 










55 60 






65 


Leu 


Thr 


Gin 


Ser 


Gin 


He Ala Leu Asn Glu Lys Leu 


Val 


Asn Leu 





70 75 80 



<210> 509 
<211> 85 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SIGNAL 
<222> -32. .-1 

<400> 509 



Met Lys 


Ala 


Ala Leu 


Glu Asp Thr Leu Ala Glu Thr 


Glu Ala Arg 


Phe 




-30 




-25 


-20 








Gly Ala 


Gin 


Leu Ala 


His He Gin Ala Leu He Ser 


Gly 


He 


Glu 


Ala 


-15 






-10 -5 










Gin Leu 


Gly 


Asp Val 


Arg Ala Asp Ser Glu Arg Gin 


Asn 


Gin 


Glu 


Tyr 


1 




5 


10 






15 




Gin Arg 


Leu 


Met Asp 


He Lys Ser Arg Leu Glu Gin 


Glu 


He 


Ala 


Thr 






20 


25 




30 






Tyr Arg 


Ser 


Leu Leu Glu Gly Gin Glu Asp His Tyr 


Asn 


Asn 


Leu 


Ser 




35 




40 


45 








Ala Ser 


Lys 


Val Leu 













50 



<210> 510 
<211> 101 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -25. .-1 



<400> 510 



Met Val 


Asp 


Arg 


Glu 


Leu Ala Asp 


He His Glu Asp 


Ala 


Lys 


Thr 


Cys 


-25 








-20 


-15 








-10 


Leu Val 


Leu 


Cys 


Ser Arg Val Leu 


Ser Val He Ser 


Val 


Lys 


Glu 


He 








-5 




1 




5 






Lys Thr 


Gin 
10 


Leu 


Ser 


Leu Gly Arg 
15 


His Pro He He 


Ser 
20 


Asn 


Trp 


Phe 


Asp Tyr 


He 


Pro 


Ser 


Thr Arg Tyr 


Lys Asp Pro Cys 


Glu 


Leu 


Leu 


His 


25 








30 


35 










Leu Cys 


Arg 


Leu 


Thr 


He Arg Asn Gin Leu Leu Thr 


Asn 


Asn 


Met 


Leu 


40 








45 


50 








55 


Pro Asp Gly 


He 


Phe 


Ser Leu Leu 


He Pro Ala Arg 


Leu 


Gin 


Asn 


Tyr 








60 




65 






70 




Leu Asn 


Leu 


Glu 
75 


He 















<210> 511 
<211> 68 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -47, .-1 



<400> 511 



Met Val Lys 


Ser 


Ser Leu Gin Arg 


He 


Leu 


Asn 


Ser 


His 


Cys 


Phe 


Ala 


-45 






-40 










-35 








Arg Glu Lys 


Glu 


Gly Asp Lys 


Pro 


Ser 


Ala 


Thr 


He 


His 


Ala 


Ser Arg 


-30 




-25 










-20 










Thr Met Pro 


Leu 


Leu Ser Leu 


His 


Ser Arg Gly Gly 


Ser 


Ser 


Ser 


Glu 


-15 




-10 








-5 










1 


Ser Ser Arg 


Val 


Ser Leu His 


Cys 


Cys 


Ser 


Asn 


Pro 


Gly 


Pro Gly 


Pro 




5 






10 










15 






Arg Trp Cys 


Ser 
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<210> 512 
<211> 68 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -47. .-1 

<220> 

<221> UNSURE 
<222> 9 

<223> Xaa = Cys,Trp 
<400> 512 



Met 


Val Lys 


Ser 


Ser Leu Gin Arg 


He 


Leu Asn 


Ser His 


Cys 


Phe 


Ala 




-45 






-40 






-35 








Arg 


Glu Lys 


Glu 


Gly Asp 


Lys Pro 


Ser 


Ala Thr 


He His 


Ala 


Ser 


Arg 




-30 






-25 






-20 








Thr 


Met Pro 


Leu 


Leu Ser 


Leu His 


Ser Arg Gly Gly Ser Ser 


Ser 


Glu 


-15 






-10 






-5 








1 


Ser 


Ser Arg 


Val 


Ser Leu 


His Xaa 


Cys 


Ser Asn 


Pro Gly 


Pro 


Gly Pro 






5 






10 






15 






Arg Trp Cys 


Ser 



















20 



<210> 513 
<211> 129 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -16. .-1 



<400> 513 



Met Ala 


Val 


Leu 


Leu Leu Leu Leu Arg Ala Leu Arg Arg 


Gly 


Pro Gly 


-15 






-10 -5 








Pro Gly 


Pro 


Arg 


Pro Leu Trp Gly Pro Gly Pro Ala Trp 


Ser 


Pro 


Gly 


1 






5 10 




15 




Phe Pro 


Ala 


Arg 


Pro Gly Arg Gly Arg Pro Tyr Met Ala 


Ser 


Arg 


Pro 






20 


25 


30 






Pro Gly 


Asp 


Leu 


Ala Glu Ala Gly Gly Arg Ala Leu Gin 


Ser 


Leu 


Gin 




35 




40 45 








Leu Arg 


Leu 


Leu 


Thr Pro Thr Phe Glu Gly He Asn Gly Leu 


Leu 


Leu 


50 






55 60 








Lys Gin 


His 


Leu 


Val Gin Asn Pro Val Arg Leu Trp Gin 


Leu 


Leu 


Gly 


65 






70 75 






80 


Gly Thr 


Phe 


Tyr 


Phe Asn Thr Ser Arg Leu Lys Gin Lys 


Asn 


Lys 


Glu 








85 90 




95 




Lys Asp 


Lys 


Ser 


Lys Gly Lys Ala Pro Glu Glu Asp Glu Gly 


He 


Phe 






100 


105 


110 







He 



<210> 514 
<211> 109 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
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<222> -33. .-1 
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<220> 

<221> UNSURE 
<222> 43 

<223> Xaa = Asp, Gly, Asn, Ser 

<220> 

<22l> UNSURE 
<222> 22 

<223> Xaa ^ Glu,Gly 
<220> 

<221> UNSURE 
<222> 42 

<223> Xaa = Pro,Arg 



<400> 514 








Met Ala 


Ala 


Ser Gly Ala 


Pro 


Arg 






-30 






Asn Val 


Ala 


Pro Leu Ala 


Val 


Phe 




-15 






-10 


Gly Gin 


Arg 


Leu Ala Ser 


Glu 


Pro 


1 




5 






Pro Thr 


Ser 


Ser Val Pro 


Xaa 


Thr 






20 






Ala Leu 


Gly 


Gly Ala Leu 


Ala 


Leu 






35 






Ser Ser 


Gin 


Arg Met Pro Arg 


Gin 




50 






55 


Gly Gly 


Gly 


Pro Gly Lys 


Ser 


Pro 


65 






70 





lie Leu Val Asp Leu Leu Lys Leu 

-25 -20 
Gin Met Leu Lys Ser Met Cys Ala 
-5 

Gin Asp Pro Ala Ala Val Ser Leu 

10 15 
Arg Gly Arg Asn Lys Gly Ser Ala 

25 30 
Ala Glu Xaa Xaa Ser Arg Glu Gly 
40 45 
Pro Ser Ala Thr Arg Leu Pro Lys 
60 

Thr Arg Gly Ser Thr 
75 



<210> 515 
<211> 103 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -36. .-1 



<400> 515 












Met Ala Asn 


Leu Phe lie Arg Lys Met 


Val Asn 


Pro 


Leu 


Leu Tyr Leu 


-35 


-30 




-25 






Ser Arg His 


Thr Val Lys Pro Arg Ala 


Leu Ser 


Thr 


Phe 


Leu Phe Gly 


-20 


-15 


-10 






-5 


Ser lie Arg 


Gly Ala Ala Pro Val Ala 


Val Glu 


Pro 


Gly Ala Ala Val 




1 5 








10 


Arg Ser Leu 


Leu Ser Pro Gly Leu Leu 


Pro His 


Leu 


Leu 


Pro Ala Leu 


15 


20 






25 




Gly Phe Lys 


Asn Lys Thr Val Leu Lys 


Lys Arg Cys 


Lys 


Asp Cys Tyr 


30 


35 




40 






Leu Val Lys 


Arg Arg Gly Arg Trp Tyr Val Tyr 


Cys 


Lys 


Thr His Pro 


45 


50 


55 






60 


Arg His Lys 


Gin Arg Gin Met 











65 



<210> 516 
<211> 86 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SIGNAL 
<222> -36.. -1 

<400> 516 



Met Gly 


Pro 


Ala 


Leu Arg 


Ser 


Leu 


Gin 


Val 


Lys Lys Gly 


Thr 


Glu 


His 


-35 










-30 










-25 










Ala Asp 


Pro 


Leu 


Pro 


Phe 


Pro 


Ser 


Val 


Ser 


Leu Ser Gly 


Phe 


Thr 


Val 


-20 








-15 










-10 










-5 


Gly Thr 


Leu 


Ser 


Glu 


Thr 


Ser Thr Gly Gly 


Pro 


Ala 


Thr 


Pro 


Thr 


Trp 








1 








5 










10 






Lys Glu 


Cys 


Pro 


He 


Cys 




Glu Arg 


Phe 


Pro 


Ala 


Glu 


Ser 


Asp 


Lys 




15 










20 










25 








Asp Ala 


Leu 


Glu 


Asp 


His 




Asp Gly His 


Phe 


Phe 


Phe 


Ser 


Thr 


Gin 


30 






























Asp Pro 


Phe 


Thr 


Phe 


ox u 






















45 






























<210> 517 




























<211> 52 




























<212> PRT 




























<213> Homo sapiens 
























<220> 






























<221> SIGNAL 


























<222> -30..- 


-1 


























<400> 517 




























Met Ser 


Ser 


Cys 


Gly 


He 


Val 


Gly Ser Ser 


Vox 


C o V- 




Gin 


Leu 


Asp 


-30 




























-15 


Ala Val 


Lys 


Leu 


Leu 


Leu 


Lys 


Met 


Val 


Ser 


Ser 


Ala 


Thr 


Thr 


Glu Arg 








-10 










-5 










1 




Cys Cys 


Asn 


Gly 


Ser 




Asn 


Phe 


His 


Lys 


Asn 


Leu 


Cys 


Ala 


Thr Gly 




5 










10 










15 








lie Lys 


Asn 


Phe 


























20 






























<210> 518 




























<211> 182 




























<212> PRT 




























<213> Homo sapiens 
























<400> 518 




























Met Ser 


Ala 


Thr 


Val 


Val 


Asp Ala 


Val 


Asn 


Ala 


Ala 


Pro 


Leu 


Ser 


Gly 


1 






5 










10 










15 




Ser Lys 


Glu 


Met 


Ser 


Leu 


Glu 


Glu 


Pro 


Lys 


Lys 


Met 


Thr 


Arg Glu Asp 






20 










25 










30 






Trp Arg 


Lys 


Lys 


Lys 


Glu 


Leu 


Glu 


Glu 


Gin 


Arg Lys 


Leu 


Gly Asn Ala 




35 










40 










45 








Pro Ala 


Glu 


Val 


Asp Glu 


Glu 


Gly Lys Asp 


He 


Asn 


Pro 


His 


He 


Pro 


50 










55 










60 










Gin Tyr 


He 


Ser 


Ser 


Val 


Pro 


Trp 


Tyr 


He 


Asp 


Pro 


Ser 


Lys 


Arg 


Pro 


65 








70 










75 










80 


Thr Leu 


Lys 


His 


Gin 


Arg 


Pro 


Gin 


Pro 


Glu 


Lys 


Gin 


Lys 


Gin 


Phe 


Ser 








85 










90 










95 




Ser Ser 


Gly 


Glu 


Trp 


Tyr 


Lys 


Arg 


Gly Val 


Lys 


Glu 


Asn 


Ser 


He 


He 






100 










105 










110 






Thr Lys 


Tyr 


Arg 


Lys 


Gly 


Ala 


Cys 


Glu 


Asn 


Cys 


Gly Ala 


Met 


Thr 


His 




115 










120 










125 








Lys Lys 


Lys 


Asp 


Cys 


Phe 


Glu Arg 


Pro Arg 


Arg Val Gly 


Ala 


Lys 


Phe 


130 










135 










140 










Thr Gly 


Thr 


Asn 


He 


Ala 


Pro Asp Glu 


His 


Val 


Gin 


Pro 


Gin 


Leu 


Met 


145 








150 










155 










160 



I 
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Phe Asp Tyr Asp Gly Lys Arg Asp Arg 
165 

Glu His Met Lys lie Val 
180 

<210> 519 
<211> 147 
<212> PRT 

<213> Homo sapiens 



<400> 519 



Met 


Asp 


Val 


Leu 


Val Ser 


Glu Cys 


Ser 


Ala Arg Leu Leu Gin Gin Glu 


1 








5 








10 






15 


Glu 


Glu 


He 


Lys 


Ser Leu 


Thr 


Ala 


Glu 


He Asp Arg 


Leu 


Lys 


Asn Cys 








20 








25 






30 




Gly 


Cys 


Leu 


Gly 


Ala Ser 


Pro 


Asn 


Leu 


Glu Gin Leu 


Gin 


Glu 


Glu Asn 






35 








40 






45 






Leu 


Lys 


Leu 


Lys 


Tyr Arg 


Leu 


Asn 


He 


Leu Arg Lys 


Ser 


Leu 


Gin Ala 




50 








55 






60 








Glu 


Arg 


Asn 


Lys 


Pro Thr 


Lys 


Asn 


Met 


He Asn He 


He 


Ser 


Arg Leu 


65 








70 








75 






80 


Gin 


Glu 


Val 


Phe 


Gly His 


Ala 


He 


Lys 


Ala Ala Tyr Pro Asp Leu Glu 










85 








90 






95 


Asn 


Pro 


Pro 


Leu 


Leu Val 


Thr 


Pro 


Ser 


Gin Gin Ala 


Lys 


Phe 


Gly Asp 








100 








105 






110 




Tyr 


Gin 


Cys 


Asn 


Ser Ala 


Met 


Gly 


He 


Ser Gin Val 


Met 


Tyr 


Cys His 






115 








120 






125 






Asp 


Ser 


Trp 


Leu 


Phe Asp 


Phe 


Phe 


Lys 


Tyr Tyr Tyr 


His 


His 


Cys His 




130 








135 






140 








Leu 


Gin 


Lys 





















145 



<210> 520 

<211> 65 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -28. .-1 

<400> 520 

Met Asn Leu Phe Arg Phe Leu Gly Asp 
-25 -20 
He Leu Leu Leu Leu Lys He Trp Lys 

-10 -5 
Pro Gin Leu Pro Leu Ser Phe Cys Leu 
5 10 
Leu Ser Leu Ser Val Ser Leu Ser Leu 
25 

Lys 

<210> 521 
<211> 140 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -40. .-1 

<400> 521 

Met Val Ser Ser Phe Arg Val Ser Glu Leu Gin Val Leu Leu Gly Phe 



PCT/IBOl/00914 
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Trp Asn Gly Tyr Asn Pro Glu 
170 175 



Leu Ser His Leu Leu Ala He 
-15 

Ser Arg Ser Cys Ala Ala His 
1 

Ser Val Cys Leu Ser Val Ser 

15 20 
Ser Phe Ser Val Ser Lys Lys 
30 35 



wo 02/083898 
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-40 








-35 


-30 


Ala 


Gly 


Arg 


Asn 


Lys Ser Gly Arg 


Lys His Asp 










-20 


-15 


Leu 


His 


Leu 


Leu 


Lys Ser Gly Cys 


Ser Pro Ala 








-5 




1 


Arg 


Glu 


Leu 


Tyr 


Arg Arg Arg Tyr 


Pro Arg Thr 




10 






15 




Asp 


Leu 


Ser 


Thr 


lie Lys Ser Ser 


Val Phe Ser 


25 








30 


35 


Ser 


Pro 


Val 


Glu 


Pro Asp Leu Ala 


Val Ala Gly 










45 


50 


Ser 


Thr 


Ser 


Val 


Thr Pro His Ser 


Pro Ser Ser 








60 




65 


Leu 


Leu 


Gin 


Asp 


Thr Lys Pro Thr 


Phe Glu Met 






75 




80 




Pro 


lie 


Pro 


Pro 


Val His Pro Asp 


Val Gin Leu 




90 






95 





<210> 522 

<211> 154 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 135 

<223> Xaa = His, Asn, Pro, Thr 

<220> 

<221> UNSURE 
<222> 137 

<223> Xaa = lie, Leu 
<220> 

<221> UNSURE 

<222> 139 

<223> Xaa = Pro, Arg 
<220> 

<221> UNSURE 
<222> 136 

<223> Xaa = Pro, Gin 



<400> 522 



Met 


Ala 


Glu 


Val 


Glu 


Gin 


Lys Lys Lys Arg Thr 


1 








5 




10 


Tyr 


Arg 


Gly 


Val 


Asp 


Leu 


Asp Gin Leu Leu Asp 








20 






25 


Leu 


Met 


Gin 


Leu 


Tyr 


Ser 


Ala Arg Gin Ala Ala 






35 








40 


Ala 


Ala 


Glu 


Ala 


Ala 


Leu 


Pro Ala Glu Ala Pro 




50 










55 


Gly 


Gly 


Ala 


Ala 


His 


Gly 


Glu Ala Gly Ser Gly 


65 










70 


75 


Gly 


His 


Asp 


His 


Pro 


Thr 


Arg Asp Gly Gly Gin 










85 




90 


Gin 


Arg 


Gin 


Asp 


Leu 


Gin 


Pro Gly Gly Asp Gin 








100 






105 


Pro 


Leu 


Pro 


Gly 


Arg 


Val 


Leu His His Leu Gin 






115 








120 


Pro 


Ala 


Arg 


His 


Arg 


Gly 


Xaa Xaa Xaa Leu Xaa 




130 










135 


Val 


Met 


Ala 


Gin 


Leu 


He 


Lys Ala His Met 



PCT/IBOl/00914 







-25 


Leu 


Leu 


Met Arg Ala 






-10 


Val 


Gin 


He Lys He 




5 




Leu 


Glu 


Gly Leu Ser 


20 






Leu 


Asp 


Gly Gly Ser 






40 


He 


His 


Ser Leu Pro 






55 


Pro 


Val 


Gly Ser Val 






70 


Gin 


Gin 


Pro Ser Pro 




85 




Lys 






100 







Phe 


Arg 


Lys Phe 


Thr 






15 




Met 


Ser 


Tyr Glu 


Gin 






30 




Ala 


Glu 


Pro Gly 


Pro 




45 






Ala 


Gin 


Gly Gin 


Glu 


60 








Glu 


Asp 


Ala Pro 


Ala 






80 


His 


Gly 


Gly Arg 


Leu 






95 




Ala 


Arg 


Asp Asp Arg 






110 




Ala 


Arg 


Lys Ala 


Trp 




125 






Leu 


His 


Pro Ser 


Gin 


140 









wo 02/083898 
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<210> 523 
<211> 65 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 63 

<223> Xaa = * ,Ala,Glu,Gly,Ile,Lys,Leu, Pro,Gln,Arg, Ser,Thr,Val 

<400> 523 



Met Ala 


Glu 


Glu 


Gin 


Glv 


Arg 


Glu 


Arg Asp Ser 


Val 


Pro 


Lys 


Pro Ser 


1 






5 








10 








15 


Val Leu 


Phe 


Leu 


His 


Pro 


Asp Leu Gly Val Gly 


Gly Ala 


Glu 


Arg Leu 






20 










25 






30 




Val Leu 


Leu 


Pro 


Phe 


Pro 


Thr Glu Arg Arg Leu 


Leu 


Phe 


Cys 


Thr Ala 




35 










40 






45 






Pro Arg Ser Leu Asn 


Ser 


Leu 


Pro 


Leu Lys Lys 


Lys 


Lys 


He 


Xaa Lys 


50 










55 






60 








Ser 
























65 
























<210> 524 






















<211> 133 






















<212> PRT 






















<213> Homo sapiens 


















<400> 524 






















Met Glu 


Tyr 


Val 


Leu 


Glu 


Val 


Lys 


Asn Ser Pro 


Arg 


His 


Leu 


Leu Lys 


1 






5 








10 








15 


Gin Phe 


Thr 


Val 


Cys 


Asp 


Val 


Pro 


Leu Tyr Asp 


He Cys 


Asp 


Tyr Asn 






20 










25 






30 




Val Ser 


Arg 


Asp 


Arg 


Cys 


Gin 


Glu 


Leu Gly Cys 


Cys 


Phe 


Tyr 


Glu Gly 




35 










40 






45 






Val Cys 


Tyr 


Lys 


Lys 


Ala 


Val 


Pro 


He Tyr He 


His 


Val 


Phe 


Ser Ala 


50 










55 






60 








Leu He 


Val 


He 


He 


Ala 


Gly Ala 


Phe Val He 


Thr 


He 


He 


Tyr Arg 


65 








70 






75 








80 


Val He 


Gin 


Glu 


Ser Arg 


Lys 


Glu 


Lys Ala He 


Pro 


Val 


Tvr 


Val Ala 








85 








90 








95 


Leu Pro 


Gin 


Lys 


Ser 


Ser 


Glu 


Lys 


Ala Glu Leu 


Ala 


Ser 


Ser 


Ser Ser 






100 










105 






110 




Lys Leu 


Gly 


Leu 


Lys 


Leu 


Arg 


Val 


Leu Gly Leu 


Lys 


Val 


Leu 


Gly Leu 




115 










120 






125 






Asp Glu 


Ser 


Asp 


Glu 


















130 
























<210> 525 






















<211> 149 






















<212> PRT 






















<213> Homo sapiens 


















<400> 525 






















Met Arg Tyr 


Asn 


Glu 


Lys 


Glu 


Leu 


Gin Ala Leu 


Ser 


Arg 


Gin 


Pro Ala 


1 






5 








10 








15 


Glu Met 


Ala 


Ala 


Glu 


Leu 


Gly Met 


Arg Gly Pro 


Lys 


Lys 


Gly 


Ser Val 






20 










25 






30 




Leu Lys Arg 


Arg 


Leu 


Val 


Lys 


Leu 


Val Val Asn 


Phe 


Leu 


Phe 


Tyr Phe 




35 










40 






45 






Arg Thr Asp Glu Ala Glu 


Pro Val Gly Ala Leu 


Leu 


Leu 


Glu 


Arg Cys 


50 










55 






60 









wo 02/083898 



Arg Val Val 


Arg 


Glu 


,65 






Asp Pro Glu 


Arg 


Lys 






85 


Gin Glu Trp 


Met 


Glu 




100 




Arg Ser Leu 


He 


Phe 


115 






Asp Pro Leu 


Glu 


Gin 


130 






Ser Gly Leu 


Gin 


Ala 


145 






<210> 526 






<211> 92 






<212> PRT 






<213> Homo sapiens 


<400> 526 






Met Pro Val 


Val 


Pro 


1 




5 


Arg Ser Gly 


Val 


Arg 




20 




Ser Leu Leu 


Lys 


He 


35 






Trp Ser Gin 


Leu 


Leu 


50 






Gly Gly Gly 


Gly Cys 


65 






Trp Val Thr 


Glu 


Gin 



85 



<210> 527 
<211> 60 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -21.. -1 

<220> 

<221> UNSURE 
<222> -5 

<223> Xaa = Ser, Thr 
<400> 527 

Met Asp Pro Asn Cys 
-20 

Xaa Pro Ala Ser Ala 
-5 

Ala Ala Pro Gly Ala 
15 

Cys His Leu Ser Val 
30 

<210> 528 
<211> 120 
<212> PRT 

<213> Homo sapiens 
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Glu Pro Gly Thr Phe 


Ser 


He 


Ser 


Phe 


He 


Glu 


70 


75 










80 


Tyr His Phe Glu Cys 


Ser 


Ser 


Glu 


Glu 


Gin 


Cys 


90 










95 




Ala Leu Arg Arg Ala 


Ser 


Tyr 


Glu 


Phe 


Met 


Arg 


105 








110 






Tyr Arg Asn Glu He Arg 


Lys 


Val 


Thr 


Gly 


Lys 


120 






125 








Phe Gly lie Ser Glu 


Glu 


Ala 


Arg 


Phe 


Gin 


Leu 



135 140 



Ala Leu Gly Arg Pro Arg 


Trp 


Ala 


Asp 


His 


Leu 


10 








15 




Asp Gin Pro Gly Gin Pro 


Gly 


Glu 


Ala 


Pro 


Pro 


25 






30 






Gin Lys Leu Ala Gly Tyr 


Gly 


Gly 


Gly 


Cys 


Leu 


40 




45 








Gly Arg Leu Arg Arg Glu 


Asn 


His 


Leu 


Ser 


Pro 


55 


60 










Ser Glu Pro Arg Leu Cys 


His 


Cys 


Thr 


Pro 


Ala 



70 75 80 

Asp Ser He Ser Lys He Glu 



Ser Cys Ala Ala Gly Val 


Ser 


Cys 


Thr 


Cys Ala 


-15 


-10 








Lys Ser Ala Asn Ala Pro 


Pro 


Ala 


Arg 


Arg Ala 


1 5 








10 


Leu Ala Pro Pro Ser Trp 


Thr 


Pro 


Ala 


Leu Pro 


20 






25 




Cys Pro Lys Glu Val Leu 


Val 









35 



<220> 

<221> SIGNAL 



wo 02/083898 
<222> -35. .-1 
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PCT/IBO 1/0091 4 



<400> 528 
















Met Ser 


Lys 


His 


Leu 


His Cys 


Lys 


Val Leu Gly Leu Pro 


Leu Arg 


Leu 


-35 








-30 




-25 




-20 


Glu Val 


Ser 


Ala 


Ser 


Cys Leu 


Ser 


Gin Ser Leu Ala Met 


Ser Pro 


Gin 








-15 






-10 


-5 




Thr His 


Ser 


Gin 


Thr 


Cys lie 


Arg 


Asn Leu Val Thr Cys 


He Asn 


Tyr 






1 






5 


10 






Pro Arg 


Thr 


Ser 


Thr 


Gly Cys 


Lys 


Gly Thr Thr Thr Gin Arg He 


Met 


15 








20 




25 






Glu Pro 


Val 


Glu 


Leu 


Glu Val Glu Gly Thr Glu Gin Asp Asn Ala 


Lys 


30 








35 




40 




45 


Thr Cys 


Gly Ser 


Leu 


Gly Arg Gly Asn Glu Asn Thr Met 


Leu Arg 


Gly 








50 






55 


60 




Gly Phe 


Ser 


Met 


Asn 


Thr Thr 


Val 


Gly Gin Gly He Ser 


Lys Gin 


Thr 






65 








70 


75 




His His 


Thr 


Ser 


Thr 


Thr Ser 


Ser 









80 85 



<210> 529 
<211> 65 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -14. .-1 



<400> 529 
























Met Leu Cys Trp Leu 


Lys 


Leu 


Val 


Ser 


He 


Thr 


Thr 


Val 


Ala 


He 


He 


-10 










-5 










1 




Leu Asn Trp Ala Gin 


His 


Ala 


Glu 


Asn 


Thr 


Thr 


Glu 


Cys 


Ala 


His 


Trp 


5 






10 










15 








Leu Ser Leu He Gin 


Val 


Thr 


Leu 


Thr 


Ser 


Leu 


Glu 


Ala 


Cys 


Gin 


Asn 


20 




25 










30 










Arg Leu Val Lys Ser 


Lys 


Pro 


Phe 


His 


Leu 


Gin 


Asn 


Phe 


Thr 


Cys 


Lys 


35 


40 










45 










50 



Pro 

<210> 530 
<211> 99 
<212> PRT 

<213> Homo sapiens 



<400> 530 



Met 


Lys 


Lys 


Lys 


Glu 


Glu 


Thr 


Thr 


Leu 


Ser 


Glu 


Met 


Glu 


Pro 


Val 


Glu 


1 








5 










10 










15 




Pro 


Gin 


Tyr 


Gin 

20 


Leu 


Val 


Asn 


Ala 


Glu 
25 


Ser 


Thr 


Ser 


Pro 


Phe 

30 


Leu 


His 


Cys 


Leu 


Arg 


Glu 


Val 


He 


Gly Glu 


Tyr 


Ser 


Val 


His 


Glu 


Phe 


Ser 


Leu 






35 










40 










45 








Leu Gly 


Lys 


Thr 


Glu 


Ser 


Gin 


Gly 


He 


Gly Leu 


Trp 


He 


Ala 


Leu 


Val 




50 










55 










60 










Val 


Phe 


Leu 


Ser 


Phe 


Leu 


He 


Phe 


Ser 


Thr 


Ser 


Phe 


Tyr 


He 


Ser 


Asn 


65 










70 










75 








80 


Ala 


Glu 


Gin 


Pro 


Phe 
85 


Phe 


Lys 


Glu 


Pro 


Pro 
90 


Thr 


Glu 


Ala 


Ala 


Lys 
95 


Glu 


Leu 


Ser 


Leu 





























<210> 531 
<211> 128 
<212> PRT 



wo 02/083898 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 70 

<223> Xaa = Pro,Thr 



<400> 531 



Met 


Ala 


Gly 


Asn 


Leu 


Leu 


Ser 


Gly Ala Gly Arg Arg 


Leu 


Trp 


Asp 


Trp 


1 








5 








10 






15 




Val 


Pro 


Leu 


Ala 


Cys Arg 


Ser 


Phe 


Ser Leu Gly Val 


Pro 


Arg 


Leu 


He 








20 










25 




30 






Gly He 


Arg 


Leu 


Thr 


Leu 


Pro 


Pro 


Pro Lys Val Val 


Asp Arg 


Trp Asn 






35 










40 




45 








Glu 


Lys 


Arg 


Ala 


Met 


Phe 


Gly Val 


Tyr Asp Asn lie 


Gly He 


Leu Gly 




50 










55 




60 










Asn 


Phe 


Glu 


Lys 


His 


Xaa 


Lys Glu Leu He Arg Gly 


Pro 


He 


Trp 


Leu 


65 










70 






75 








80 


Arg Gly 


Trp 


Lys 


Gly Asn 


Glu 


Leu 


Gin Arg Cys He 


Arg 


Lys 


Arg 


Lys 










85 








90 






95 




Met 


Val 


Gly 


Ser 
100 


Arg 


Met 


Phe 


Ala 


Asp Asp Leu His 
105 


Asn 


Leu 
110 


Asn 


Lys 


Arg 


He 


Arg 
115 


Tyr 


Leu 


Tyr 


Lys 


His 
120 


Phe Asn Arg His 


Gly 
125 


Lys 


Phe 


Arg 
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<210> 532 
<211> 52 
<212> PRT 

<213> Homo sapiens 



<400> 532 

Met Phe Leu Thr Leu Ala Asp Thr 
1 5 
Leu Leu Asn Val Tyr Glu Gly Glu 
20 

Leu Leu Glu Ser Trp He He Val 

35 40 
He Ser Ser His 
50 



Cys Lys Leu Arg Gly Met Ser Phe 

10 15 
Ala Thr Val Ser Ser Val Leu Glu 
25 30 
Gly Asn Glu Arg Tyr Phe Asp Gly 
45 



<210> 533 
<211> 8.4 
<212> PRT 

<213> Homo sapiens 



<400> 533 






















Met 


Lys 


Val 


Lys 


He 


Lys 


Cys 


Trp Asn Gly Val Ala 


Thr 


Trp 


Leu 


Trp 


1 








5 








10 






15 




Val 


Ala 


Asn 


Asp 


Glu 


Asn 


Cys 


Gly 


He Cys Arg Met 


Ala 


Phe 


Asn Gly 








20 










25 




30 






Cys 


Cys 


Pro 


Asp 


Cys 


Lys 


Val 


Pro 


Gly Asp Asp Cys 


Pro 


Leu 


Val 


Trp 






35 










40 




45 








Gly 


Gin 


Cys 


Ser 


His 


Cys 


Phe 


His 


Met His Cys He 


Leu 


Lys 


Trp 


Leu 




50 










55 




60 










His 


Ala 


Gin 


Gin 


Val 


Gin 


Gin 


His 


Cys Pro Met Cys 


Arg 


Gin 


Glu 


Trp 


65 










70 






75 








80 


Lys 


Phe 


Lys 


Glu 





















<210> 534 
<211> 80 
<212> PRT 

<213> Homo sapiens 



wo 02/083898 
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<400> 534 

Met Ala Thr Pro Thr Gin Thr Pro Thr Lys Ala Pro Glu Glu Pro Asp 

15 10 15 

Pro Phe Tyr Tyr Asp Tyr Asn Thr Val Gin Thr Val Gly Met Thr Leu 

20 25 30 

Ala Thr lie Leu Phe Leu Leu Gly lie Leu He Val He Ser Lys Lys 

35 40 45 

Val Lys Cys Arg Lys Ala Asp Ser Arg Ser Glu Ser Pro Thr Cys Lys 

50 55 60 

Ser Cys Lys Ser Glu Leu Pro Ser Ser Ala Pro Gly Gly Gly Gly Val 
65 70 75 80 



<210> 535 
<211> 127 
<212> PRT 

<213> Homo sapiens 



<400> 535 






Met 


Ser 


Phe 


Ser 


Gly Lys Tyr Gin 


1 








5 


Ala 


Phe 


Met 


Lys 


Ala lie Gly Leu 








20 




Lys 


Asp 


He 


Lys 


Gly Val Ser Glu 






35 




40 


Lys 


Phe 


Thr 


He 


Thr Ala Gly Ser 




50 






55 


Val 


Gly 


Glu 


Glu 


Cys Glu Leu Glu 


65 








70 


Thr 


Val 


Val 


Gin 


Leu Glu Gly Asp 










85 


Asn 


He 


Lys 


Ser 


Val Thr Glu Leu 








100 




Met 


Thr 


Leu 


Gly 


Asp He Val Phe 






115 




120 



Leu 


Gin 


Ser Gin Glu Asn 


Phe 


Glu 




10 




15 




Pro 


Glu 


Glu Leu He Gin 


Lys 


Gly 


25 




30 






He 


Val 


Gin Asn Gly Lys 


His 


Phe 






45 






Lys 


Val 


lie Gin Asn Glu 


Phe 


Thr 




60 






Thr 


Met 


Thr Gly Glu Lys 


Val 


Lys 






75 




80 


Asn 


Lys 


Leu Val Thr Thr 


Phe 


Lys 




90 




95 




Asn 


Gly Asp He He Thr 


Asn 


Thr 


105 




110 






Lys 


Arg 


He Ser Lys Arg 


He 





125 



<210> 536 
<211> 77 
<212> PRT 

<213> Homo sapiens 
<400> 536 

Met Ala Ala Gly Thr Ser Pro Ala 
1 5 
Ala Ala Gly Pro Thr Ser Gly Ala 
20 

Glu Ser Leu Pro Gly Asn Tyr Ser 

35 40 
Trp He Ala Ser Arg Gin Lys Ser 

50 55 
Ser Arg Lys Asp Lys Gly Gly Lys 
65 70 



Asp 


Ala 


Leu Cys Asp Gly 


Lys 


Lys 




10 




15 




Phe 


Tyr 


Asn Glu Leu Arg 


Ala 


Leu 


25 




30 






Ser 


Ala 


Ser Thr Val Val 


Leu 


Leu 






45 






Gin 


Ser 


Ser Arg Asn Ser 


Gin 


Cys 






60 






Gin 


Glu 


Glu His Glu 







75 



<210> 537 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 65 

<223> Xaa « Asp, Gly 



<400> 537 



wo 02/083898 



Met Pro Gly Pro Thr Pro Ser Gly Thr 
1 5 

Ser Pro Ser Lys Ala Val Ala Ala Arg 

20 25 
Arg Gly Lys Met Pro Ala Val Gly Gin 

35 40 
Arg Gin Ala Pro Gly Gly Cys Gly Asp 

50 55 
Xaa Ser Lys Leu Ala Leu Phe Gin Tyr 
65 70 

<210> 538 
<211> 56 
<212> PRT 

<213> Homo sapiens 
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292 








Asn Val 


Gly Ser 


Ser 


Gly Arg 


10 






15 


Ala Arg 


Asp Pro 


Leu 


Ser Gly 






30 




Gly Val 


Gin Ala 


Ala 


Gin Pro 




45 






Ser Thr 


Gin Lys 


He 


His Leu 




60 






Trp Leu 









75 



<400> 538 

Met Ser Ser Trp Ser Leu Met Cys 

1 5 

Ala Arg Leu Leu Tyr Glu Ala Asn 

20 

Gin Thr Asp Gin Ala Val Glu Phe 

35 40 
Lys Lys Tyr Tyr Glu Lys Met Pro 
50 55 



Met Gin Asp Met Gly Asn Thr Lys 

10 15 
Leu Pro Glu Asn Phe Arg Arg Pro 
25 30 
Phe He Arg Asp Lys Tyr Glu Lys 
45 



<210> 539 
<211> 122 
<212> PRT 

<213> Homo sapiens 



<400> 539 



Met 


Glu 


Ala 


Ser 


Ala 


Leu 


Thr Ser 


1 








5 






Val 


Val 


Arg 


Val 


Ala 


Ser 


Gly Ser 








20 








lie 


Ala 


Thr 


Val 


Val 


He 


Gly Gly 






35 








40 


Met 


Val 


Leu 


Ser 


Ala 


Met 


Gly Phe 




50 










55 


Ser 


He 


Ala 


Ala 


Lys 


Met 


Met Ser 


65 










70 




Gly 


Val 


Ala 


Ser 


Gly 


Ser 


Leu Val 










65 






Thr 


Gly 


Leu 


Ser 


Gly 


Leu 


Thr Lys 








100 








Ala 


He 


Ala 


Ala 


Val 


He Ala Arg 






115 








120 



<210> 540 

<211> 74 

<212> PRT 

<213> Homo sapiens 



Ser 


Ala 


Val 


Thr 


Ser 


Val 


Ala Lys 




10 










15 


Ala 


Val 


Val 


Leu 


Pro 


Leu 


Ala Arg 


25 










30 




Val 


Val 


Ala 


Met 


Ala 


Ala 


Val Pro 










45 






Thr 


Ala 


Ala 


Gly 


He 


Ala 


Ser Ser 








60 








Ala 


Ala 


Ala 


He 


Ala 


Asn 


Gly Gly 






75 








80 


Ala 


Thr 


Leu 


Gin 


Ser 


Leu 


Gly Ala 




90 










95 


Phe 


He 


Leu 


Gly 


Ser 


He 


Gly Ser 


105 










110 




Phe Tyr 













<400> 540 

Met Ala Ser Ser Thr Val Pro Val 
1 5 
Thr Pro Glu He Pro Asp Asn Val 
20 

Arg Phe Ala Thr Asp Arg Asn Asp 

35 40 
Leu Gly Leu Phe Ala Ala Gly Val 
50 55 



Ser Ala Ala Gly Ser Ala Asn Glu 

10 15 
Gly Asp Trp Leu Arg Gly Val Tyr 
25 30 
Phe Arg Arg Asn Leu He Leu Asn 
45 

Trp Leu Ala Arg Asn Leu Ser Asp 
60 
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He Asp Leu Met Ala Pro Gin Pro Gly Val 
65 70 



<210> 541 
<211> 97 
<212> PRT 

<213> Homo sapiens 



<400> 541 



Met 


Ser 


Ser 


Ala 


Pro 


Glu 


Pro Pro Thr Phe Lys 


Lys 


Glu Pro 


Pro 


Lys 


1 








5 




10 






15 




Glu 


Lys 


Glu 


Phe 


Gin 


Ser 


Pro Gly Leu Arg Gly 


Val Arg Thr 


Thr 


Thr 








20 






25 




30 






Leu 


Phe 


Arg Ala 


Val 


Asn 


Pro Glu Leu Phe He 


Lys 


Pro Asn 


Lys 


Pro 






35 








40 




45 






Val 


Met 


Ala 


Phe 


Gly 


Leu 


Val Thr Leu Ser Leu 


Cys 


Val Ala 


Tyr 


He 




50 










55 


60 








Gly 


Tyr 


Leu 


His 


Ala 


He 


Gin Glu Asn Lys Lys 


Asp 


Leu Tyr 


Glu 


Ala 


65 










70 


75 








80 


He 


Asp 


Ser 


Glu 


Gly His 


Ser Tyr Met Arg Arg 


Lys 


Thr Ser 


Lys 


Trp 










85 




90 






95 





Asp 



<210> 542 

<211> 98 

<212> PRT 

<213> Homo sapiens 



<400> 542 



Met 


Ala 


Ala Glu 


Pro 


Leu 


Thr Glu Leu Glu Glu 


Ser He Glu 


Asn 


Val 


1 






5 




10 




15 




Val 


Thr 


Thr Phe 


Phe 


Thr 


Phe Ala Arg Gin Glu 


Gly Arg Lys 


Asp 


Ser 






20 






25 


30 






Leu 


Ser 


Val Asn 


Glu 


Phe 


Lys Glu Leu Val Thr 


Gin Gin Leu 


Pro 


His 






35 






40 


45 






Leu 


Leu 


Lys Asp 


Val 


Gly 


Ser Leu Asp Glu Lys 


Met Lys Ser 


Leu 


Asp 




50 








55 


60 






Val 


Asn 


Gin Asp 


Ser 


Glu 


Leu Lys Phe Asn Glu 


Tyr Trp Arg 


Leu 


He 


65 








70 


75 






80 


Gly Glu 


Leu Ala 


Lys 


Glu 


He Arg Lys Lys Lys 


Asp Leu Lys 


He 


Arg 








85 




90 




95 





Lys Lys 



<210> 543 
<211> 115 
<212> PRT 

<213> Homo sapiens 



<400> 543 



Met 


Val 


Ala 


Ala 


Lys Lys 


Thr 


Lys 


Lys Ser Leu 


Glu 


Ser 


He 


Asn 


Ser 


1 








5 






10 








15 




Arg 


Leu 


Gin 


Leu 


Val Met 


Lys 


Ser Gly Lys Tyr 


Val 


Leu Gly Tyr 


Lys 








20 








25 






30 






Gin 


Thr 


Leu 
35 


Lys 


Met He 


Arg 


Gin 
40 


Gly Lys Ala 


Lys 


Leu 
45 


Val 


He 


Leu 


Ala 


Asn 

50 


Asn 


Cys 


Pro Ala 


Leu 

55 


Arg 


Lys Ser Glu 


He 

60 


Glu 


Tyr 


Tyr 


Ala 


Met 


Leu 


Ala 


Lys 


Thr Gly 


Val 


His 


His Tyr Ser 


Gly Asn 


Asn 


He 


Glu 


65 








70 






75 










80 


Leu Gly 


Thr 


Ala 


Cys Gly 


Lys 


Tyr 


Tyr Arg Val 


Cys 


Thr 


Leu 


Ala 


He 










85 






90 








95 




He 


Asp 


Pro 


Gly 
100 


Asp Ser 


Asp 


He 


He Arg Ser 
105 


Met 


Pro 


Glu 
110 


Gin 


Thr 
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Gly Glu Lys 
115 



<210> 544 
<211> 102 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 85 

<223> Xaa = Asp,Asn 



<400> 544 








Met 


Ala 


Gly 


Gin 


Ala 


Phe Arg Lys 


1 








5 




Leu 


Val 


Glu 


Arg 


Ser 


Ala Ala Glu 








20 






Leu 


Pro 


Glu 


Lys 


Ser Gin Gly Lys 






35 






40 


Val 


Gly 


Ser 


Gly 


Ser 


Lys Gly Lys 




50 








55 


Val 


Lys 


Val 


Gly 


Asp 


Lys Val Leu 


65 










70 


Val 


Val 


Leu 


Asp 


Xaa 


Lys Asp Tyr 










85 




Leu Gly 


Lys 


Tyr 


Val 


Asp 








100 







Phe 


Leu 


Pro 


Leu 


Phe Asp Arg 


Val 




10 






15 




Thr 


Val 


Thr 


Lys 


Gly Gly He Met 


25 








30 




Val 


Leu 


Gin 


Ala 


Thr Val Val 


Ala 










45 




Gly 


Gly Glu 


lie 


Gin Pro Val 


Ser 








60 






Leu 


Pro 


Glu 


Tyr 


Gly Gly Thr 


Lys 






75 






80 


Phe 


Leu 


Phe 


Arg 


Asp Gly Asp 


He 




90 






95 





<210> 545 
<211> 102 
<212> PRT 

<213> Homo sapiens 
<400> 545 



Met 


Ala 


Gly 


Gin 


Ala Phe Arg Lys 


1 








5 


Leu 


Val 


Glu 


Arg 


Ser Ala Ala Glu 








20 




Leu 


Pro 


Glu 


Lys 


Ser Gin Gly Lys 






35 




40 


Val 


Gly 


Ser 


Gly 


Ser Lys Gly Lys 




50 






55 


Val 


Lys 


Val 


Gly 


Asp Lys Val Leu 


65 








70 


Val 


Val 


Leu 


Asp 


Asp Lys Asp Tyr 










85 


Leu 


Gly 


Lys 


Tyr 


Val Asp 








100 





Phe 


Leu 


Pro 


Leu 


Phe Asp Arg Val 




10 






15 




Thr 


Val 


Thr 


Lys 


Gly Gly He 


Met 


25 








30 




Val 


Leu 


Gin 


Ala 


Thr Val Val 


Ala 










45 




Gly 


Gly Glu 


He 


Gin Pro Val 


Ser 








60 






Leu 


Pro 


Glu 


Tyr 


Gly Gly Thr 


Lys 






75 






80 


Phe 


Leu 


Phe 


Arg 


Asp Gly Asp 


He 




90 






95 





<210> 546 
<211> 116 
<212> PRT 

<213> Homo sapiens 



<400> 546 

Met Ser Ala Thr Ala Ala Thr Ala Pro Pro Ala Ala Pro Ala Gly Glu 

15 10 15 

Gly Gly Pro Pro Ala Pro Pro Pro Asn Leu Thr Ser Asn Arg Arg Leu 

20 25 30 

Gin Gin Thr Gin Ala Gin Val Asp Glu Val Val Asp He Met Arg Val 

35 40 45 

Asn Val Asp Lys Val Leu Glu Arg Asp Gin Lys Leu Ser Glu Leu Asp 
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50 










55 








60 










Asp Arg Ala 


Asp 


Ala 


Leu 


Gin 


Ala 


Gly Ala 


Ser 


Gin 


Phe 


Glu 


Thr 


Ser 


65 








70 








75 










80 


Ala Ala 


Lys 


Leu 


Lys 


Arg 


Lys 


Tyr 


Trp Trp 


Lys Asn 


Leu 


Lys Met 


Met 








85 








90 










95 




He He 


Leu 


Gly 


Val 


He 


Cys Ala 


He He 


Leu 


He 


He 


He 


He 


Val 






100 










105 








110 






Tyr Phe 


Ser 


Thr 


























115 


























<210> 547 


























<211> 97 


























<212> PRT 


























<213> Homo sapiens 






















<220> 




























<221> UNSURE 
























<222> 54 


























<223> Xaa = 


Glu, Lys 






















<400> 547 


























Met Ser 


Leu 


Ser 


Leu 


Val 


Phe Arg Ala Ala 


Ser Tyr 


Phe 


Lys 


Leu 


Val 


1 






5 








10 










15 




Pro Phe 


His 


Ser 


Ser 


Ser 


Ser 


Asn 


Gin Phe 


Leu 


Gin 


Pro 


Pro Gly Trp 






20 










25 








30 






Val Val 


Leu 


Thr 


Gin 


Thr 


Leu 


Val 


Leu Leu 


His 


Phe 


Glu 


Arg 


Phe 


Ser 




35 










40 








45 








Tyr Gin 


Asn 


Val 


Pro 


Xaa 


Ser 


Ala 


Gin Gly 


Lys 


Gly Asn 


Leu 


Gin 


Pro 


50 










55 








60 










Glu Thr 


Asn 


He 


His 


Leu 


Phe 


His 


Phe Leu 


Thr 


Phe 


Pro 


Lys 


Gin 


He 


65 








70 








75 










80 


Ser Arg 


Asn 


Leu 


Phe 


Asn 


Ser 


Leu 


Leu Cys 


Leu 


Met 


Cys 


Leu 


Thr 


Tyr 








85 








90 










95 




Phe 




























<210> 548 


























<211> 113 


























<212> PRT 


























<213> Homo sapiens 






















<220> 




























<221> UNSURE 
























<222> 110 


























<223> Xaa = 


Ser, 


Thr 






















<400> 548 


























Met Ser 


Met 


Thr 


Asp 


Leu 


Leu 


Asn 


Ala Glu 


Asp 


He 


Lys 


Lys 


Ala 


Val 


1 






5 








10 










15 




Gly Ala 


Phe 


Ser 


Ala 


Thr Asp Ser 


Phe Asp 


His 


Lys 


Lys 


Phe 


Phe 


Gin 






20 










25 








30 






Met Val 


Gly 


Leu 


Lys 


Lys 


Lys 


Ser 


Ala Asp 


Asp 


Val 


Lys 


Lys 


Val 


Phe 




35 










40 








45 








His Met 


Leu 


Asp 


Lys 


Asp 


Lys 


Ser Gly Phe 


He Glu Glu Asp Glu 


Leu 


50 










55 








60 










Gly Phe 


He 


Leu 


Lys 


Gly 


Phe 


Ser 


Pro Asp 


Ala 


Arg Asp 


Leu 


Ser 


Ala 


65 








70 








75 










80 


Lys Glu 


Thr 


Lys 


Met 


Leu 


Met 


Ala 


Ala Gly 


Asp Lys Asp Gly Asp Gly 








85 








90 










95 




Lys He 


Gly 


Val 


Asp 


Glu 


Phe 


Ser 


Thr Leu 


Val 


Ala 


Glu 


Xaa 


Lys 


Lys 






100 










105 








110 







His 



<210> 549 
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<211> 112 
<212> PRT 

<213> Homo sapiens 



<400> 549 



Met 


Asp 


Pro 


Ara 


Lys 


Val 


Asn Glu 


1 








5 






Lys 


Gin 


Asp 


Pro 


Ser 


Val 


Leu His 








20 








Glu Trp 


Val 


Glu 


Ser 


Met 


Gly Gly 






35 








40 


Ala 


Lys 


Ser 


Glu 


Glu 


Asn 


Thr Lys 




50 










55 


Val 


Glu 


Glu 


Asp 


Leu 


Lys 


Ala Asp 


65 










70 




Leu 


Glu 


He 


Asp 


Lys 


Glu 


Gly Val 










85 






Gin 


Glu 


Met 


Gly 


Asp Glu 


Asn Ala 








100 









<210> 550 
<211> 118 
<212> PRT 

<213> Homo sapiens 



<400> 550 



Met 


Val 


Leu 


Leu 


Glu 


Ser 


Glu Gin 


1 








5 






Phe 


Gin 


Lys 


Cys 
20 


Arg 


Thr 


Ser Gly 


Tyr 


Asp 


Gly 
35 


Arg 


Thr 


Lys 


Pro lie 
40 


Phe 


Glu 


Pro 


Ala 


Asp Asn 


Lys Cys 




50 










55 


Lys 


Lys 


He 


Ser 


Thr 


Val 


Val Ser 


65 










70 




Met 


Ala 


Tyr 


Ser 


Asn 
85 


Leu 


Leu Arg 


Arg 


Asp 


Lys 


Lys 
100 


Asn 


Glu 


Thr Lys 


Ala 


Ala 


Thr 
115 


Ala 


Ala 


Gin 





<210> 551 
<211> 109 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 40 

<223> Xaa = Gin, Arg 
<220> 

<221> UNSURE 
<222> 10 

<223> Xaa = His, Arg 
<220> 

<221> UNSURE 
<222> 38 

<223> Xaa = Phe, He 



296 



Leu Arg 


Ala 


Phe 


Val 


Lys 


Met Cys 




10 










15 


Thr 


Glu 


Glu 


Met 


Arg 


Phe 


Leu Arg 


25 










30 




Lys 


Val 


Pro 


Pro 


Ala 


Thr 


Gin Lys 










45 






Glu 


Glu 


Lys 


Pro Asp 


Ser 


Lys Lys 








60 








Glu 


Pro 


Ser 


Ser 


Glu 


Glu 


Ser Asp 






75 








80 


He 


Glu 


Pro Asp 


Thr Asp 


Ala Pro 




? w 










95 


Glu 


He 


Thr 


Ala 


Gin 


His 


Phe Leu 


105 










110 




Phe 


Leu 


Thr 


Glu 


Leu 


Thr 


Arg Leu 




10 










15 


Ser 


Val 


Tyr 


He 


Thr 


Leu 


Lys Lys 


25 










30 




Pro Lys 


Lys Gly Thr Val 


Glu Gly 










45 






Leu 


Leu 


Arg 


Ala 


Thr Asp 


Gly Lys 








60 








Ser 


Lys 


Glu 


Val 


Asn 


Lys 


Phe Gin 






75 








80 


Ala 


Asn 


Met 


Asp 


Gly 


Leu 


Lys Lys 




90 










95 


Lys 


Thr 


Lys 


Ala 


Ala 


Ala 


Ala Ala 


105 










110 
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<220> 

<221> UNSURE 
<222> 97 

<223> Xaa = Phe,Leu 
<400> 551 



Met 


ASD 


Ser 


Leu 


Pro 


Ser Gly Lys 


He 


Xaa Arg 


Lys Val 


Lys 


He He 


1 








5 






10 






15 


Leu 


Gly 


Arg 


Asn 


Arg 


Lys Glu Asn 


Leu 


Glu Pro 


Asn Ala 


Glu 


Phe Asp 








20 






25 






30 




Lys 


Arg 


Thr 


Glu 


Phe 


Xaa Thr Xaa 


Glu 


Glu Asn 


Arg He 


Cys 


Ser Ser 






35 






40 






45 






Pro 


Val 


Gin 


Ser 


Leu 


Leu Asp Leu 


Phe 


Gin Thr 


Ser Glu 


Glu 


Lys Ser 




50 








55 






60 






Glu 


Phe 


Leu 


Gly 


Phe 


Thr Ser Tyr 


Thr 


Glu Lys 


Ser Gly 


He 


Cys Asn 


65 










70 




75 






80 


Val 


Leu 


Asp 


lie 


Trp 


Glu Glu Glu 


Asn 


Ser Asp 


Asn Leu 


Leu 


Thr Ala 










85 






90 






95 


Xaa 


Phe 


Ser 


Ser 


Pro 


Ser Thr Ser 


Thr 


Phe Thr 


Gly Phe 












100 






105 











<210> 552 
<211> 103 
<212> PRT 

<213> Homo sapiens 



<400> 552 



Met Ser 


Gly 


Asp 


Gly 


Ala 


Thr 


Glu 


Gin 


Ala 


Ala 


Glu 


Tyr Val 


Pro 


Glu 


1 






5 










10 








15 




Lys Val 


Lys 


Lys 
20 


Ala 


Glu 


Lys 


Lys 


Leu 
25 


Glu 


Glu 


Asn 


Pro Tyr 
30 


Asp 


Leu 


Asp Ala 


Trp 


Ser 


He 


Leu 


He Arg Glu 


Ala 


Gin 


Asn 


Gin Pro 


He 


Asp 




35 










40 










45 






Lys Ala 


Arg 


Lys 


Thr 


Tyr 


Glu 


Arg 


Leu 


Val 


Ala 


Gin 


Phe Pro 


Ser 


Ser 


50 










55 










60 








Gly Arg 


Phe 


Trp 


Lys 


Leu 


Tyr 


He 


Glu 


Ala 


Glu 


Val 


Thr He 


Leu 


Phe 


65 








70 










75 








80 


Tyr Phe 


Phe 


Leu 


Tyr 


Gin 


Tyr 


Cys 


Ser 


He 


His 


Cys 


Ser Asp Arg 


Lys 








85 










90 








95 




Gin Val 


Arg 


Asn 
100 


He 


Ala 


Asn 



















<210> 553 
<211> 106 
<212> PRT 
<213> Homo sapiens 



<400> 553 



Met 


Leu 


Lys 


Ser 


Asn 


Gly Glu 


Arg Arg 


Ser Arg 


Asn 


Ala Leu 


Pro Ala 


1 








5 








10 








15 


Val 


Tyr 


Ala 


Arg 
20 


Lys 


Met 


Ala 


Ala Ser 
25 


Gin 


Gin 


Gin 


Ala Ser 
30 


Ala Ala 


Ser 


Ser 


Ala 
35 


Ala 


Gly 


Val 


Ser 


Gly Pro 
40 


Ser 


Ser 


Ala 


Gly Gly 
45 


Pro Gly 


Pro 


Gin 


Gin 


Gin 


Pro 


Gin 


Pro 


Pro Ala 


Gin 


Leu 


Val Gly Pro Ala Gin 




50 










55 








60 






Ser Gly 


Leu 


Leu 


Gin 


Gin 


Gin 


Gin Gin 


Asp 


Phe 


Asp 


Pro Val 


Gin Arg 


65 










70 








75 






80 


Tyr 


Lys 


Met 


Leu 


He 
85 


Pro 


Gin 


Leu Lys 


Glu 
90 


Ser 


Leu 


Gin Val 


He Gly 
95 


Leu 


Lys 


Gin 


Arg 
100 


Glu 


Ala 


Asn 


Trp He 
105 


Trp 
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<210> 554 
<211> 86 
<212> PRT 

<213> Homo sapiens 
<400> 554 



Met Ala Glu 


Asp Met 


Glu 


Thr 


Lys 


He 


Lys 


Asn Tyr Lys 


Thr 


Ala 


Pro 


1 


5 










10 






15 




Phe Asp Ser 


Arg Phe 


Pro 


Asn 


Gin 


Asn 


Gin 


Thr Arg Asn 


Cys 


Trp 


Gin 




20 








25 






30 






Asn Tyr Leu Asp Phe 


His 


Arg Cys 


Gin 


Lys 


Ala Met Thr 


Ala 


Lys 


Gly 


35 








40 






45 








Gly Asp lie 


Ser Val 


Cys 


Glu 


Trp 


Tyr 


Gin 


Arg Val Tyr 


Gin 


Ser 


Leu 


50 






55 








60 








Cys Pro Thr 


Ser Trp Val 


Thr Asp Trp Asp 


Glu Gin Arg Ala Glu 


Gly 


65 




70 










75 






80 


Thr Phe Pro Gly Lys 


He 




















85 




















<210> 555 






















<211> 105 






















<212> PRT 






















<213> Homo sapiens 




















<400> 555 






















Met Cys Val 


Gly Leu 


Leu 


Met 


Val 


Ser 


He 


Leu Thr Aso 


He 


Asn 


Leu 


1 


5 










10 






15 




Ser Asn Leu 


Val Ala 


Val 


Gin 


Tyr 


Ser 


Pro 


Asp Tyr Cys 


Asn 


Phe 


Arg 




20 








25 






30 






Lys Arg Ser 


Asp Lys 


Asn 


Gin 


Asp 


Ala 


Ser 


Thr Phe Cvs 


His 


Asn 


Cvs 


35 








40 






45 








Asn Gin Phe 


His Leu Cys 


Leu 


Gin 


Tyr 


His 


His Lys He 


Val 


Leu 


Pro 


50 






55 








60 








Trp Ser Val 


Leu Ala 


lie 


Leu 


Ser 


Gin 


Leu 


Phe Leu His 


He 


Thr 


Ser 


65 




70 










75 






80 


Arg lie Arg 


Pro Thr 


He 


Tyr Lys 


Ser 


Asn 


Lys Pro Lys 


Ser 


He 


Glu 




85 










90 






95 




lie Leu lie 


Gly Phe 


Val 


Leu 


Asn 


Leu 














100 








105 












<210> 556 






















<211> 101 






















<212> PRT 






















<213> Homo sapiens 




















<220> 






















<221> UNSURE 




















<222> 95 






















<223> Xaa = 


Cys, Trp 




















<220> 






















<221> UNSURE 




















<222> 86 






















<223> Xaa = 


Leu r Val 




















<400> 556 






















Met Glu Arg 


Pro Asp Lys 


Ala 


Ala 


Leu 


Asn 


Ala Leu Gin 


Pro 


Pro 


Glu 


1 


5 










10 






15 




Phe Arg Asn 


Glu Ser 


Ser 


Leu 


Ala 


Ser 


Thr 


Leu Lys Thr 


Leu 


Leu 


Phe 




20 








25 






30 






Phe Thr Ala 


Leu Met 


He 


Thr 


Val 


Pro 


He 


Gly Leu Tyr 


Phe 


Thr 


Thr 


35 








40 






45 








Lys Ser Tyr 


lie Phe 


Glu 


Gly Ala 


Leu Gly 


Met Ser Asn 


Arg 


Asp 


Ser 



I 



wo 02/083898 



50 55 
Tyr Phe Tyr Ala Ala lie Val Ala Val 
65 70 
Ala Leu Phe Val Tyr Xaa Ala Trp Asn 
85 

Glu Gly Lys Gin Asp 
100 



PCT/TBOl/00914 
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60 

Val Ala Val His Val Val Leu 

75 80 
Glu Gly Ser Arg Gin Xaa Arg 
90 95 



<210> 557 
<211> 92 
<212> PRT 

<213> Homo sapiens 



<400> 557 




Met 


Phe 


Arg Asp 


Phe Gly Arg Arg 


1 






5 


Val 


Asp 


Ala Arg 


Leu Arg Leu Ser 






20 




Lys 


Pro 


Lys Pro 


Val Glu Val Gin 






35 


40 


Tyr 


Ala 


Val Trp 


Phe Gly Gly Ser 




50 




55 


Phe 


Gin 


Val Cys 


His Thr Lys Lys 


65 






70 


He 


Cys 


Arg His 


Asn Pro Val Phe 



85 



Leu Gin Arg Asp Leu Lys Arg Val 

10 15 
Glu Glu Leu Ser Gly Gly Arg lie 
25 30 
Val Val Thr His His Met Gin Arg 
45 

Met Leu Ala Ser Thr Pro Glu Phe 
60 

Asp Tyr Glu Glu Tyr Gly Pro Ser 

75 80 
Gly Val Met Ser 
90 



<210> 558 
<211> 98 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 86 

<223> Xaa = ♦ ,Glu 

<220> 

<221> UNSURE 
<222> 51 

<223> Xaa » Asp, Tyr 



<400> 558 



Met Ser 


Val 


Ser Phe His 


Thr 


His 


Thr Lys Glu 


Leu Trp Thr Trp Met 


1 




5 






10 


15 


Glu Asp 


Leu 


Gin Lys Glu 


Met 


Leu 


Glu Asp Val 


Cys Ala Asp Ser Val 






20 






25 


30 


Asp Ala 


Val 


Gin Glu Leu 


He 


Lys 


Gin Phe Gin 


Gin Gin Gin Thr Ala 




35 






40 




45 


Thr Leu 


Xaa 


Ala Thr Leu 


Asn 


Val 


He Lys Glu 


Gly Glu Asp Leu He 


50 






55 






60 


Gin Gin 


Leu 


Arg Ser Ala 


Pro 


Pro 


Ser Leu Gly 


Glu Pro Ser Glu Ala 


65 




70 






75 


80 


Arg Ser 


Ala 


Trp Ala Xaa 


Leu 


Ser 


Ser Gly Lys 


Cys Leu Gly Leu Asp 






85 






90 


95 



Val Arg 



<210> 559 

<211> 89 

<212> PRT 

<213> Homo sapiens 



<400> 559 



wo 02/083898 PCT/IBO 1/009 14 

300 

Met Val Ser Gly Asp Gly Phe Leu Val Ser Arg Pro Glu Ala He His 

15 10 15 

Leu Gly Pro Arg Gin Ala Val Arg Pro Ser Val Arg Ala Glu Ser Arg 

20 25 30 

Arg Val Asp Gly Gly Gly Arg Ser Pro Arg Glu Pro Asp Gly Arg Gly 

35 40 45 

Arg Ser Arg Gin Ala Arg Phe Ser Pro Tyr Pro He Pro Ala Val Glu 

50 55 60 

Pro Asp Leu Leu Arg Ser Val Leu Gin Gin Arg Leu He Ala Leu Gly 
65 70 75 80 

Gly Val He Ala Ala Arg He Ser Val 
85 

<210> 560 
<211> 172 
<212> PRT 

<213> Homo sapiens 
<400> 560 

Met Ala Gly Ala Ala Thr Gin Ala Ser Leu Glu Ser Ala Pro Arg He 

15 10 15 

Met Arg Leu Val Ala Glu Cys Ser Arg Ser Arg Ala Arg Ala Gly Glu 

20 25 30 

Leu Trp Leu Pro His Gly Thr Val Ala Thr Pro Val Phe Met Pro Val 

35 40 45 

Gly Thr Gin Ala Thr Met Lys Gly He Thr Thr Glu Gin Leu Asp Ala 

50 55 60 

Leu Gly Cys Arg He Cys Leu Gly Asn Thr Tyr His Leu Gly Leu Arg 
65 70 75 80 

Pro Gly Pro Glu Leu He Gin Lys Ala Asn Gly Leu His Gly Phe Met 

85 90 95 

Asn Trp Pro His Asn Leu Leu Thr Asp Ser Gly Gly Phe Gin Met Val 

100 105 110 

Ser Leu Val Ser Leu Ser Glu Val Thr Glu Glu Gly Val Arg Phe Arg 

115 120 125 

Ser Pro Tyr Asp Gly Asn Glu Thr Leu Leu Ser Pro Glu Lys Ser Val 

130 135 140 

Gin He Gin Asn Ala Leu Gly Ser Asp He He Met Gin Leu Asp Asp 
145 150 155 160 

Val Val Ser Ser Thr Val Thr Gly Pro Arg Val Glu 
165 170 

<210> 561 
<211> 528 
<212> DMA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 57.. 527 

<220> 

<221> sig^peptide 

<222> 57. .143 

<223> Von Heijne matrix 

score 14.3000001907349 

seq LLVCCLLTPGVQG/QE 

<400> 561 

cttttttccc ctgcctgccc ttcgggcacc tcaggaaggc accttcctct gtcaga atg 59 

Met 

get acc atg gta cca tec gtg ttg tgg ccc agg gee tgc tgg act ctg 107 
Ala Thr Met Val Pro Ser Val Leu Trp Pro Arg Ala Cys Trp Thr Leu 



wo 02/083898 



PCT/IBOl/00914 









-25 






ctg 


qtc 


tac 


tqt 


eta 


eta 


Leu 


Val 


Cys 


Cys 


Leu 


Leu 






-10 








ttg 


cag 


Qtq 


gag 


ccc 


cag 


Leu 


Arg 


Val 


Glu 


Pro 


Gin 


5 










10 


ttt 


QtQ 


aac 


tgc 


agt 


act 


Phe 


Val 


Asn 


Cys 


Ser 


Thr 










25 




gag 


acg 


tec 


eta 


tea 


aag 


Glu 


Thr 


Ser 


Leu Ser Lys 








40 






gcc 


ttc 


aat 


etc 


age 


aac 


Ala 


Phe 


Asn 


Leu 


Ser 


Asn 






55 










tac 


tgc 


aat 


ggc 


tec 


Val 


Tvr 


Cvs 


Asn Gly Ser 




70 










tac 


agg 


etc 


ccg 


gag 


cgt 


Tyr 


Arg 


Leu 


Pro 


Glu 


Arg 


85 










90 


ccg 


gtg 


ggc 


cag 


aac 


ttc 


Pro 


Val 


Gly 


Gin 


Asn 


Phe 










105 




gcc 


ccg 


gac 


cak 


cet 


eac 


Ala 


Pro 


Asp 


Xaa 


Pro 


His 



120 



301 

-20 

ace cca ggt gtc cag ggg 
Thr Pro Gly Val Gin Gly 
-5 

aac cet gtg etc tet get 
Asn Pro Val Leu Ser Ala 
15 

gat tgt ccc age tet gag 
Asp Cys Pro Ser Ser Glu 
30 

gag ctg gtg gee agt ggc 
Glu Leu Val Ala Ser Gly 
45 

gtg act ggc aac agt egg 
Val Thr Gly Asn Ser Arg 
60 

cag ata aca ggc tec tet 
Gin lie Thr Gly Ser Ser 
75 80 
gtg gag ctg gea ccc ctg 
Val Glu Leu Ala Pro Leu 
95 

ace cet gcg ctg cca agt 
Thr Pro Ala Leu Pro Ser 
110 

ggt ggt get get teg ctg 
Gly Gly Ala Ala Ser Leu 
125 



-15 

cag gag ttc ett 155 

Gin Glu Phe Leu 

1 

gga ggg tec ctg 203 
Gly Gly Ser Leu 
20 

aaa ate gee ttg 251 
Lys lie Ala Leu 
35 

atg ggc tgg gea 299 
Met Gly Trp Ala 

50 

ate etc tgc tea 347 

lie Leu Cys Ser 

65 

aac ate acc gtg 395 
Asn lie Thr Val 

cet cet tgg cag 4 43 

Pro Pro Trp Gin 
100 

gga gga tgg gtc 4 91 

Gly Gly Trp Val 
115 

g 528 



<210> 562 
<211> 682 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 230.. 682 



<220> 

<221> sigjpeptide 
<222> 230.. 277 
<223> Von Heijne matrix 
score 13 

seq LLLLLSALAGLFG/AA 



<400> 562 

gtgtttcgat acagactaag cttttaggcc aaccctcctg 

gtggcatgca tgaaaagtaa acatcagaga ectgaagaag 

aggccagtca ceaagacagg eatctcaaat eggctgattc 

atcttgaaag aaaageteca ggtcccttet ecagceacec 



ctg ctg 
Leu Leu 

gga caa 
Gly Gin 
5 

aat ttt 
Asn Phe 
20 

ate cca 
lie Pro 



ctg ctg 
Leu Leu 
-10 
gea ttt 
Ala Phe 

gac gtg 
Asp Val 

aca acc 
Thr Thr 



ett tec gea ctg 
Leu Ser Ala Leu 



cat ett 
His Leu 

aat aag 
Asn Lys 

25 
ttt gag 
Phe Glu 



ggg aag 
Gly Lys 
10 

tat etc 
Tyr Leu 

aat gga 
Asn Gly 



get ggc etc 
Ala Gly Leu 
-5 

tgc ccc aat 
Cys Pro Asn 

gga aga tgg 
Gly Arg Trp 
30 

egc tgc ate 
Arg Cys lie 



actggatggg ggeggeggge 60 
cttataaaat agcttgggag 120 
tgcatctgga aactgcettc 180 
agecccaag atg gtg atg 238 
Met Val Met 
-15 

ttc ggt gcg gea gag 286 
Phe Gly Ala Ala Glu 

1 

cet ccg gtg cag gag 334 

Pro Pro Val Gin Glu 

15 

tac gaa att gag aag 382 
Tyr Glu He Glu Lys 
35 

cag gcc aac tac tea 4 30 

Gin Ala Asn Tyr Ser 
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302 











40 






45 


50 






eta 


atg 


gaa 


aac 


gga 


aag 


ate 


aaa gtg tta aac eag gag ttg 


aga 


get 


478 


Leu 


Met 


Glu 


Asn 


Gly Lys 


He 


Lys Val Leu Asn Gin Glu Leu Arg Ala 










55 








60 65 








gat 


gga 


act 


ata 


aat 


caa 


ate 


gaa ggt gaa gee ace cea gtt 


aac 


etc 


526 


Asp 


Gly 


Thr 


Val 


Asn Gin He Glu Gly Glu Ala Thr Pro Val 


Asn 


Leu 








70 










75 80 








aca 


Qaa 


cct 


QCC 


aag 


ctg 


gaa 


gtt aag ttt tec tgg ttt atg 


cca 


teg 


574 


Thr 


Glu 
85 


Pro 


Ala 


Lys 


Leu 


Glu 
90 


Val Lys Phe Ser Trp Phe Met 
95 


Pro 


Ser 




gca 


ccg 


tac 


tgg 


ate 


etg gee 


ace gae tat gag aac tat gee 


etc 


gtg 


622 


Ala 


Pro 






lie 


Leu 


Ala 


Thr Asp Tyr Glu Asn Tyr Ala 


Leu 


Val 




100 










105 




110 




115 




tat 


tec 


tgt 


ace 


tgc 


ate 


ate 


caa ett ttt cac gtg gat ttt 


get 


tgg 


670 


Tyr 


Ser 


Cys 


Thr 


Cys 


He 


He Gin Leu Phe His Val Asp Phe Ala Trp 












120 






125 


130 






ate 


ttg 


gca 


aga 














682 


He 


Leu 


Ala 


Arg 
135 

















<210> 563 
<211> 500 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> CDS 
<222> 72. .500 

<220> 

<221> sig__peptide 

<222> 72. .122 

<223> Von Heijne matrix 

score 10.6999998092651 

seq VLMLLAVLIWTGA/EN 

<220> 

<221> misc_feature 
<222> 1..3 

<223> n«a, g, e or t 
<400> 563 

nnngaacatc taatcgaaaa cttgtttcct gagttgtcct gtgctggagg tctgctcaga 60 
cgaaggtetc e atg gcg tta gaa gtc ttg atg etc etc get gtc ttg att 110 
Met Ala Leu Glu Val Leu Met Leu Leu Ala Val Leu He 













-15 








-10 








-5 




tgg 


ace 


ggt 


get 


gag 


aac 


etc 


cat 


gtg 


aaa 


ata 


agt 


tgc 


tet 


etg 


gae 


158 


Trp 


Thr 


Gly 


Ala 


Glu 
1 


Asn 


Leu 


His 


Val 
5 


Lys 


He 


Ser 


Cys 


Ser 
10 


Leu 


Asp 




tgg 


ttg 


atg 


gtc 


tea 


gtt 


ate 


cca 


gtt 


gca 


gaa 


age 


aga 


aat 


ctg 


tat 


206 


Trp 


Leu 


Met 
15 


Val 


Ser 


Val 


He 


Pro 
20 


val 


Ala 


Glu 


Ser 


Arg 
25 


Asn 


Leu 


Tyr 




ata 


ttt 


gcg 


gat 


gaa 


tta 


eat 


etg 


gga 


atg 


ggc 


tgc 


cct 


gca 


aat 


egg 


254 


He 


Phe 

30 


Ala 


Asp 


Glu 


Leu 


His 

35 


Leu 


Gly 


Met 


Gly 


Cys 
40 


Pro 


Ala 


Asn 


Arg 




ata 


cat 


aca 


tat 


gta 


tat 


gag 


ttt 


ata 


tat 


ctt 


gtt 


cgt 


gat 


tgt 


ggc 


302 


He 


His 


Thr 


Tyr 


Val 


Tyr 


Glu 


Phe 


He 


Tyr 


Leu 


Val 


Arg 


Asp 


Cys 


Gly 




45 










50 










55 










60 




ate 


agg 


aca 


agg 


gta 


gtt 


tet 


gag 


gaa 


act 


etc 


ctt 


ttt 


caa 


ace 


gag 


350 


He 


Arg 


Thr 


Arg 


Val 
65 


Val 


Ser 


Glu 


Glu 


Thr 
70 


Leu 


Leu 


Phe 


Gin 


Thr 
75 


Glu 




ctg 


tac 


ttt 


ace 


cea 


agg 


aat 


ata 


gat 


cat 


gac 


cct 


eag 


gaa 


ate 


cat 


398 


Leu 


Tyr 


Phe 


Thr 


Pro 


Arg 


Asn 


He 


Asp 


His 


Asp 


Pro 


Gin 


Glu 


He 


His 
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303 

80 85 
ttg gag tgt tec acc tct agg aaa tea gtg tgg 
Leu Glu Cys Ser Thr Ser Arg Lys Ser Val Trp 

95 100 
act gag aat gaa ata aaa ttg gat cct agt cct 
Thr Glu Asn Glu He Lys Leu Asp Pro Ser Pro 

110 115 
cag aca 
Gin Thr 
125 

<210> 564 
<211> 497 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 52. .495 

<220> 

<221> sig^peptide 

<222> 52.. 102 

<223> Von Heijne matrix 

score 10.6999998092651 

seq VLMLLAVLIWTGA/EN 



<400> 564 

cttgtttcct gagttgtcct gtgctggagg tctgctcaga cgaaggtctc c atg gcg 57 

Met Ala 



tta 


gaa 


gte 


ttg 


atg 


etc 


etc 


get 


gte 


ttg 


att 


tgg 


ace 


ggt 


get 


gag 


105 


Leu 


Glu 


Val 


Leu 


Met 


Leu 


Leu 


Ala 


Val 


Leu 


He 


Trp 


Thr 


Gly 


Ala 


Glu 




-15 










-10 










-5 










1 




aac 


etc 


cat 


gtg 


aaa 


ata 


agt 


tgc 


tct 


ctg 


gac 


tgg 


ttg 


atg 


gte 


tea 


153 


Asn 


Leu 


His 


Val 


Lys 


He 


Ser 


Cys 


Ser 


Leu 


Asp 


Trp 


Leu 


Met 


Val 


Ser 










5 










10 










15 








gtt 


ate 


cea 


gtt 


gca 


gaa 


age 


aga 


aat 


ctg 


tat 


ata 


ttt 


gcg 


gat 


gaa 


201 


Val 


He 


Pro 


Val 


Ala 


Glu 


Ser 


Arg 


Asn 


Leu 


Tyr 


He 


Phe 


Ala 


Asp 


Glu 








20 










25 










30 










tta 


cat 


ctg 


gga 


atg 


ggc 


tgc 


cct 


gca 


aat 


egg 


ata 


cat 


aca 


tat 


gta 


249 


Leu 


His 


Leu Gly Met 


Gly Cys 


Pro 


Ala 


Asn 


Arg 


He 


His 


Thr 


Tyr 


Val 






35 










40 










45 












tat 


gag 


ttt 


ata 


tat 


ctt 


gtt 


cgt 


gat 


tgt 


ggc 


ate 


agg 


aca 


agg 


gta 


297 


Tyr 


Glu 


Phe 


He 


Tyr 


Leu 


Val 


Arg 


Asp 


Cys 


Gly 


He 


Arg 


Thr 


Arg 


Val 




50 










55 










60 










65 




gtt 


tct 


gag 


gaa 


act 


etc 


ctt 


ttt 


caa 


acc 


gag 


ctg 


tac 


ttt 


acc 


cea 


345 


Val 


Ser 


Glu 


Glu 


Thr 


Leu 


Leu 


Phe 


Gin 


Thr 


Glu 


Leu 


Tyr 


Phe 


Thr 


Pro 












70 










75 










80 






agg 


aat 


ata 


gat 


cat 


gac 


cct 


cag 


gaa 


ate 


cat 


ttg 


gag 


tgt 


tec 


acc 


393 


Arg 


Asn 


He Asp His 


Asp 


Pro 


Gin 


Glu 


He 


His 


Leu 


Glu 


Cys 


Ser 


Thr 










85 










90 










95 








tct 


agg 


aaa 


tea 


gtg 


tgg 


ctt 


aca 


cea 


gtt 


tct 


act 


gag 


aat 


gaa 


ata 


441 


Ser Arg 


Lys 


Ser 


Val 


Trp 


Leu 


Thr 


Pro 


Val 


Ser 


Thr 


Glu 


Asn 


Glu 


He 








100 










105 










110 










aaa 


ttg 


gat 


cct 


agt 


cct 


ttt 


att 


get 


gac 


ttt 


cag 


aca 


aca 


gca 


gaa 


489 


Lys 


Leu 


Asp 


Pro 


Ser 


Pro 


Phe 


He 


Ala 


Asp 


Phe 


Gin 


Thr 


Thr 


Ala 


Glu 





115 120 125 

gag tta gg 497 
Glu Leu 
130 



PCT/IBOl/00914 

90 

ctt aca cea gtt tct 44 6 

Leu Thr Pro Val Ser 
105 

ttt att get gac ttt 4 94 

Phe He Ala Asp Phe 

120 

500 



<210> 565 
<211> 342 
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304 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 118.. 342 

<220> 

<221> sig peptide 

<222> 1187.189 

<223> Von Heijne matrix 

score 10.6000003814697 

seq LLLLLLCPAHVGG/LW 

<400> 565 

tccccggccg ccgccgttgc gctcgccgcg ctcgcactga agcccgggcc ctcgcgcgcc 60 
gcggttcgcc ccgcagcctc gccccctgcc cacccgggcg gccgtagggc ggtcacg 117 



atg 


ctg 


ccg 


cec 


tta 


ecc 


tee cge etc ggg 


ctg 


ctg 


ctg 


ctg 


ctg 


etc 


165 


Met 


Leu 


Pro 


Pro 


Leu 


Pro 


Ser Arg Leu Gly 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 












-20 




-15 










-10 






ctg 


tgc 


ccg 


gcg 


cac 


gtc 


ggc gga ctg tgg 


tgg 


get 


gtg 


ggc 


age 


cec 


213 


Leu 


Cys 


Pro 


Ala 
-5 


His 


Val 


Gly Gly Leu Trp 
1 


Trp 


Ala 


Val 

5 


Gly 


Ser 


Pro 




ttg 


gtt 


atg 


gac 


cet 


acc 


age ate tgc agg 


aag 


gca 


egg 


egg 


ctg 


gee 


261 


Leu 


Val 


Met 


Asp 


Pro 


Thr 


Ser He Cys Arg 


Lys 


Ala Arg 


Arg 


Leu 


Ala 






10 










15 




20 












ggg 


egg 


cag 


gcc 


gag 


ttg 


tgc cag get gag 


ccg 


gaa 


gtg 


gtg 


gca 


gag 


309 


Gly Arg 


Gin 


Ala 


Glu 


Leu 


Cys Gin Ala Glu 


Pro 


Glu 


Val 


Val 


Ala 


Glu 




25 










30 




35 










40 




ctg 


get 


egg 


ggc 


gcc 


egg 


etc ggg gtg ega 


rag 












342 


Leu 


Ala 


Arg Gly 


Ala Arg 


Leu Gly Val Arg 


Xaa 






















45 




50 

















<210> 566 
<211> 434 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 242.. 433 

<220> 

<221> sig_peptide 

<222> 242. .307 

<223> Von Heijne matrix 

score 10.1999998092651 

seq LLALAALLQGAVS/LK 

<400> 566 

acatttgccc cagggaaggt cacagctgee tgaactbbtc aaaactceca gaeacgeact 60 
geetgtgcag gatceggagc ecagcageac tgeeagggcc ttgaagtgct tattcagaga 120 
cctttcttca tagactactt ttttttcttt aagcagcaaa aggagaaaat tgtcatcaaa 180 
ggatattcca gattcttgac agcattctcg tcatctctga ggacatcacc atcatctcag 240 
g atg agg ggc atg aag ctg ctg ggg gcg ctg ctg gca ctg gcg gcc eta 289 
Met Arg Gly Met Lys Leu Leu Gly Ala Leu Leu Ala Leu Ala Ala Leu 
-20 -15 -10 

ctg cag ggg gee gtg tec ctg aag ate gca gee ttc aac ate cag aca 337 
Leu Gin Gly Ala Val Ser Leu Lys He Ala Ala Phe Asn He Gin Thr 

-5 15 10 

ttt ggg gag acc aag atg tec aat gee ace etc gtc age tac att gtg 385 
Phe Gly Glu Thr Lys Met Ser Asn Ala Thr Leu Val Ser Tyr He Val 
15 20 25 
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305 

cag ate ctg age cgc tat gac ate gee ctg gte cag gag gtc aga gac a 434 
Gin lie Leu Ser Arg Tyr Asp lie Ala Leu Val Gin Glu Val Arg Asp 
30 35 40 

<210> 567 
<211> 487 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 136.. 486 

<220> 

<221> sig^peptide 
<222> 136.. 234 
<223> Von Heijne matrix 
score 8.5 

seq FFFVLGSLIFCFG/IW 
<400> 567 

ctcttttggg gttcttcctt tctctctcag ctctcegtct ctctttctct ctcagcctct 60 
ttctttctcc ctgtctccec cactgtcagc acctettctg tgtggtgagt ggaccgctta 120 
ccccactagg tgaag atg tea gee cag gag age tge etc age etc ate aag 171 
Met Ser Ala Gin Glu Ser Cys Leu Ser Leu lie Lys 
-30 -25 
tac ttc etc ttc gtt ttc aac etc ttc ttc ttc gtc etc ggc age ctg 219 
Tyr Phe Leu Phe Val Phe Asn Leu Phe Phe Phe Val Leu Gly Ser Leu 

-20 -15 -10 

ate ttc tge ttc ggc ate tgg ate etc att gac aag ace age ttc gtg 267 
lie Phe Cys Phe Gly He Trp He Leu He Asp Lys Thr Ser Phe Val 
-5 15 10 

tec ttt gtg ggc ttg gee ttc gtg cct ctg cag ate tgg tec aaa gtc 315 
Ser Phe Val Gly Leu Ala Phe Val Pro Leu Gin He Trp Ser Lys Val 

15 20 25 

ctg gee ate tea gga ate ttc acc atg ggc ate gee etc ctg ggt tgt 363 
Leu Ala He Ser Gly He Phe Thr Met Gly He Ala Leu Leu Gly Cys 

30 35 40 

gtg ggg gee tea agg age tec get gee tec tgg gee tgt att ttg gga 411 
Val Gly Ala Ser Arg Ser Ser Ala Ala Ser Trp Ala Cys He Leu Gly 

45 50 55 

tge tge tge tee tgt ttg eca cac aga tea cec tgg gaa tec tea tct 459 
Cys Cys Cys Ser Cys Leu Pro His Arg Ser Pro Trp Glu Ser Ser Ser 
60 65 70 75 

cca etc age ggg eca get gra geg aag c 487 
Pro Leu Ser Gly Pro Ala Xaa Ala Lys 
80 

<210> 568 
<211> 478 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 233. .478 

<220> 

<221> sig_peptide 

<222> 233.. 328 

<223> Von Heijne matrix 

score 8.19999980926514 

seq LVAAMVLLSWFC/LY 
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306 



<400> 568 

atgaatgtca gaataggccc attctaccag ctctggctga gcctgagctt ccaaaagtga 60 

gctgagctgt tcaaccttgg atcttaatta ctcctagcag ggataattag gtccctcttt 120 

ctcagattac aggtttgaga agttccaaat atccgatgcc ctagaggttt ttttccctta 180 

attttattgt tacaattggc cttcatctag ctctggatgg ttgaactgta gc atg gca 238 

Met Ala 



aag 


atg 


ttt gat 


etc 


agg 


a eg 


aag 


ate 


atg 


ate 


ggc ate 


gga 


age age 


286 


Lys 


Met 


Phe Asp 


Leu 


Arg 


Thr 


Lys 


lie 


Met 


lie 


Gly He 


Gly 


Ser Ser 




-30 








-25 










-20 






-15 




tta 


ctg 


gtt gcc 


gcg 


atg 


gtg 


etc 


eta 


agt 


gtt 


gtg ttc 


tgt 


ett tac 


334 


Leu 


Leu 


Val Ala 


Ala 
-10 


Met 


Val 


Leu 


Leu 


Ser 
-5 


Val 


Val Phe 


Cys 


Leu Tyr 
1 




ttc 


aaa 


gta get 


aag 


gca 


eta 


aaa 


get 


gca 


aag 


gae cct 


gat 


get gtg 


382 


Phe 


Lys 


Val Ala 


Lys Ala 


Leu 


Lys 


Ala 


Ala 


Lys 


Asp Pro Asp Ala Val 








5 








10 








15 








get 


gta 


aaa aat 


eac 


aac 


cea 


gae 


aag 


gtg 


tgt 


tgg gee 


aeg 


aac age 


430 


Ala 


Val 


Lys Asn 


His 


Asn 


Pro Asp 


Lys 


Val 


Cys 


Trp Ala 


Thr 


Asn Ser 






20 








25 










30 








cak 


gcc 


aaa gcc 


acc 


acc 


atg 


gag 


tct 


tgt 


cca 


tct etc 


cag 


tgc tgt 


478 


Xaa 


Ala 


Lys Ala 


Thr 


Thr 


Met 


Glu 


Ser Cys 


Pro 


Ser Leu 


Gin 


Cys Cys 




35 








40 










45 






50 





<210> 569 
<211> 469 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 290.. 469 



<220> 

<221> sig_peptide 
<222> 290.. 346 
<223> Von Heijne matrix 
score 7.5 

seq LAVFFLVSTQLFA/EE 



<400> 569 

agatgttatg ggcatcgacg agttaccgtc tcggaaactc tcaatcacgc aagcgaaagg 60 

agaggaggcg gctaattaaa tattgagcag aaagtcgcgt ggggagaatg tcacgtgggt 120 

ctggaggctc aaggaggctg ggataaatae egeaagcact gagcaggega aagagcgcgc 180 

teggacctcc ttceeggegg cagctacega gagtgeggas gaecagegtg egeteggagg 240 

aaccagagaa actcagcacc ccgcgggact gtccgtcgca aaatccaac atg aaa ate 298 

Met Lys He 



etc 


gtg 


gee 


ttg 


gea 


gte 


ttt 


ttt 


ett 


gte 


tee 


act 


eag 


ctg 


ttt 


gea 


34 6 


Leu 


Val 
-15 


Ala 


Leu 


Ala 


Val 


Phe 
-10 


Phe 


Leu 


Val 


Ser 


Thr 
-5 


Gin 


Leu 


Phe 


Ala 




gaa 


gaa 


ata 


gga 


gcc 


aat 


gat 


gat 


ctg 


aat 


tac 


tgg 


tec 


gae 


tgg 


tac 


394 


Glu 


Glu 


He 


Gly Ala Asn 


Asp Asp 


Leu 


Asn 


Tyr 


Trp 


Ser 


Asp 


Trp 


Tyr 




1 








5 










10 










15 






gae 


age 


gae 


cag 


ate 


aag 


gag 


gaa 


ctg 


ccg 


gag 


ecc 


ttt 


gag 


eat 


ett 


442 


Asp 


Ser 


Asp Gin 


He 


Lys 


Glu 


Glu 


Leu 


Pro 


Glu 


Pro 


Phe 


Glu 


His 


Leu 










20 










25 










30 








ctg 


cag 


aga 


ate 


gee 


egg 


aga 


ecc 


aag 
















469 


Leu 


Gin 


Arg 


He 


Ala 


Arg 


Arg 


Pro 


Lys 



















35 40 



<210> 570 
<211> 489 
<212> DNA 
<213> Homo sapiens 



wo 02/083898 



307 



PCT/IBOl/00914 



<220> 
<221> CDS 
<222> 253.. 489 



<220> 

<221> sig_peptide 

<222> 253,. 336 

<223> Von Heijne matrix 

score 7.40000009536743 
seq CLLLNLAPAPLNA/DS 



<400> 570 

ctctttgctc 

ctctttcact 

tcttgatctg 

ggaaaaacaa 

gattttactt 



tct gta tgc 
Ser Val Cys 
-15 

tct gag gaa 
Ser Glu Glu 

cca ctg aaa 
Pro Leu Lys 

20 

cca tgt aaa 
Pro Cys Lys 

35 
agt gcg 
Ser Ala 
50 



taacagacag cagcgacttt aggctggata atagtcaaat tcttacctcg 
gctagtaaga tcagattgcg tttctttcag ttactcttca atcgccagtt 
cttctaaaag aagaagtaga gaagataaat cctgtcttca atacctggaa 
aataacctca actccgtttt gaaaaaaaca ttccaagaac tttcatcaga 
ag atg att tac aca atg aag aaa gta cat gca ctt tgg get 
Met lie Tyr Thr Met Lys Lys Val His Ala Leu Trp Ala 
-25 -20 
ctg ctg ctt aat ctt gcc cct gcc cct ctt aat get gat 
Leu Leu Leu Asn Leu Ala Pro Ala Pro Leu Asn Ala Asp 

-10 -5 1 

gat gaa gaa cac aca att ate aca gat acg gag ttg cca 
Asp Glu Glu His Thr He He Thr Asp Thr Glu Leu Pro 

15 

gca ttc aag gcg gat gat ggc 



ctt atg cat tea 



10 
ttt tgt 



Leu Met His Ser Phe Cys Ala Phe Lys Ala Asp Asp Gly 

25 30 
gca ate atg aaa aga ttt tct tea ata ttt tea etc gae 
Ala He Met Lys Arg Phe Ser Ser He Phe Ser Leu Asp 



40 



45 



60 
120 
180 
240 
291 



339 



387 



435 



483 



489 



<210> 571 
<211> 484 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 256. .483 



<220> 

<221> sig_peptide 

<222> 256.. 327 

<223> Von Heijne matrix 

score 7.09999990463257 
seq FLFLFNLIFFILG/AV 



<400> 571 

ggggcggggc ctgccgagtc cgcggcgttc cccggctgca gcegggaggg ggccgaggag 60 
tgactgagcc ccgggctgtg cagtccgaeg ccgactgagg caegagcggg tgacgetggg 120 
cctgcagcgc ggagcagaaa geagaacccg cagagtcctc cctgetgctg tgtggaegae 180 
aegtgggcac aggcagaagt gggccctgtg accagctgea etggtttcgt ggaaggaagc 240 
tceaggaetg gcggg atg ggc tea gcc tgt ate aaa gtc ace aaa tac ttt 291 
Met Gly Ser Ala Cys He Lys Val Thr Lys Tyr Phe 
-20 -15 
etc ttc etc ttc aac ttg ate ttc ttt ate ctg ggc gca gtg ate ctg 339 
Leu Phe Leu Phe Asn Leu He Phe Phe He Leu Gly Ala Val He Leu 
-10 -5 1 



wo 02/083898 PCT/IBO 1/009 14 

308 

ggc ttc ggg gtg tgg ate ctg gcc gac aag age agt ttc ate tct gtc 387 

Gly Phe Gly Val Trp He Leu Ala Asp Lys Ser Ser Phe He Ser Val 

5 10 15 20 

ctg caa ace tec tec age teg ett agg atg ggg gcc tat gtc ttc ate 435 

Leu Gin Thr Ser Ser Ser Ser Leu Arg Met Gly Ala Tyr Val Phe He 

25 30 35 

ggc gtg ggg gca gtc act atg etc atg ggc ttc ctg ggc tgc ate ggc g 484 
Gly Val Gly Ala Val Thr Met Leu Met Gly Phe Leu Gly Cys He Gly 
40 45 50 

<210> 572 
<211> 482 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 29. .481 

<220> 

<221> sig_peptide 
<222> 29. .82 
<223> Von Heijne matrix 
score 7 

seq VLGLVLLSVTVQG/KV 
<400> 572 

agcctagcac tctgacetag cagtcaac atg aag get etc att gtt ctg ggg 52 

Met Lys Ala Leu He Val Leu Gly 

























-15 












ett 


gtc 


etc 


ett 


tct 


gtt 


acg 


gtc 


cag 


ggc 


aag 


gtc 


ttt 


gaa 


agg 


tgt 


100 


Leu 


Val 


Leu 


Leu 


Ser 


Val 


Thr 


Val 


Gin 


Gly 


Lys 


Val 


Phe 


Glu 


Arg 


Cys 




-10 










-5 










1 








5 






gag 


ttg 


gee 


aga 


act 


ctg 


aaa 


aga 


ttg 


gga 


atg 


gat 


ggc 


tac 


agg 


gga 


148 


Glu 


Leu 


Ala 


Arg 
10 


Thr 


Leu 


Lys 


Arg 


Leu 
15 


Gly 


Met 


Asp 


Gly 


Tyr 
20 


Arg 


Gly 




ate 


age 


eta 


gca 


aac 


tgg 


atg 


tgt 


ttg 


gee 


aaa 


tgg 


gag 


agt 


ggt 


tac 


196 


He 


Ser 


Leu 
25 


Ala 


Asn 


Trp 


Met 


Cys 
30 


Leu 


Ala 


Lys 


Trp 


Glu 
35 


Ser 


Gly 


Tyr 




aac 


aca 


cga 


get 


aca 


aac 


tac 


aat 


get 


gga 


gac 


aga 


age 


act 


gat 


tat 


244 


Asn 


Thr 


Arg Ala 


Thr 


Asn 


Tyr 


Asn 


Ala 


Gly 


Asp 


Arg 


Ser 


Thr 


Asp 


Tyr 






40 










45 










50 












ggg 


ata 


ttt 


cag 


ate 


aat 


age 


ege 


tac 


tgg 


tgt 


aat 


gat 


ggc 


aaa 


acc 


292 


Gly 


He 


Phe 


Gin 


He 


Asn 


Ser 


Arg 


Tyr 


Trp 


Cys 


Asn 


Asp 


Gly 


Lys 


Thr 




55 










60 










65 










70 




cca 


gga 


gca 


gtt 


aat 


gcc 


tgt 


eat 


tta 


tec 


tgc 


agt 


get 


ttg 


ctg 


caa 


340 


Pro Gly 


Ala 


Val 


Asn 


Ala 


Cys 


His 


Leu 


Ser 


Cys 


Ser 


Ala 


Leu 


Leu 


Gin 












75 










80 










85 






gat 


aac 


ate 


get 


gat 


get 


gta 


get 


tgt 


gca 


aag 


agg 


gtt 


gtc 


egt 


gat 


388 


Asp Asn 


He Ala Asp Ala 


Val 


Ala 


Cys 


Ala 


Lys 


Arg 


Val 


Val 


Arg 


Asp 










90 










95 










100 








cca 


cec 


ace 


teg 


gcc 


tec 


caa 


agt 


get 


ggg 


att 


aca 


ggc 


gtg 


ase 


aac 


436 


Pro 


Pro 


Thr 
105 


Ser 


Ala 


Ser 


Gin 


Ser 
110 


Ala 


Gly 


He 


Thr 


Gly 
115 


Val 


Xaa 


Asn 




tgc 


gee 


egg 


cca 


cat 


tea 


gtt 


ett 


ate 


aaa 


gaa 


ata 


ace 


cag 


act 


t 


482 


Cys 


Ala 
120 


Arg 


Pro 


His 


Ser 


Val 
125 


Leu 


He 


Lys 


Glu 


He 
130 


Thr 


Gin 


Thr 







<210> 573 
<211> 541 
<212>DNA 
<213> Homo sapiens 



wo 02/083898 



PCT/IBOl/00914 



309 

<220> 
<221> CDS 
<222> 294. .539 

<220> 

<221> sig_peptide 
<222> 294. ,380 
<223> Von Heijne matrix 
score 7 

seq LLMLFHTMAQIMA/EQ 
<400> 573 

ataactatgt gacattactt agggcataat tctaaaattt ctgtcctagg agacactaga 60 
taactctcaa aaacttctga ttttcttgca tccctttttg gcagagtggt cacagtataa 120 
tgcgggaggc cctcctcccc cgaatcagca acaatttttg ctccaccgtt ttctttcggg 180 
aacagatgcg ttttcagaag acagtactcc agtgataagc ggcgactttg agggattccc 240 
tctctggcgg cctctgcagc agcacagccg gcctcattcg gggcactgcg agt atg 296 

Met 

gat etc caa gga aga ggg gtc ccc age ate gac aga ctt cga gtt etc 344 
Asp Leu Gin Gly Arg Gly Val Pro Ser lie Asp Arg Leu Arg Val Leu 

-25 -20 -15 

ctg atg ttg ttc cat aca atg get caa ate atg gca gaa caa gaa gtg 392 
Leu Met Leu Phe His Thr Met Ala Gin lie Met Ala Glu Gin Glu Val 

-10 -5 1 

gaa aat etc tea ggc ctt tec act aac ect gaa aaa gat ata ttt gtg 4 40 

Glu Asn Leu Ser Gly Leu Ser Thr Asn Pro Glu Lys Asp lie Phe Val 
5 10 15 20 

gtg egg gaa aat ggg acg acc tgt etc atg gca gag ttt gca gee aaa 488 
Val Arg Glu Asn Gly Thr Thr Cys Leu Met Ala Glu Phe Ala Ala Lys 

25 30 35 

ttt att gta ect tat gat gtg tgg gee age aac tac gta gat ctg ate 536 
Phe lie Val Pro Tyr Asp Val Trp Ala Ser Asn Tyr Val Asp Leu lie 

40 45 50 

aca ga 541 
Thr 

<210> 574 
<211> 453 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 274. .453 

<220> 

<221> sig_peptide 

<222> 274.. 345 

<223> Von Heijne matrix 

score 6.59999990463257 

seq KMVHLLVLSGAWG/MQ 

<400> 574 

attattggtt gggggaaacc aegaggggac geggccgagg agggtcgctg tccacccggg 60 
ggcgtgggag tgaggyacca gattcagccc atttggcccc gacgcctctg ttctcggaat 120 
ecgggtgctg cggattgagg tcccggttcc taacgaatet ctgctggatt ggccgtaacc 180 
ctgtccccga gcgggctcac agggtctgaa ggecacgcay gaggcaaagg yaaagyycyg 240 
agecaccegg tgchtestte ccaggasege aag atg gag gaa ggc ggg aac eta 294 

Met Glu Glu Gly Gly Asn Leu 
-20 

gga ggc ctg att aag atg gtc cat eta ctg gtc ttg tea ggt gcc tgg 342 
Gly Gly Leu lie Lys Met Val His Leu Leu Val Leu Ser Gly Ala Trp 
-15 -10 -5 



wo 02/083898 



ggc 


atg 


caa 


atg 


tgg 


gtg 


Gly Met 


Gin 


Met 


Trp Val 




1 








5 


age 


ctt 


ccc 


cga 


cat 


acc 


Ser 


Leu 


Pro 


Arg 


His 


Thr 










20 




ttc 


tac 


ttc 


cac 


ate 




Phe 


Tyr 


Phe 


His 


He 










35 







310 

acc ttc gtc tea ggc ttc 
Thr Phe Val Ser Gly Phe 
10 

ttc gga eta gtg cag age 
Phe Gly Leu Val Gin Ser 
25 



PCT/reOl/00914 

ctg ctt ttc cga 390 
Leu Leu Phe Arg 
15 

aaa etc ttc ecc 438 
Lys Leu Phe Pro 
30 

453 



<210> 575 
<211> 497 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 70. .495 



<220> 

<221> sig^peptide 

<222> 70. . 147 

<223> Von Heijne matrix 

score 6.09999990463257 
seq IGLASLFTENVLA/HP 



<400> 575 

ataggatgaa accatggaag tcttsctsag atctggggag ggagcctggc ccaagagccc 60 

gtggagcta atg aaa aca gaa cag ctg aat aga ttt get gga ttt ggt att 111 

Met Lys Thr Glu Gin Leu Asn Arg Phe Ala Gly Phe Gly He 
-25 -20 -15 



gga 


ctt 


gea 


agt 


etc 


ttt 


aca 


gaa 


aat 


gta 


ttg 


gea 


cat 


cet 


tge 


att 


159 


Gly Leu 


Ala 


Ser 


Leu 


Phe 


Thr 


Glu 


Asn 


Val 


Leu 


Ala 


His 


Pro 


Cys 


He 








-10 










-5 










1 










gtt 


eta 


egc 


egc 


caa 


tgt 


cag 


gtt 


aat 


tac 


cat 


get 


cag 


cat 


tac 


cat 


207 


Val 


Leu. 


Arg 


Arg 


Gin 


Cys 


Gin 


Val 


Asn 


Tyr 


His 


Ala 


Gin 


His 


Tyr 


His 




5 










10 










15 










20 




etc 


act 


cea 


ttt 


aca 


gtc 


ate 


aat 


att 


atg 


tac 


agt 


ttc 


aac 


aaa 


act 


255 


Leu 


Thr 


Pro 


Phe 


Thr 
25 


Val 


He 


Asn 


He 


Met 
30 


Tyr 


Ser 


Phe 


Asn 


Lys 
35 


Thr 




cag 


gga 


cet 


aga 


gee 


ctg 


tgg 


aaa 


gga 


atg 


rga 


agt 


aca 


ttt 


att 


gtc 


303 


Gin Gly 


Pro 


Arg Ala 


Leu 


Trp 


Lys 


Gly 


Met 


Xaa 


Ser 


Thr 


Phe 


He 


Val 










40 










45 










50 








cag 


gga 


gtc 


aca 


ctt 


gga 


gea 


gaa 


ggc 


ata 


att 


agt 


gaa 


ttt 


aca 


cet 


351 


Gin Gly 


Val 


Thr 


Leu 


Gly 


Ala 


Glu 


Gly 


He 


He 


Ser 


Glu 


Phe 


Thr 


Pro 








55 










60 










65 










ttg 


cea 


agg 


gag 


gtt 


tta 


eat 


aaa 


tgg 


agt 


cet 


aaa 


caa 


ata 


gga 


gaa 


399 


Leu 


Pro 
70 


Arg 


Glu 


Val 


Leu 


His 
75 


Lys 


Trp 


Ser 


Pro 


Lys 
80 


Gin 


He 


Gly 


Glu 




cac 


ctt 


eta 


ctg 


aaa 


tec 


eta 


act 


tac 


gtg 


gtg 


gea 


atg 


ctt 


ttt 


tat 


447 


His 


Leu 


Leu 


Leu 


Lys 


Ser 


Leu 


Thr 


Tyr 


Val 


Val 


Ala 


Met 


Leu 


Phe 


Tyr 




85 










90 










95 










100 




tea 


gea 


agt 


ctg 


att 


gaa 


aca 


gtg 


cag 


agt 


gag 


ata 


att 


cga 


gat 


aat 


495 


Ser 


Ala 


Ser 


Leu 


He 
105 


Glu 


Thr 


Val 


Gin 


Ser 
110 


Glu 


He 


He 


Arg 


Asp 
115 


Asn 





4 97 



<210> 576 
<211> 467 
<212> DNA 
<213> Homo sapiens 



<220> 



wo 02/083898 PCT/IBO 1/009 14 

311 

<221> CDS 
<222> 88. .465 

<220> 

<221> sig_peptide 

<222> 88. .219 

<223> Von Heijne matrix 

score 6.09999990463257 

seq ACCLVMVISCVFC/MG 

<400> 576 

tcttacacca acctgctcca aaccacaaga ggagttactt gttccagcct cctgtgtgga 60 
ctgctttcct atcaaagcac cttagac atg cac gag gaa gaa ata tac acc tct 114 

Met His Glu Glu Glu He Tyr Thr Ser 
-40 



ctt 


cag 


tgg 


gat 


age 


cca 


gca 


cca 


gac 


act 


tac 


cag 


aaa 


tgt 


ctg 


tct 


162 


Leu 


Gin 


Trp 


Asp 


Ser 


Pro 


Ala 


Pro 


Asp Thr 


Tyr Gin 


Lys 


Cys 


Leu 


Ser 




-35 










-30 










-25 










-20 




tec 


aac 


aaa 


tgt 


tea 


gga 


gca 


tgc 


tgt 


ctt 


gtg atg 


gtg 


att 


tea 


tgt 


210 


Ser 


Asn 


Lys 


Cys 


Ser 


Gly Ala 


Cys 


Cys 


Leu 


Val 


Met 


Val 


He 


Ser 


Cys 












-15 










-10 










-5 






gtt 


ttc 


tgc 


atg 


gga 


tta 


tta 


acg 


gca 


tec 


att 


ttc 


ttg 


ggc 


gtc 


aag 


258 


Val 


Phe 


Cys 


Met 


Gly 


Leu 


Leu 


Thr 


Ala 


Ser 


He 


Phe 


Leu Gly 


Val 


Lys 










1 








5 










10 










ttg 


ttg 


cag 


gtg 


tec 


acc 


att 


gcg 


atg 


cag 


cag 


caa 


gaa 


aaa 


etc 


ate 


306 


Leu 


Leu 

15 


Gin 


Val 


Ser 


Thr 


He 
20 


Ala 


Met 


Gin 


Gin 


Gin 

25 


Glu 


Lys 


Leu 


He 




caa 


caa 


gag 


agg 


gca 


ctg 


eta 


aac 


ttt 


aca 


gaa 


tgg 


aag 


aga 


age 


tgt 


354 


Gin 


Gin 


Glu 


Arg 


Ala 


Leu 


Leu 


Asn 


Phe 


Thr 


Glu Trp 


Lys Arg 


Ser 


Cys 




30 










35 










40 










45 




gcc 


ctt 


cag 


atg 


aaa 


tat 


tgc 


caa 


gcc 


ttc 


atg 


caa 


aac 


tea 


tta 


agt 


402 


Ala 


Leu 


Gin 


Met 


Lys 
50 


Tyr 


Cys 


Gin 


Ala 


Phe 
55 


Met 


Gin 


Asn 


Ser 


Leu 
60 


Ser 




tea 


gcc 


cat 


aac 


age 


agt 


cct 


tgt 


cca 


aac 


aat 


tgg 


att 


cag 


aac 


aga 


450 


Ser 


Ala 


His 


Asn 
65 


Ser 


Ser 


Pro 


Cys 


Pro 
70 


Asn 


Asn 


Trp 


He 


Gin 
75 


Asn 


Arg 




gaa 


agt 


tgt 


tac 


tat 


gt 






















4 67 


Glu 


Ser 


Cys 


Tyr 


Tyr 



























80 

<210> 577 
<211> 485 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 328.. 483 

<220> 

<221> sig_peptide 

<222> 328.. 387 

<223> Von Heijne matrix 

score 5.90000009536743 

seq GWMAAGLMIGAGA/CY 

<400> 577 

ctagageaca agtcctgeta gtcgectccg tetgggtace agccccctat tactctgcag 60 

gcgtgtgaag aaagaaggaa actagctcgg accgtgcagg tttgtaggtc tgttggectg 120 

taggtttcgg cacaagttte agcgagagaa ggagaaaact gccttggttg gaaccttgca 180 

gatteatcae aagagagcta caagagectg gaagaagctg aagactgcta ccctccatcc 240 

ttacteaecc tggacctgag agacctcttc aatcaggtgg agcaaggece tctectgtcc 300 

tgccccaagg ctggcacaga cttgagc atg ggc egg get egg gaa gtg ggt tgg 354 



wo 02/083898 PCT/IBO 1/009 14 

312 

Met Gly Arg Ala Arg Glu Val Gly Trp 
-20 -15 
atg gcg gca gga ctg atg att ggg get ggt gcc tgc tac tgc gtt tac 402 
Met Ala Ala Gly Leu Met He Gly Ala Gly Ala Cys Tyr Cys Val Tyr 

-10 -5 15 

aaa ctg acc ata gga aga gat gac agt gag aag ctg gag gag gag ggg 4 50 

Lys Leu Thr He Gly Arg Asp Asp Ser Glu Lys Leu Glu Glu Glu Gly 

10 15 20 

gaa gag gag tgg gac gat gac cag gag ctg gat ga 485 
Glu Glu Glu Trp Asp Asp Asp Gin Glu Leu Asp 
25 30 



<210> 578 
<211> 502 
<212>. DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 39. .500 

<220> 

<221> sig_peptide 

<222> 39. .83 

<223> Von Heijne matrix 

score 5.90000009536743 

seq ILMLTFIICGLLT/RV 

<400> 578 

attcctcagg acacagagct tcctctctcc caggagcc atg aat ate ctg atg ctg 56 

Met Asn lie Leu Met Leu 

-15 -10 
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ttc 


att 


ate 


tgt 


ggg 


ttg 


eta 


act 


egg 


gtg 


acc 


aaa 


ggt 


age 


ttt 
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Leu 


Leu 


Thr 
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Lys 
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Ser 
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ccc 


caa 
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tgg 


aag 


aat 
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gta 


gga 


cat 


tgc 


aga 


aga 


ega 
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Cys 


Arg 


Asn 


Lys 


Leu 


Ser 






25 










30 










35 












tgc 


tgc 


att 


tct 


ata 


ata 


tea 


cat 


gaa 


tat 


act 


ega 
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Cys 
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296 


Pro 
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He 


His 


Leu 


Glu 


Asp 


He 


Thr 


Leu 


Asp 


Tyr 


Ser 


Asp 


Val 


Asp 
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ttt 
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gta 
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cct 
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Asp 
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Met 
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Pro 
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Pro 






90 
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440 


Glu 


Thr 
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Met 


Pro 
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Glu 


Ala 


Thr 


Thr 


Pro 


Glu 


Thr 


Thr 


Met 
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<210> 579 



wo 02/083898 

<211> 754 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 259. .753 

<220> 

<221> sig_peptide 
<222> 259.. 375 
<223> Von Heijne matrix 

score 5.80000019073486 
seq FCVCVIAIGVVQA/LI 

<400> 579 

aagctcccgg ccgggctgac tcaagcggag gcgcgcggaa cagtcgccga ggcgattccc 60 

gcccagcagt tcgacagaag tgtacagagg cttctggcaa cacggattgc cgtctacctg 120 

atgacctttc tcatcgtgac agtggcctgg gcagcacaca caaggttgtt ccaagttgtt 180 

gggaaaacag acgacacact tgccctgctc aacctggccg catcatggct gtgatgccct 240 



ccctccctcc aggcctgc 


atg 


atg 
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ate 


acc 


ttc 


ctg cct 


tac 


aeg 


ttt 
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Thr 
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Leu Pro 


Tyr 


Thr 
-30 


Phe 




teg 


tta 


ata 


Qta 


acc 


ttc 
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ggc ate 
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Met 
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Pro Asp Val 
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Leu 


Gly He 


Phe 


Leu 
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He Gly Val 
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<210> 580 
<211> 488 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 54. .488 



PCT/IBOl/00914 

313 



wo 02/083898 



PCT/IBOl/00914 



314 

<220> 

<221> sig_peptide 

<222> 54 . .119 

<223> Von Heijne matrix 

score 5.80000019073486 

seq PVTLTSLLPMVTX/GP 

<220> 

<221> misc_feature 
<222> 181,256,369 
<223> n=a, g, c or t 



<400> 580 

gtctttttaa tctctgcagt cccctgagtg tcacccaacc tgcaggcagt gtc atg 56 

Met 



ggc 


cgt 


too 


agt 


ctg 


act 


gca 


tct 
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Thr 
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cct 
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Gin 


Arg 


Glu 
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Pro 
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Val 


Thr 


Pro 
90 
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<210> 581 
<211> 343 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 17.. 343 

<220> 

<221> sig_peptide 

<222> 17. .106 

<223> Von Heijne matrix 

score 5.69999980926514 

seq LGVNLVVMPPTRA/RS 

<400> 581 

atagattaca aeggtg atg gcg ggc aag egg tec ggc tgg age egg gcg get 52 

Met Ala Gly Lys Arg Ser Gly Trp Ser Arg Ala Ala 

-30 -25 -20 
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<210> 582 
<211> 465 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 293. .463 

<220> 

<221> sig_peptide 

<222> 293.. 388 

<223> Von Heijne matrix 

score 5.59999990463257 

seq WMGLACFRSLAAS/SP 

<400> 582 

aaaagattga agaggaactc tttcaaacct ttcaagtgcc ccctcctttc cttaaagtct 60 

tttatagggg tccccttctt ggccatctcc atcctgtgag tcaggactga aagggcacag 120 

acaggtcact gecageattg ttggggeaag cetgeaagea cgcatcactg gggatctgac 180 

atgacaatgg eegectgccc cctctgaggg ctaeaggact taccccagtg ggaagcagct 240 

aagcaggtct gaccagccga cctggacctg gccaagggtc ctgtcatccc tc atg gcc 298 
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<210> 583 
<211> 539 
<212> DNA 
<213> Homo sapiens 



<220> 



wo 02/083898 PCT/IBO 1/009 14 

316 

<221> CDS 
<222> 302.. 538 

<220> 

<221> sig^eptide 

<222> 302.. 358 

<223> Von Heijne matrix 

score 5.40000009536743 

seq LCCSGCVPSLCCS/SY 

<400> 583 

aaaaacttcc gccgccgcgt ccgccgcctc cggaactaaa cggggtgagg tcacattcgg 60 

ttatctctaa cgttggaaaa cgatggagct aacacccatt atggagatta accacttttc 120 

atcaggtttt taacttaagt cgtgaggaat acaacggtga acacaagatt cattttattt 180 

tcatcaccat gggacgtatc ctgttgttga gtwctctggg tcagacctct gaagacttct 240 

cagatggatc ctagtctctg ggcttgcctg aaattactcg ctgctcaggg agagagttga 300 

a atg gtt ggc ate etc eca etc tgt tgc tec ggc tgt gtc ccc teg etc 349 

Met Val Gly lie Leu Pro Leu Cys Cys Ser Gly Cys Val Pro Ser Leu 
-15 -10 -5 
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<210> 584 
<211> 526 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 85.. 525 

<220> 

<221> sig_peptide 

<222> 85, .144 

<223> Von Heijne matrix 

score 5.19999980926514 

seq SPVFLVFPPEITA/SE 

<400> 584 

cctartcgaa aaaeaagaaa agaaeatggt ctagactgaa gtaeeaaeta aatcatetcc 60 
tttcaaatta teacegacae eate atg gat tea age ace gea eae agt ecg 111 

Met Asp Ser Ser Thr Ala His Ser Pro 
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ttt ctt 


agt 


gcc 


ctg 


gga 


gca 


ata 


get 


gga 


ate 


att 


etc 


etc 


aca 


ttt 


495 


Phe Leu 


Ser 


Ala 


Leu 


Gly Ala 


He 


Ala 


Gly 


He 


He 


Leu 


Leu 


Thr 


Phe 








105 










110 










115 








ggg ttt 


cat 


cct 


aga 


tea 


aaa 


eta 


cat 


ttg 


t 












526 


Gly Phe 


His 


Pro 


Arg 


Ser 


Lys 


Leu 


His 


Leu 

















120 125 



<210> 585 

<211> 482 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 195. .482 



<220> 

<221> sig_peptide 

<222> 195. .293 

<223> Von Heijne matrix 

score 5.09999990463257 

seq LYALVQLGQPCDC/LP 

<400> 585 

gcacagatga ccggaagcgt gccctggcgg cgctgccagc cccagctggg agctggtagg 60 

ggagtgcgag gaaggcggga cgcagggegg agccgctgca ggggcggggc ctgcggacag 120 

gagceggggt ttggggcggg aacccctcgt cccctgcaga ccccgcctgc tcgggcgcgg 180 



gcggcggcgc ggcc 


atg 


aag 


ctg 


aag 


ctg 


aag aac 


gtg 


ttt 


etc 


gcc 


tac 


230 










Met 


Lys 


Leu 


Lys 
-30 


Leu 


Lys Asn 


Val 


Phe 

-25 


Leu 


Ala 


Tyr 




ttc 


ctg 


gtg 


teg 


ate 


gcc 


ggc 


etc 


etc 


tac gcg 


ctg 


gta 


cag 


etc 


ggc 


278 


Phe 


Leu 
-20 


Val 


Ser 


He 


Ala 


Gly 
-15 


Leu 


Leu 


Tyr Ala 


Leu 
-10 


Val 


Gin 


Leu 


Gly 




cag 


cca 


tgt 


gae 


tgc 


ctt 


cct 


ccc 


ctg 


egg gca 


gca 


gcs 


gag 


cag 


eta 


326 


Gin 


Pro 


Cys 


Asp 


Cys 


Leu 


Pro 


Pro Leu Arg Ala 


Ala 


Ala 


Glu 


Gin 


Leu 




-5 










1 








5 








10 






egg 


cag 


aag 


gat 


ctg 


agg 


att 


tec 


cag 


ctg caa 


gcg 


gaa 


etc 


cga 


egg 


374 


Arg 


Gin 


Lys 


Asp 


Leu 


Arg 


He 


Ser 


Gin 


Leu Gin 


Ala 


Glu 


Leu Arg Arg 










15 










20 








25 








cca 


ccc 


cct 


gcc 


cct 


gcc 


cag 


ccc 


cct 


gaa ccc 


gag 


gcc 


ctg 


cct 


act 


422 


Pro 


Pro 


Pro 
30 


Ala 


Pro 


Ala 


Gin 


Pro 
35 


Pro 


Glu Pro 


Glu 


Ala 
40 


Leu 


Pro 


Thr 




ate 


tat 


gtt 


gtt 


acc 


ccc 


acc 


tat 


gcc 


agg ctg 


gta 


cag 


aag 


gca 


gag 


470 


He 


Tyr, 


Val 


Val 


Thr 


Pro 


Thr Tyr Ala Arg Leu 


Val 


Gin 


Lys 


Ala 


Glu 






45 










50 








55 












ctg 


gta 


cga 


ctg 
























482 


Leu 


Val 


Arg 


Leu 



























60 



<210> 586 
<211> 543 
<212> DNA 



wo 02/083898 
<213> Homo sapiens 



318 



PCT/IBOl/00914 



<220> 
<221> CDS 
<222> 35.. 541 

<220> 

<221> sig_peptide 
<222> 35. .82 
<223> Von Heijne matrix 
score 5 

seq CGLFLRTTAAARA/CR 
<400> 586 

agactgaggc tgaggctggg gaacatcggg cage atg age gge tgc ggg etc ttc 55 

Met Ser Gly Cys Gly Leu Phe 
-15 -10 



ctg 


cgc 


acc 


acg 


get 


gcg 


get cgt 


gcc tgc 


egg 


ggt 


ctg 


gtg 


gtc 


tct 


103 


Leu Arg 


Thr 


Thr 


Ala Ala Ala Arg Ala Cys Arg Gly Leu Val 


Val 


Ser 




acc 


gcg 


aac 


egg 


egg 


eta 


ctg cgc 


acc age 


ccg 


cet 


gta 


ega 


get 


ttc 


151 


Thr 


Ala 


Asn 


Arg 


Arg Leu 


Leu Arg 


Thr Ser 


Pro 


Pro 


Val 


Arg 


Ala 


Phe 








10 








15 








20 










gcc 


aaa 


gag 


ett 


ttc 


eta 


gge aaa 


ate aag 


aag 


aaa 


gaa 


gtt. 


ttc 


cca 


199 


Ala 


Lys 


Glu 


Leu 


Phe 


Leu 


Gly Lys 


He Lys 


Lys 


Lys 


Glu 


Val 


Phe 


Pro 






.25 










30 






35 












ttt 


cca 


gaa 


gtt 


age 


caa 


gat gaa 


ett aat 


gaa 


ate 


aat 


cag 


ttc 


ttg 


247 


Phe 


Pro 


Glu 


Val 


Ser Gin Asp Glu 


Leu Asn 


Glu 


He 


Asn 


Gin 


Phe 


Leu 




40 










45 






50 










55 




gga 


ccc 


gtg 


gaa 


aaa 


ttc 


ttc act 


gaa gag 


gtg 


gac 


tee 


ega 


aaa 


att 


295 


Gly Pro 


Val 


Glu 


Lys 


Phe 


Phe Thr Glu Glu Val Asp Ser Arg 


Lys 


He 












60 






65 










70 






gac 


cag 


gaa 


ggg 


aaa 


ate 


cca gat 


gaa act 


ttg 


gag 


aaa 


ttg 


aag 


age 


343 


Asp Gin 


Glu Gly 


Lys 


He 


Pro Asp Glu Thr Leu Glu Lys 


Leu 


Lys 


Ser 










75 








80 








85 








eta 


ggg 


ett 


ttt 


ggg 


ctg 


caa gtc 


cca gaa 


gaa 


tat 


ggt 


gge 


ctg 


gge 


391 


Leu Gly 


Leu 


Phe 


Gly 


Leu 


Gin Val 


Pro Glu 


Glu 


Tyr 


Gly 


Gly 


Leu 


Gly 








90 








95 








100 










ttc 


tec 


aac 


acc 


atg 


tac 


tea ega 


eta ggg 


gag 


ate 


ate 


age 


atg 


gat 


439 


Phe 


Ser 


Asn 


Thr 


Met 


Tyr 


Ser Arg 


Leu Gly Glu 


He 


He 


Ser 


Met 


Asp 






105 










110 






115 












ggg 


tec 


ate 


act 


gtg 


acc 


ctg gca 


gcg cac 


cag 


get 


att 


gge 


tea 


agg 


487 


Gly 


Ser 


He 


Thr 


Val 


Thr 


Leu Ala 


Ala His 


Gin 


Ala 


He Gly 


Ser 


Arg 




120 










125 






130 










135 




gga 


tea 


tct 


tgg 


ctg 


gea 


ctg agg 


age aga 


aag 


cca 


aat 


aac 


ttg 


cet 


535 


Gly Ser 


Ser 


Trp 


Leu 


Ala 


Leu Arg 


Ser Arg 


Lys 


Pro 


Asn 


Asn 


Leu 


Pro 












140 






145 










150 






aaa 


ctg 


gc 
























543 



Lys Leu 

<210> 587 
<211> 473 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 64. .471 

<220> 

<221> sig_peptide 

<222> 64.. 171 

<223> Von Heijne matrix 



wo 02/083898 PCT/IBO 1/0091 4 

319 

score 4.80000019073486 
seq LLLLRLNDAALRA/LQ 

<400> 587 

aggaggtggc ggtggtggcc ctcgcctgtg gcccccgtgc tgcttgcact cgaactcgtc 60 



gcc 


atg 
Met 


gag 
Glu 
-35 


gag 
Glu 


etc 
Leu 


cag 
Gin 


gag 
Glu 


cct 
Pro 
-30 


ctg 
Leu 


aga gga gag 
Arg Gly Glu 


etc egg 
Leu Arg 
-25 


etc 
Leu 


tgc 
Cys 


108 


ttc 
Phe 


acg 
Thr 
-20 


caa 
Gin 


get 
Ala 


gcc 
Ala 


egg 
Arg 


act 
Thr 
-15 


age 
Ser 


etc 
Leu 


tta 
Leu 


ctg 
Leu 


etc 
Leu 
-10 


agg 
Arg 


etc 
Leu 


aac gac 
Asn Asp 


156 


got 
Ala 
-5 


gcc 
Ala 


ctg 
Leu 


egg 
Arg 


gcg 
Ala 


ctg 
Leu 

1 


caa 
Gin 


gag tgt 
Glu Cys 


cag 
Gin 
5 


egg 
Arg 


caa 
Gin 


cag 
Gin 


gta 
Val 


egg 
Arg 
10 


ccg 
Pro 


204 


gtg 
Val 


att 
He 


get 
Ala 


ttc 
Phe 
15 


caa 
Gin 


ggc cac 
Gly His 


ega 
Arg 


ggg 

Gly 
20 


tat 
Tyr 


ctg aga 
Leu Arg 


etc 
Leu 


cea 
Pro 
25 


ggc 
Gly 


cct 
Pro 


252 


ggt 
Gly 


tgg 
Trp 


tec 
Ser 
30 


tgc 
Cys 


etc 
Leu 


ttc 
Phe 


tec 
Ser 


ttc 
Phe 

35 


ata 
He 


gtg 
Val 


tec 
Ser 


cag 
Gin 


tgt 

Cys 
40 


tgt 
Cys 


cag 
Gin 


gag 
Glu 


300 


ggc 
Gly 


get 
Ala 
45 


ggt 
Gly 


ggt 
Gly 


age 
Ser 


ttg gac 
Leu Asp 
50 


ctt 
Leu 


gtg 
Val 


tgc 
Cys 


caa 
Gin 


cgc 
Arg 
55 


ttc 
Phe 


etc agg 
Leu Arg 


tet 
Ser 


348 


ggg 

Gly 
60 


cct 
Pro 


aac 
Asn 


age 
Ser 


etc 
Leu 


cac 
His 
65 


tgc 
Cys 


ctg 
Leu 


ggc 
Gly 


tea 
Ser 


etc agg 
Leu Arg 
70 


gag 
Glu 


cgc 
Arg 


etc 
Leu 


att 
He 
75 


396 


att 

He 


tgg 
Trp 


gca 
Ala 


gee 
Ala 


atg 
Met 
80 


gat 
Asp 


tet 
Ser 


ate 
He 


eca 
Pro 


gcc 
Ala 
85 


cea 
Pro 


tea 
Ser 


tea 
Ser 


gtt 
Val 


cag gga 
Gin Gly 
90 


444 


cac 
His 


aay 
Asn 


ctg 
Leu 


act 
Thr 
95 


gaa gat gee aga 
Glu Asp Ala Arg 


cat 
His 
100 


cc 














473 



<210> 588 
<211> 509 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 183.. 509 

<220> 

<221> sig_peptide 

<222> 183. .251 

<223> Von Heijne matrix 

score 4.59999990463257 

seq GHSLVGLALLCIA/AN 

<400> 588 

aggaceaatg aaagcaaagt aecteatccc ctcagtgact aagaatcgca gtatttaaga 60 

ggtagcagga atgggetgag agtggtgttt gctttctcca ccagaagggc acactttcat 120 

ctaatttggg gtatcaetga gctgaagaca aagagaaggg ggagaaaacc tagcagacca 180 
cc atg tgc tat ggg aag tgt gca cga tgc ate gga cat tet ctg gtg 227 
Met Cys Tyr Gly Lys Cys Ala Arg Cys He Gly His Ser Leu Val 
-20 -15 -10 

ggg etc gee etc ctg tgc ate gcg get aat att ttg ctt tac ttt cec 275 

Gly Leu Ala Leu Leu Cys He Ala Ala Asn He Leu Leu Tyr Phe Pro 

-5 1 5 

aat ggg gaa aca aag tat gcc tec gaa aac cac etc age cgc ttc gtg 323 

Asn Gly Glu Thr Lys Tyr Ala Ser Glu Asn His Leu Ser Arg Phe Val 

10 15 20 

tgg ttc ttt tet ggc ate gta gga ggt ggc ctg ctg atg etc ctg cea 371 



wo 02/083898 



PCT/IBOl/00914 



320 



Trp 


Phe 


Phe 


Ser Gly lie Val 


Gly Gly Gly Leu 


Leu 


Met 


Leu 


Leu 


Pro 




25 






30 


35 










40 




gca 


ttt 


gtc 


ttc att ggg ctg 


gaa cag gat gae 


tgc 


tgt 


ggc 


tgc 


tgt 


419 


Ala 


Phe 


Val 


Phe lie Gly Leu 


Glu Gin Asp Asp 


Cys 


Cys 


Gly Cys Cys 










45 


50 








55 






ggc 


cat 


gaa 


aac tgt ggc aaa 


cga tgt gcg atg 


ctt 


tct 


tct 


gta 


ttg 


4 67 


Gly 


His 


Glu 


Asn Cys Gly Lys 


Arg Cys Ala Met 


Leu 


Ser 


Ser 


Val 


Leu 










60 


65 






70 








get 


get 


etc 


att gga att gca 


gga yet ggc tac 


tgt 


gtc 


att 






509 


Ala 


Ala 


Leu 


lie Gly He Ala 


Gly Xaa Gly Tyr 


Cys 


Val 


He 












75 




80 




85 











<210> 589 
<211> 523 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 302.. 523 



<220> 

<221> sig_peptide 

<222> 302.. 352 

<223> Von Heijne matrix 

score 4.59999990463257 
seq LLPLCQRIHCIIG/RF 



<400> 589 

agatgagtgt tcagctctca gcagagaggt tagctcctct ctgcagcttg tcctgttgtc 60 

tcctcaagtc tggctgagtc cggagttttt atgagcctca gaggggagga agtgcatgct 120 

gattaatcca tgggcaggcc tggaaaagtt cccactccag tctgcgggac ccacagcctg 180 

gccctcaggc tcaggcttcc cwggcttgaa gattgggctt cacctgggac ctaccccttc 240 

tgcctaggag catgtctgcc tcctgctgcc tttcatggtg cccagccaag gctaagtcga 300 
a atg tgg ccc aac ctt ctt ccc ctg tgc cag egg ate cat tgc ate att 34 9 
Met Trp Pro Asn Leu Leu Pro Leu Cys Gin Arg He His Cys He He 
-15 -10 -5 

ggt cgc ttc egg tgc aat ggg ttt gag gac tgt ccc gat ggc age gat 397 

Gly Arg Phe Arg Cys Asn Gly Phe Glu Asp Cys Pro Asp Gly Ser Asp 
15 10 15. 

gaa gag aac tgc aca gca aac cet ctg ctt tgc tec acc gee cgc tac 445 

Glu Glu Asn Cys Thr Ala Asn Pro Leu Leu Cys Ser Thr Ala Arg Tyr 

20 25 30 

cae tgc aag aac ggc etc tgt att gac aag age ttc ate tgc gat gga 4 93 

His Cys Lys Asn Gly Leu Cys He Asp Lys Ser Phe He Cys Asp Gly 

35 40 45 

cag aat aac tgt caa gac aac agt gat gag 523 
Gin Asn Asn Cys Gin Asp Asn Ser Asp Glu 
50 55 



<210> 590 
<211> 513 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 103. .513 



<220> 

<221> sig_peptide 

<222> 103.. 243 

<223> Von Heijne matrix 



wo 02/083898 



321 



PCT/IBOl/00914 



score 4.40000009536743 
seq MATMLQLVPLLKA/SI 

<400> 590 

aatcattaca gaaggatttg gcctctaagg gcctctggaa ttctttggaa gaaggcattg 60 
gctgtgtagc tgacactcta ctcactgtgg acatggacgt cc atg acc ctg cag 114 

Met Thr Leu Gin 
-45 

tgg cac ttg tat gag atg gca cgc aac ctg aag gtg cag gat atg ctg 162 
Trp His Leu Tyr Glu Met Ala Arg Asn Leu Lys Val Gin Asp Met Leu 

-40 -35 -30 

egg gca gag gtc ttg get gcg egg cac cag gcc cag gga gae atg gcc 210 
Arg Ala Glu Val Leu Ala Ala Arg His Gin Ala Gin Gly Asp Met Ala 

-25 -20 -15 

acg atg eta cag ctg gtc ccc etc etc aaa gcc age ate aag gag aca 258 
Thr Met Leu Gin Leu Val Pro Leu Leu Lys Ala Ser lie Lys Glu Thr 

-10 -5 15 

eta aga ctt cac ccc ate tec gtg acc ctg cag aga tat ctt gta aat 306 
Leu Arg Leu His Pro He Ser Val Thr Leu Gin Arg Tyr Leu Val Asn 

10 15 20 

gae ttg gtt ctt cga gat tac atg att eet gcc aag aca ctg gtg caa 354 
Asp Leu Val Leu Arg Asp Tyr Met He Pro Ala Lys Thr Leu Val Gin 

25 30 35 

gtg gcc ate tat get ctg ggc cga gag ccc acc ttc ttc ttc gae ccg 402 
Val Ala He Tyr Ala Leu Gly Arg Glu Pro Thr Phe Phe Phe Asp Pro 

40 45 50 

gaa aat ttt gae cea acc cga tgg ctg age aaa gae aag aac ate ace 450 
Glu Asn Phe Asp Pro Thr Arg Trp Leu Ser Lys Asp Lys Asn He Thr 

55 60 65 

tac ttc egg aac ttg ggc ttt ggc tgg ggt gtg egg cag tgt ctg gga 498 
Tyr Phe Arg Asn Leu Gly Phe Gly Trp Gly Val Arg Gin Cys Leu Gly 
70 75 80 85 

egg csg ate get gag 513 
Arg Xaa He Ala Glu 
90 

<210> 591 
<211> 490 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 196.. 489 

<220> 

<221> sig_peptide 

<222> 196.. 255 

<223> Von Heijne matrix 

score 4.30000019073486 

seq LLDSSLMASGTAS/RS 

<400> 591 

agacgtacet caeggaagee ggctttggcc ctgeggctgc taccgtcgcc geggagaaat 60 
tgttggatct ggcagtctag gaatgaatct cetctcagee tttaagctca cetggtcaga 120 
atecttggat gagcctgtgg gacegttcct cetagecegg tggtttggaa ccagtggctt 180 
tgggactgta agagg atg gae aaa gat tct cag ggg ctg eta gat tea tec 231 
Met Asp Lys Asp Ser Gin Gly Leu Leu Asp Ser Ser 
-20 -15 -10 

ctg atg gca tea ggc act gee age cgc tea gag gat gag gag tea ctg 279 
Leu Met Ala Ser Gly Thr Ala Ser Arg Ser Glu Asp Glu Glu Ser Leu 

-5 15 
gca ggg cag aag cga gcc tec tec cag gcc ttg ggc acc ate eet aaa 327 



wo 02/083898 



Ala Gly 


Gin 


Lys 


Arg 


Ala 


10 










egg aga 


age 


tee 


tec 


agg 


Arg Arg 


Ser 


Ser 


Ser 


Arg 


25 








30 


ctg gtg 


gag 


age 


age 


ctg 


Leu Val 


Glu 


Ser 


Ser 

45 


Leu 


agt ggg 


gtg 


gaa 


cag 


gge 


Ser Gly 


Val 


Glu 


Gin 


Gly 






60 






aga aga 


aga 


agg 


tat 


caa 


Arg Arg 


Arg 


Arg 


Tyr 


Gin 




75 









322 



Ser 


Ser 


Gin 


Ala Leu Gly 


15 






20 


tte 


ate 


aag 


agg aag aag 


Phe 


He 


Lys 


Arg Lys Lys 








35 


gca 


aaa 


tct 


tct acc egg 


Ala 


Lys 


Ser 


Ser Thr Arg 








50 


get 


gtt 


egg 


gta gtg aac 


Ala 


Val Arg 


Val Val Asn 



65 



PCT/IBOl/00914 

Thr He Pro Lys 

ttc gat gat gag 375 
Phe Asp Asp Glu 
40 

gca aag ggg gcc 423 
Ala Lys Gly Ala 
55 

cet cet cca gtg 471 
Pro Pro Pro Val 
70 

490 



<210> 592 
<211> 537 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 197. .535 



<220> 

<221> sig_peptide 

<222> 197. .304 

<223> Von Heijne matrix 

score 3.79999995231628 
seq WLGCIVFPFSALL/QQ 



<400> 592 

cetttttaaa caatatccat tctcttgatg tctacaatat cacctttctt atagattcgc 60 
atatacatgg ecaaaggaac aactccatgt tttctaaaag gcctagagaa catacatcag 120 
ctcaccagga eatgattata gcttcteaag cttatctaaa ataaaagata acatatatat 180 
eaacattttt gatgaa atg atg act gaa aaa eat gag gat cac tgt etc aag 232 
Met Met Thr Glu Lys His Glu Asp His Cys Leu Lys 
-35 -30 -25 

age tgt agt ggt cac tea tat ata aga aag aat tgg ctt gga tgc att 280 
Ser Cys Ser Gly His Ser Tyr He Arg Lys Asn Trp Leu Gly Cys He 

-20 -15 -10 

gtc ttc cet ttt tct get ctt ctg caa caa tct gag att agt gga aca 328 
Val Phe Pro Phe Ser Ala Leu Leu Gin Gin Ser Glu He Ser Gly Thr 

-5 15 
ttt caa gta act att cca cca gtt ttg etc ggg tat act tgg agt aat 376 
Phe Gin Val Thr He Pro Pro Val Leu Leu Gly Tyr Thr Trp Ser Asn 

10 15 20 

act tat gta ttt cet aaa gaa gat tec aat gag cag aat tta aaa gaa 424 
Thr Tyr Val Phe Pro Lys Glu Asp Ser Asn Glu Gin Asn Leu Lys Glu 
25 30 35 40 

tgt aca ttc tta aat att ttt get acc att gaa cet caa ata tea tat 472 
Cys Thr Phe Leu Asn He Phe Ala Thr He Glu Pro Gin He Ser Tyr 

45 50 55 

gtc acc tgt aat cca aca eta gat aag ttt tta gat caa aca gag gtc 520 
Val Thr Cys Asn Pro Thr Leu Asp Lys Phe Leu Asp Gin Thr Glu Val 

60 65 70 

ttg cag aga gca cag at 537 
Leu Gin Arg Ala Gin 
75 



<210> 
<211> 
<212> 



593 
488 
DNA 



wo 02/083898 



PCT/IBOl/00914 



323 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 79. .486 



<220> 

<221> sig_peptide 

<222> 79. .201 

<223> Von Heijne matrix 

score 3.59999990463257 
seq LHTSVTLFLLSYC/DC 



•<400> 593 

agatgattcc ctgattctcc agagagatta cacacttcgt ttgtggctaa ggttactgtg 60 

acccaatgaa agaagaaa atg aaa gcc ata aag aaa agt ctt aca gaa gaa 111 

Met Lys Ala lie Lys Lys Ser Leu Thr Glu Glu 

-40 -35 

gaa tac ctg tac ctg gac ttt tct cac caa aca gaa gga tgc ate ttt 159 

Glu Tyr Leu Tyr Leu Asp Phe Ser His Gin Thr Glu Gly Cys He Phe 
-30 -25 -20 -15 

cot ctt cat aca tct gta act tta ttt ctg tta tct tac tgt gac tgt 207 

Pro Leu His Thr Ser Val Thr Leu Phe Leu Leu Ser Tyr Cys Asp Cys 

-10 -5 1 

aaa ate ttt aaa att tgc tta gtt gtc acc aaa gag gtg agt aga gat 255 

Lys He Phe Lys He Cys Leu Val Val Thr Lys Glu Val Ser Arg Asp 

5 10 15 

agt tea eta eta aga gat gac ctg ate eag gat gtt gaa ata cag att 303 

Ser Ser Leu Leu Arg Asp Asp Leu He Gin Asp Val Glu He Gin He 

20 25 30 

att tea agg cag gag etc cca cca ata gtc caa aat tgc tgt ttg cct 351 

He Ser Arg Gin Glu Leu Pro Pro He Val Gin Asn Cys Cys Leu Pro 
35 40 45 50 

gca gta gta gaa ega tea gac aat ttt tgt aga gea gga ett get gtt 399 

Ala Val Val Glu Arg Ser Asp Asn Phe Cys Arg Ala Gly Leu Ala Val 

55 60 65 

gta ttg aga cac ata ate cag aaa tec tat gaa gca gac cec tta aag 4 47 

Val Leu Arg His He He Gin Lys Ser Tyr Glu Ala Asp Pro Leu Lys 

70 75 80 

aag gaa ctt ttg gaa ett etg gge ttw aaa aga ett get tg 488 

Lys Glu Leu Leu Glu Leu Leu Gly Xaa Lys Arg Leu Ala 
85 90 95 



<210> 594 
<211> 490 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 80. .490 



<220> 

<221> sig_peptide 

<222> 80. .127 

<223> Von Heijne matrix 

score 3.59999990463257 
seq FSLLSISGPPISS/SA 



<400> 594 

gaatgtttat cctctggaca aaceageeag ecteteeaga geaggegtgt gatctetgta 60 
eeecegeagt ggteagaat atg gag aac tte tea etc etc age ate tct gga 112 

Met Glu Asn Phe Ser Leu Leu Ser He Ser Gly 
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-15 -10 

cct cca ate tct tec tec gee ctg agt get ttt ecc gac att atg tte 160 

Pro Pro He Ser Ser Ser Ala Leu Ser Ala Phe Pro Asp He Met Phe 
-5 15 10 

tct cgt gee ace age ctg cca gac att gea aag aca gea gta ecc act 208 

Ser Arg Ala Thr Ser Leu Pro Asp He Ala Lys Thr Ala Val Pro Thr 

15 20 25 

gag gea tee age cca get eag gee ctg cca ecc eag tac caa age ate 256 

Glu Ala Ser Ser Pro Ala Gin Ala Leu Pro Pro Gin Tyr Gin Ser lie 
30 35 40 



att 


gtc 


agg 


caa 


ggg 


ata 


cag aac 


aca 


gtg 


etc 


tea 


cca 


gre 


wgc 


age 


304 


He 


Val 


Arg 


Gin 


Gly 


He 


Gin Asn 


Thr 


Val 


Leu 


Ser 


Pro 


Xaa 


Xaa 


Ser 






45 










50 








55 












ttg 


ggg 


gac 


acc 


cag 


cac 


gga gag 


aag 


ctg 


agg 


egg 


aac 


tgc 


act 


ate 


352 


Leu 


Gly 


Asp 


Thr 


Gin 


His Gly Glu 


Lys 


Leu 


Arg 


Arg 


Asn 


Cys 


Thr 


He 




60 










65 








70 










75 




tac 


egg 


ecc 


tgg 


tte 


tec 


ecc tac 


age 


tac 


tte 


gtg 


tgt 


gea 


gac 


aaa 


400 


Tyr Arg 


Pro 


Trp 


Phe 


Ser 


Pro Tyr 


Ser 


Tyr 


Phe 


Val 


Cys 


Ala 


Asp 


Lys 












80 








85 










90 






gag 


age 


eag 


ctg 


gag 


gcc 


tat gac 


tte 


cca 


gag 


gtg 


cag 


cag 


gat 


gag 


448 


Glu 


Ser 


Gin 


Leu 


Glu 


Ala 


Tyr Asp 


Phe 


Pro 


Glu 


Val 


Gin 


Gin 


Asp 


Glu 










95 








100 










105 








ggc 


aag 


tgg 


gac 


aac 


tgc 


ctt tct 


gag 


gac 


atg 


get 


gag 


aac 






490 


Gly 


Lys 


Trp 


Asp 


Asn 


Cys 


Leu Ser 


Glu 


Asp 


Met 


Ala 


Glu 


Asn 









110 115 120 

<210> 595 
<211> 495 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 157.. 495 

<400> 595 

attetttcga caggagttga aaccagcacc ttccctttct ctgagtcctg cctcettctg 60 
cagaagggag eteaaaagaa etttgttgtt ttgcetttta ctctggggtg aaagcggcag 120 
gaggtatgtg agatggtgaa atgatttget tetgee atg ctg ggg tea egg gtg 174 

Met Leu Gly Ser Arg Val 
1 5 

gat cgc cct aaa etc teg gtg gcc ecc tea gta gtt ttg gaa gag gac 222 
Asp Arg Pro Lys Leu Ser Val Ala Pro Ser Val Val Leu Glu Glu Asp 

10 15 20 

caa gtc ctt gtc tct cca gea gtg gac ctg gaa gma gga tgc egg etc 270 
Gin Val Leu Val Ser Pro Ala Val Asp Leu Glu Xaa Gly Cys Arg Leu 

25 30 35 

agg gac tte act gag aaa ata atg aat gtc aaa gga aaa gta att ctg 318 
Arg Asp Phe Thr Glu Lys He Met Asn Val Lys Gly Lys Val He Leu 

40 45 50 

tea atg ctg gtt gtc tea act gtg ate att gtg ttt tgg gaa ttt ate 366 
Ser Met Leu Val Val Ser Thr Val He He Val Phe Trp Glu Phe He 
55 60 65 70 

aac age aca gaa ggc tct tte ttg tgg ata tat cac tea aaa aac cca 414 
Asn Ser Thr Glu Gly Ser Phe Leu Trp He Tyr His Ser Lys Asn Pro 

75 80 85 

gaa gtt gat gac age agt get cag aag ggc tgg tgg ttt ctg age tgg 462 
Glu Val Asp Asp Ser Ser Ala Gin Lys Gly Trp Trp Phe Leu Ser Trp 

90 95 100 

ttt aac aat ggg ate cac aat tat caa caa ggg 495 
Phe Asn Asn Gly He His Asn Tyr Gin Gin Gly 
■ 105 110 
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<210> 596 
<211> 939 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 416. .937 



<400> 596 

aaaaggagta gctattagcc aattcggcag ggcccgcttt ttagaagctt gatttcyttt 60 
gaagatgaaa gactagcgga agctctgcct ctttccccag tggcgaggga actcggggcg 120 
attggctggg aactgtatcc acccaaatgt caccgatttc ttcctatgca ggaaatgagc 180 
agacccatca ataagaaatt tctcagcctg gccgaaaatg gttggcccca cgaagccacg 240 
acaactggag gcaaagaggg ttgctcaacg ccccgcctca ttggaaaacc aaatcagatc 300 
tgggacctat atagcgtggc ggaggcgggg cgatgattgt cgcgctcgca cccactgcag 360 
ctgcgcacag tcgcatttct ttccccgccc ctgagaccct gcagcaccat ctgtc atg 418 

Met 
1 

gcg get ggg ctg ttt ggt ttg age get cge cgt ctt ttg gcg gca gcg 466 
Ala Ala Gly Leu Phe Gly Leu Ser Ala Arg Arg Leu Leu Ala Ala Ala 

5 10 15 

gcg acg cga ggg etc eeg gee gee cge gte cge tgg gaa tet age ttc 514 
Ala Thr Arg Gly Leu Pro Ala Ala Arg Val Arg Trp Glu Ser Ser Phe 

20 25 30 

tec agg act gtg gte gee eeg tec get gtg gcg gga aag egg cec cca 562 
Ser Arg Thr Val Val Ala Pro Ser Ala Val Ala Gly Lys Arg Pro Pro 

35 40 45 

gaa ccg acc aca eeg tgg caa gag gac cca gaa ecc gag gac gaa aac 610 
Glu Pro Thr Thr Pro Trp Gin Glu Asp Pro Glu Pro Glu Asp Glu Asn 
50 55 60 65 

ttg tat gag aag aae cca gae tee eat ggt tat gae aag gae ecc gtt 658 
Leu Tyr Glu Lys Asn Pro Asp Ser His Gly Tyr Asp Lys Asp Pro Val 

70 75 80 

ttg gac gte tgg aae atg cga ctt gte ttc ttc ttt ggc gte tec ate 706 
Leu Asp Val Trp Asn Met Arg Leu Val Phe Phe Phe Gly Val Ser lie 

85 90 95 

ate ctg gte ctt ggc age acc ttt gtg gcc tat ctg ect gac tac agg 754 
lie Leu Val Leu Gly Ser Thr Phe Val Ala Tyr Leu Pro Asp Tyr Arg 

100 105 110 

atg aaa gag tgg tee cge cge gaa get gag agg ctt gtg aaa tac cga 802 
Met Lys Glu Trp Ser Arg Arg Glu Ala Glu Arg Leu Val Lys Tyr Arg 

115 120 125 

gag gcc aat ggc ttc cca tea tgg aat cca act get teg acc cca gca 850 
Glu Ala Asn Gly Phe Pro Ser Trp Asn Pro Thr Ala Ser Thr Pro Ala 
130 135 140 145 

aga tee age tge eag agg atg agt gae eag ttg eta agt ggg get caa 898 
Arg Ser Ser Cys Gin Arg Met Ser Asp Gin Leu Leu Ser Gly Ala Gin 

150 155 160 

gaa gca ccg ect tec cca ecc cet gcc tge cat tet gae et 939 
Glu Ala Pro Pro Ser Pro Pro Pro Ala Cys His Ser Asp 
165 170 



<210> 597 
<211> 528 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> 51.. 527 



<400> 597 
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aagctggggg agctcgtcgc cgccgccggc ggctagcggg cgtccgcgcc atg gag 56 

Met Glu 
1 



cgc 


tac 


gcg 


gee 


gcc 


ttg 


gag 


gag 


gtg 


gcg 


gac 


ggt 


gcc 


egg 


cag 


cag 


104 


Arg 


Tyr 


Ala 

5 


Ala 


Ala 


Leu 


Glu 


Glu 

10 


Val 


Ala 


Asp 


Gly 


Ala 
15 


Arg 


Gin 


Gin 




gag 


cga 


cae 


tac 


cag 


ttg 


ctg 


teg 


gcg 


eta 


cag 


age 


ctg 


gtg 


aag 


gag 


152 


Glu 


Arg 
20 


His 


Tyr 


Gin 


Leu 


Leu 
25 


Ser 


Ala 


Leu 


Gin 


Ser 

30 


Leu 


Val 


Lys 


Glu 




ttg 


ecc 


age 


tet 


ttc 


cag 


cag 


cgc 


ctg 


tee 


tac 


acc 


acg 


etc 


age 


gac 


200 


Leu 


Pro 


Ser 


Ser 


Phe 


Gin 


Gin Arg 


Leu 


Ser 


Tyr 


Thr 


Thr 


Leu 


Ser 


Asp 




35 










40 










45 










50 




ctg 


gee 


ctg 


gcg 


ctt 


etc 


gac 


ggc 


act 


gtg 


ttc 


gaa 


ate 


gtg 


cag 


ggg 


248 


Leu 


Ala 


Leu 


Ala 


Leu 


Leu 


Asp Gly 


Thr 


Val 


Phe 


Glu 


He 


Val 


Gin 


Gly 












55 










60 










65 






eta 


etg 


gag 


ate 


cag 


cac 


etc 


ace 


gaa 


aag 


age 


ctg 


tac 


aac 


cag 


cgc 


296 


Leu 


Leu 


Glu 


He 


Gin 


His 


Leu 


Thr 


Glu 


Lys 


Ser 


Leu 


Tyr 


Asn 


Gin 


Arg 










70 










75 








80 








ctg 


cgc 


eta 


cag 


aac 


gag 


cac 


cga 


gtg 


etc 


agg 


cag 


gcg 


ctg 


egg 


cag 


344 


Leu Arg 


Leu 


Gin 


Asn 


Glu 


His 


Arg 


Val 


Leu 


Arg 


Gin 


Ala 


Leu Arg 


Gin 








85 










90 










95 










aag 


cac 


cag 


gaa 


gcc 


cag 


cag 


gcc 


tgc 


egg 


ecc 


cac 


aac 


ctg 


cct 


gtg 


392 


Lys 


His 
100 


Gin 


Glu 


Ala 


Gin 


Gin 
105 


Ala 


Cys 


Arg 


Pro 


His 
110 


Asn 


Leu 


Pro 


Val 




gtt 


cag 


gcg 


get 


cag 


cag 


cga 


gaa 


eta 


gag 


gcc 


gtg 


gaa 


cac 


egg 


ate 


440 


Val 


Gin 


Ala 


Ala 


Gin 


Gin 


Arg 


Glu 


Leu 


Glu 


Ala 


Val 


Glu 


His 


Arg 


He 




115 










120 










125 










130 




cgt 


gag 


gag 


cag 


egg 


gcg 


atg 


gac 


cat 


aag 


ate 


ate 


ctg 


gag 


etg 


gac 


488 


Arg 


Glu 


Glu 


Gin 


Arg Ala 


Met 


Asp 


His 


Lys 


He 


He 


Leu 


Glu 


Leu 


Asp 












135 










140 










145 






egg 


aag 


gtg 


get 


gac 


cag 


cag 


age 


aca 


ctg 


gag 


aag 


gcg 


g 






528 


Arg 


Lys 


Val 


Ala 
150 


Asp 


Gin 


Gin 


Ser 


Thr 
155 


Leu 


Glu 


Lys 


Ala 











<210> 598 
<211> 556 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 128. .556 

<400> 598 

cttctttgge cctgtgacac gtagcaacgg ggctggttca gggtctgaaa cagagtttgg 60 
gggttgtttg ggattagtga agctactgcc tttgccgcea gcgcasctca gagtttgatt 120 



atttgca 


atg 


tea 


ggc 


ttt 


gaa 


aac 


tta 


aac 


acg 


gat 


ttc 


tac 


cag 


aca 


169 




Met 


Ser 


Gly 


Phe 


Glu 


Asn 


Leu 


Asn 


Thr 


Asp 


Phe 


Tyr 


Gin 


Thr 






1 








5 










10 












agt tac 


age 


ate 


gat 


gat 


cag 


tea 


cag 


cag 


tec 


tat 


gat 


tat 


gga 


gga 


217 


Ser Tyr 


Ser 


He 


Asp 


Asp 


Gin 


Ser 


Gin 


Gin 


Ser 


Tyr 


Asp 


Tyr 


Gly 


Gly 




15 








20 










25 










30 




agt gga 


gga 


ecc 


tat 


age 


aaa 


cag 


tat 


get 


ggc 


tat 


gac 


tat 


teg 


cag 


265 


Ser Gly 


Gly 


Pro 


Tyr 


Ser 


Lys 


Gin 


Tyr 


Ala 


Gly 


Tyr 


Asp 


Tyr 


Ser 


Gin 










35 










40 










45 






caa ggc 


aga 


ttt 


gtc 


cct 


eca 


gac 


atg 


atg 


cag 


eca 


caa 


cag 


eca 


tac 


313 


Gin Gly 


Arg 


Phe 


Val 


Pro 


Pro 


Asp 


Met 


Met 


Gin 


Pro 


Gin 


Gin 


Pro 


Tyr 








50 










55 










60 








acc ggg 


cag 


att 


tac 


cag 


eca 


act 


cag 


gca 


tat 


act 


eca 


get 


tea 


cct 


361 


Thr Gly 


Gin 


He 


Tyr 


Gin 


Pro 


Thr 


Gin 


Ala 


Tyr 


Thr 


Pro 


Ala 


Ser 


Pro 






65 










70 










75 










cag cct 


ttc 


tat 


gga 


aac 


aac 


ttt 


gag 


gat 


gag 


eca 


cct 


tta 


tta 


gaa 


409 


Gin Pro 


Phe 


Tyr 


Gly 


Asn 


Asn 


Phe 


Glu 


Asp 


Glu 


Pro 


Pro 


Leu 


Leu 


Glu 
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80 ' 85 90 

gag tta ggt ate aat ttt gac cac ate tgg caa aaa aca eta aea gta 457 
Glu Leu Gly lie Asn Phe Asp His He Trp Gin Lys Thr Leu Thr Val 
95 100 105 110 

tta cat ecg tta aaa gta gca gat ggc age ate atg aat gaa act gat 505 
Leu His Pro Leu Lys Val Ala Asp Gly Ser He Met Asn Glu Thr Asp 

115 120 125 

ttg gea ggt cca atg gtt ttt tgc ctt get ttt tgg age aea ttg eta 553 
Leu Ala Gly Pro Met Val Phe Cys Leu Ala Phe Trp Ser Thr Leu Leu 

130 135 140 

ctg 556 
Leu 



<210> 599 
<211> 564 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 151. .564 



<400> 599 

ataacaaaae aatccaaeta tactgaataa taeagaatea aaetgggate eaaaaacaat 60 
ttaaaacaaa gttggaaaet caaecgaaac aagaacaaat egaatcegac atgaaaeaag 120 
tctcaeaagt gttettaage tgacagaaag atg gtc aaa acc aaa ctt aaa ace 174 

Met Val Lys Thr Lys Leu Lys Thr 

1 5 



gta 


gta 


ate gac 


ata 


aeg 


gtc 


ata 


cca 


tte 


gee 


gac 


gat 


ttt 


tte 


tgc 


222 


Val 


Val 
10 


He Asp 


He 


Thr 


Val 
15 


He 


Pro 


Phe 


Ala 


Asp 
20 


Asp 


Phe 


Phe 


Cys 




cct 


gga 


get cca 


gcc 


ggc 


gta 


gag 


ata 


tea 


cca 


gcg 


get 


gat 


gga 


cca 


270 


Pro Gly 


Ala Pro 


Ala 


Gly 


Val 


Glu 


He 


Ser 


Pro 


Ala 


Ala 


Asp 


Gly 


Pro 




25 








30 










35 










40 




tct 


cct 


gaa gga 


ccc 


ate 


gee 


gag 


cct 


gac 


gga 


gaa 


tct 


gac 


ggt 


gaa 


318 


Ser 


Pro 


Glu Gly 


Pro 
45 


He 


Ala 


Glu 


Pro 


Asp 
50 


Gly 


Glu 


Ser 


Asp 


Gly 
55 


Glu 




gaa 


gaa 


gta gag 


cct 


ccc 


gga 


gcc 


aac 


gat 


gga 


gaa 


aeg 


gtg 


ggt 


gag 


366 


Glu 


Glu 


Val Glu 
60 


Pro 


Pro 


Gly 


Ala 


Asn 
65 


Asp 


Gly 


Glu 


Thr 


Val 
70 


Gly 


Glu 




ttt 


gaa 


aca ggg 


gat 


gaa 


gat 


tta 


ggt 


gcc 


att 


gat 


cca 


ggt 


ggt 


gaa 


414 


Phe 


Glu 


Thr Gly 
75 


Asp 


Glu 


Asp 


Leu 

80 


Gly 


Ala 


He 


Asp 


Pro 

85 


Gly 


Gly 


Glu 




gtc 


ate 


gga gea 


gaa 


gac 


gga 


gga 


ata 


gtt 


gaa 


gat 


gat 


tta 


ggt 


gee 


4 62 


Val 


He 
90 


Gly Ala 


Glu 


Asp 


Gly 
95 


Gly 


He 


Val 


Glu 


Asp 
100 


Asp 


Leu 


Gly 


Ala 




att 


ggt 


gee ggc 


ggt 


gaa 


gtc 


ate 


gga 


gca 


gag 


gat 


ggg 


gaa 


aca 


ggg 


510 


He Gly 


Ala Gly 


Gly 


Glu 


Val 


He 


Gly 


Ala 


Glu 


Asp 


Gly 


Glu 


Thr 


Gly 




105 








110 










115 










120 




gae 


gtg 


gat tta 


ggt 


ggt 


gcc 


ggt 


ggt 


gaa 


gtc 


gtc 


gga 


gaa 


aca 


ggg 


558 


Asp 


Val 


Asp Leu 


Gly 
125 


Gly 


Ala 


Gly 


Gly 


Glu 
130 


Val 


Val 


Gly 


Glu 


Thr 
135 


Gly 





gag etc 564 
Glu Leu 



<210> 600 
<211> 421 
<212> DNA 

<213> Homo sapiens 



<220> 
<221> 
<222> 



CDS 

198. .419 



wo 02/083898 



328 



PCT/IBOl/00914 



<220> 

<221> misc_feature 
<222> 275. .278 
<223> n=ar g, c or t 

<400> 600 

cttttacttc aggcgaacct gaactcagtt aatgcaattt tgaagcatgc cctgaataga 60 

ttcagtcatt atcaagtcaa actaaaaacg gtgaaaggtt gcgactatta ccaaatagga 120 

aaaacctgaa gacataagaa ctacacatga ggaatatgtc atttagcact ttcacttttt 180 

gatctccaca gaagaca atg aga agt cat acc ata aca atg acg aca act 230 
Met Arg Ser His Thr He Thr Met Thr Thr Thr 











1 








5 






10 






tea 


gte 


age 


age 


tgg 


cct 


tac 


tec 


tec cac aga 


atg 


cgc 


ttt 


awn 


nnn 


278 


Ser 


Val 


Ser 


Ser 


Trp 


Pro 


Tyr 


Ser 


Ser His Arg 


Met 


Arg 


Phe 


Xaa 


Xaa 










15 










20 






25 








hat 


cat 


age 


gac 


caa 


ccg 


cca 


caa 


aac tte tea 


gca 


aca 


cca 


aat 


gtt 


326 


Xaa 


His 


Ser Asp 


Gin 


Pro 


Pro 


Gin 


Asn Phe Ser 


Ala 


Thr 


Pro 


Asn 


Val 








30 










35 






40 










act 


acc 


tgt 


cec 


atg 


gat 


gaa 


aaa 


ttg khs act 


act 


gtg 


tta 


acc 


aca 


374 


Thr 


Thr 


Cys 


Pro 


Met 


Asp 


Glu 


Lys 


Leu Xaa Thr 


Thr 


Val 


Leu 


Thr 


Thr 






45 










50 






55 












tec 


tac 


tct 


gtt 


att 


tte 


ate 


gtg 


gga ctg gtt 


ggg 


aac 


ata 


ate 


gc 


421 


Ser 


Tyr 


Ser 


Val 


He 


Phe 


He Val Gly Leu Val 


Gly 


Asn 


He 


He 







60 65 70 

<210> 601 
<211> 529 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 124.. 528 

<400> 601 

ggctgcgcct agtgggccgt tgccttacag ttgetgagag gaggegagag gcgggggegc 60 
tagggccgag atcatgtctg actgggagag gtttccttgg cagcagagga cgctaggttt 120 



ggg 


atg 


aaa 


gaa 


get 


ggg 


cag 


atg 


saa 


aat 


ctg 


gag 


age 


gcg 


agg 


gcc 


168 




Met 


Lys 


Glu 


Ala 


Gly 


Gin 


Met 


Xaa 


Asn 


Leu 


Glu 


Ser 


Ala 


Arg 


Ala 






1 








5 










10 










15 




ggg 


egg 


tea 


gte 


age 


ace 


cag 


act 


ggc 


age 


atg 


acc 


ggt 


cag 


ata 


cca 


216 


Gly Arg 


Ser 


Val 


Ser 


Thr 


Gin 


Thr 


Gly 


Ser 


Met 


Thr 


Gly 


Gin 


He 


Pro 












20 










25 










30 






agg 


ett 


tct 


aaa 


gte 


aac 


ett 


tte 


act 


ctg 


etc 


age 


etc 


tgg 


atg 


gag 


264 


Arg 


Leu 


Ser 


Lys 


Val 


Asn 


Leu 


Phe 


Thr 


Leu 


Leu 


Ser 


Leu 


Trp 


Met 


Glu 








35 










40 










45 








etc 


ttt 


cca 


gca 


gaa 


gcc 


cag 


egg 


caa 


aaa 


tct 


cag 


aaa 


aat 


gaa 


grg 


312 


Leu 


Phe 


Pro 

50 


Ala 


Glu 


Ala 


Gin 


Arg 

55 


Gin 


Lys 


Ser 


Gin 


Lys 
60 


Asn 


Glu 


Xaa 




gga 


aag 


cat 


gga 


cec 


tta 


gga 


gat 


aat 


gaa 


gag 


agg 


acc 


aga 


gta 


tct 


360 


Gly Lys 


His 


Gly 


Pro 


Leu 


Gly 


Asp 


Asn 


Glu 


Glu 


Arg 


Thr 


Arg 


Val 


Ser 






65 










70 










75 












act 


gac 


aaa 


aga 


cag 


gta 


aag 


aga 


act 


ggt 


ett 


gtg 


gtg 


gtg 


aaa 


aac 


408 


Thr Asp 


Lys 


Arg 


Gin 


Val 


Lys 


Arg 


Thr 


Gly 


Leu 


Val 


Val 


Val 


Lys 


Asn 




80 










85 










90 










95 




atg 


aaa 


att 


gtt 


ggt 


etc 


cac 


tgt 


tct 


agt 


gaa 


gat 


tta 


cat 


gcc 


ggg 


456 


Met 


Lys 


He 


Val 


Gly 
100 


Leu 


His 


Cys 


Ser 


Ser 
105 


Glu 


Asp 


Leu 


His 


Ala 
110 


Gly 




cag 


att 


get 


ett 


att 


aaa 


cat 


ggg 


tea 


agg 


ctg 


aaa 


aac 


tgt 


gat 


cct 


504 


Gin 


He 


Ala 


Leu 
115 


He 


Lys 


His 


Gly 


Ser 
120 


Arg 


Leu 


Lys 


Asn 


Cys 
125 


Asp 


Pro 




tat 


ttt 


tec 


aga 


aaa 


cmt 


gtt 


ctg 


c 
















529 
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Tyr Phe Ser Arg Lys Xaa Val Leu 
130 135 

<210> 602 
<211> 208 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 24. .206 

<400> 602 

ccttccctgc ggccgcgcag age atg ccg gga gcc gcc ggc cgc egg tgt ecg 53 

Met Pro Gly Ala Ala Gly Arg Arg Cys Pro 













1 






5 






10 




gac ege 


teg 


ecc 


ccg 


ttt 


gga 


ecc 


gac 


ttc ggt tct 


tct 


ggg gtg 


ttg 


101 


Asp Arg 


Ser 


Pro 


Pro 


Phe Gly Pro Asp Phe Gly Ser Ser Gly Val 


Leu 










15 










20 




25 






atg etc 


eta 


aag 


ecc 


gag 


age 


acg 


tgt 


cca gac cct 


age 


ctg tac 


gac 


149 


Met Leu 


Leu 


Lys 


Pro 


Glu 


Ser 


Thr 


Cys 


Pro Asp Pro 


Ser 


Leu Tyr 


Asp 








30 










35 






40 






gcd baa 


etc 


tgc 


ccg 


gtc 


cca 


gaa 


cca 


aag cca tgc 


egg 


ggt gtg 


gcc 


197 


Ala Xaa 


Leu 


Cys 


Pro 


Val 


Pro 


Glu 


Pro 


Lys Pro Cys 


Arg 


Gly Val Ala 






45 










50 






55 








tct gac 


ecc 


ac 




















208 


Ser Asp 


Pro 

























60 



<210> 603 
<211> 587 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 198. .587 

<400> 603 

cttttacttc aggcgaacct gaactcagtt aatgcaattt tgaagcatgc cctgaataga 60 
tteagtcatt atcaagteaa actaaaaacg gtgaaaggtt gcgactatta ceaaatagga 120 
aaaacctgaa gacataagaa etacaeatga ggaatatgtc atttagcaet ttcacttttt 180 
gatctccaca gaagaca atg aga agt cat ace ata aca atg acg aca act 230 
Met Arg Ser His Thr He Thr Met Thr Thr Thr 
15 10 



tea 


gtc 


age 


age 


tgg 


cct 


tac 


tee 


tec 


eac 


aga 


atg 


cgc 


ttt 


ata 


acc 


278 


Ser 


Val 


Ser 


Ser 


Trp 


Pro 


Tyr 


Ser 


Ser 


His 


Arg 


Met 


Arg 


Phe 


He 


Thr 










15 










20 










25 








aat 


cat 


age 


gac 


caa 


ccg 


cca 


caa 


aac 


ttc 


tea 


gca 


aca 


cca 


aat 


gtt 


326 


Asn 


His 


Ser 


Asp 


Gin 


Pro 


Pro 


Gin 


Asn 


Phe 


Ser 


Ala 


Thr 


Pro 


Asn 


Val 








30 










35 










40 










act 


acc 


tgt 


ecc 


atg 


gat 


gaa 


aaa 


ttg 


eta 


tct 


act 


gtg 


tta 


acc 


aca 


374 


Thr 


Thr 


Cys 


Pro 


Met 


Asp 


Glu 


Lys 


Leu 


Leu 


Ser 


Thr 


Val 


Leu 


Thr 


Thr 






45 










50 










55 












tec 


tac 


tct 


gtt 


att 


ttc 


ate 


gtg 


gga 


ctg 


gtt 


ggg 


aac 


ata 


ate 


gee 


422 


Ser 


Tyr 


Ser 


Val 


He 


Phe 


He 


Val 


Gly 


Leu 


Val 


Gly 


Asn 


He 


He 


Ala 




60 










65 










70 










75 




etc 


tat 


gta 


ttt 


ctg 


ggt 


att 


cac 


cgt 


aaa 


aga 


aat 


tec 


att 


caa 


att 


470 


Leu 


Tyr 


Val 


Phe 


Leu 


Gly 


He 


His 


Arg 


Lys 


Arg 


Asn 


Ser 


He 


Gin 


He 












80 










85 










90 






tat 


eta 


ctt 


aac 


gta 


gcc 


att 


gca 


gae 


etc 


eta 


etc 


ate 


ttc 


tgc 


etc 


518 


Tyr 


Leu 


Leu 


Asn 


Val 


Ala 


He 


Ala 


Asp 


Leu 


Leu 


Leu 


He 


Phe 


Cys 


Leu 





95 100 105 



wo 02/083898 

cct ttc cga ata atg tat cat att aac 

Pro Phe Arg He Met Tyr His lie Asn 

110 115 

gtg att ytg tgc aag gtt gtg 

Val He Leu Cys Lys Val Val 
125 130 
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caa aac aag tgg aca eta ggt 566 
Gin Asn Lys Trp Thr Leu Gly 
120 

587 



<210> 604 
<211> 528 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 327.. 527 



<400> 604 

acaagaaaag aacatggtct agactgaagt accaactaaa tcatctcctt tcaaattatc 60 
accgacacca tcatggattc aagcaccgca cacagtccgg tgtttctggt atttcctcca 120 
gaaatcactg cttcagaata tgagtccaca gaactttcag ccacgacctt ttcaactcaa 180 
agccccttgc aaaaattatt tgctagaaaa atgaaaatct taggggatat ccattctggg 240 
gctctgtttt gttcattaat tctggagcct tcctaattgc agtgaaaaga aaaaccacag 300 
aaactctgat aatattgagc cgaata atg aat ttt ctt agt gcc ctg gga gca 353 

Met Asn Phe Leu Ser Ala Leu Gly Ala 

1 5 

ata get gga ate att etc etc aea ttt ggt ttc ate eta gat caa aac 401 

He Ala Gly He He Leu Leu Thr Phe Gly Phe He Leu Asp Gin Asn 

10 15 20 25 

tac att tgt ggt tat tct cac caa aat agt cag tgt aag get gtt act 449 

Tyr He Cys Gly Tyr Ser His Gin Asn Ser Gin Cys Lys Ala Val Thr 

30 35 40 

gtc ctg ttc ttg gga att ttg att aca ttg atg act ttc age att att 497 
Val Leu Phe Leu Gly He Leu He Thr Leu Met Thr Phe Ser He He 

45 50 55 

gaa tta ttc att tct ctg ctt tct caa ttt t 528 
Glu Leu Phe He Ser Leu Leu Ser Gin Phe 
60 65 



<210> 605 
<211> 227 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 62. .226 



<400> 605 

atttccggtt ccggtgtcag ttcgaggcgc cgccgccgcc gccgcagccg ccggagccgc 60 
a atg cct aaa gga gga aga aag gga ggc cac aaa ggc egg geg agg cag 109 
Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg Gin 
15 10 15 

tat aca age cct gag gag ate gae gcg cas tgc agg ctg aga age aga 157 
Tyr Thr Ser Pro Glu Glu He Asp Ala Xaa Cys Arg Leu Arg Ser Arg 

20 25 30 

agg cca ggg aag aag agg age aaa aaa ggt gga gat ggg get gca ggt 205 
Arg Pro Gly Lys Lys Arg Ser Lys Lys Gly Gly Asp Gly Ala Ala Gly 

35 40 45 

gae cec aaa aaa aaa aaa aaa a 227 
Asp Pro Lys Lys Lys Lys Lys 
50 55 



<210> 606 



wo 02/083898 
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<211> 228 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 63. .227 



<400> 606 

agaacatcct aatcrwaaaa gttcgaggcg ccgccgccgc cgccgcagcc gccggagccg 60 
ca atg cct aaa gga gga aga aag gga ggc cac aaa ggc egg gcg agg 107 
Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg 
15 10 15 

cag tat aca age ect gag gag ate gae gcg eas tgc agg ctg aga age 155 
Gin Tyr Thr Ser Pro Glu Glu He Asp Ala Xaa Cys Arg Leu Arg Ser 

20 25 30 

aga agg cea ggg aag aag agg age aaa aaa ggt gga gat ggg get gca 203 
Arg Arg Pro Gly Lys Lys Arg Ser Lys Lys Gly Gly Asp Gly Ala Ala 

35 40 45 

ggt gac ccc aaa aaa aaa aaa aaa a 
Gly Asp Pro Lys Lys Lys Lys Lys 
50 55 



228 



<210> 607 
<211> 378 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 119.. 376 



<220> 

<221> misc_feature 
<222> 143,318,332 
<223> n-a, g, c or t 

<400> 607 

gaaagacgga gagaggaaaa gggaeaaggg caaggcaaga gagaggegte gtcgtttgat 60 
ctgctgccgg agtgcgaaga ggaagaggaa gaggaagagg agcgeeggag ttgtaaag 118 

atg tts ctg gtg gac tgg ttc tat ngg gtg ctg tea teg ctt ggg ctg 166 

Met Xaa Leu Val Asp Trp Phe Tyr Xaa Val Leu Ser Ser Leu Gly Leu 
15 10 15 

tgg cag aag gag get aag ate etc ttc ctt ggc etc gae aac gee ggc 214 

Trp Gin Lys Glu Ala Lys He Leu Phe Leu Gly Leu Asp Asn Ala Gly 

20 25 30 

aag ace ace etc etc cae atg ctg aag gac gag egg etc gta cag eac 262 

Lys Thr Thr Leu Leu His Met Leu Lys Asp Glu Arg Leu Val Gin His 

35 40 45 

cag cca acg cag tac ccc acg tea gaa gag ttg age ate ggc agg ate 310 

Gin Pro Thr Gin Tyr Pro Thr Ser Glu Glu Leu Ser He Gly Arg He 

50 55 60 

aag ttc ang gcg ttc gac ctt ngg ggc cac cag ate gcc cgc cgc gtc 358 

Lys Phe Xaa Ala Phe Asp Leu Xaa Gly His Gin He Ala Arg Arg Val 
65 70 75 80 

tgg aag gac tac tmc gee aa 378 
Trp Lys Asp Tyr Xaa Ala 
85 



<210> 608 
<211> 455 
<212> DNA 

<213> Homo sapiens 
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<220> 
<221> CDS 
<222> 63. .455 



<400> 608 

attsccacct gtgatgctca gagagaaacc atgagccctg aaaaacacca tattgttttg 60 
tg atg tea cct ctg gaa tgt tct gag tgt ttt ggt gac caa ctt ctg 107 
Met Ser Pro Leu Glu Cys Ser Glu Cys Phe Gly Asp Gin Leu Leu 
15 10 15 

cat agg acc tat acc tgg caa etc aca ttg cac tea agg cea aat tat 155 
His Arg Thr Tyr Thr Trp Gin Leu Thr Leu His Ser Arg Pro Asn Tyr 

20 25 30 

aca aga aaa aga gat acc aga tct gaa age eta gaa att cea ate agt 203 
Thr Arg Lys Arg Asp Thr Arg Ser Glu Ser Leu Glu lie Pro lie Ser 

35 40 45 

gtg gtt eta cct cag agg ggc aca gcc gaa ecc tte ccg agg etc cac 251 
Val Val Leu Pro Gin Arg Gly Thr Ala Glu Pro Phe Pro Arg Leu His 

50 55 60 

aac ttg tac age acc cct cgc tgc gcg cag cag gcc gcc ctg ccc egg 299 
Asn Leu Tyr Ser Thr Pro Arg Cys Ala Gin Gin Ala Ala Leu Pro Arg 

65 70 75 

ctg age cgc agg atg gcg age cag cac tec tat cea ctg aac cgc tte 347 
Leu Ser Arg Arg Met Ala Ser Gin His Ser Tyr Pro Leu Asn Arg Phe 
80 85 90 95 

tec tec gtg cct tta gac ccc atg gag cgc ccc atg tec cag gee gac 395 
Ser Ser Val Pro Leu Asp Pro Met Glu Arg Pro Met Ser Gin Ala Asp 

100 105 110 

ctg gag ctg gac tac aac ccg ccg egg gtg cag etc age gac gag atg 443 
Leu Glu Leu Asp Tyr Asn Pro Pro Arg Val Gin Leu Ser Asp Glu Met 

115 120 125 

tte ktg tte car 455 
Phe Xaa Phe Gin 
130 



<210> 609 
<211> 366 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 68. .364 



<400> 609 

gtacagaaag attggtggac aggagcagcg gecggtgggm gagggcgctc ggcggcggcc 60 

109 



157 



tgcggec 


atg 


gcc 


acc 


gtg 


atg 


gca 


gcg 


acg 


gcg 


gcg 


gag 


egg 


gcg 


gtg 




Met 


Ala 


Thr 


Val 


Met 


Ala 


Ala 


Thr 


Ala 


Ala 


Glu 


Arg 


Ala 


Val 




1 








5 










10 










ctg gag 


gag 


gag 


tte 


cgc 


tgg 


ctg 


ctg 


cac 


gac 


gak 


ktg 


cac 


get 


gtg 


Leu Glu 


Glu 


Glu 


Phe 


Arg 


Trp 


Leu 


Leu 


His 


Asp 


Xaa 


Xaa 


His 


Ala 


Val 


15 








20 










25 










30 


ttg wwg 


cag 


ctg 


cag 


gac 


ate 


etc 


aag 


gag 


gcc 


tct 


ctg 


cgc 


tte 


act 


Leu Xaa 


Gin 


Leu 


Gin 


Asp 


He 


Leu 


Lys 


Glu 


Ala 


Ser 


Leu 


Arg 


Phe 


Thr 








35 










40 










45 




ctg ccg 


ggc 


tee 


ggc 


act 


gag 


ggg 


ccc 


gee 


aag 


caa 


gag 


aac 


tte 


ate 


Leu Pro 


Gly 


Ser 


Gly 


Thr 


Glu 


Gly 


Pro 


Ala 


Lys 


Gin 


Glu 


Asn 


Phe 


He 






50 










55 










60 






eta ggc 


age 


tgt 


ggc 


aca 


gac 


cak 


gtg 


aag 


ggt 


gtg 


ctg 


act 


ctg 


cak 


Leu Gly 


Ser 


Cys 


Gly 


Thr 


Asp 


Xaa 


Val 


Lys 


Gly 


Val 


Leu 


Thr 


Leu 


Xaa 




65 










70 










75 








ggg gat 


gee 


etc 


age 


cag 


gcg 


gat 


gtg 


aac 


ctg 


aag 


atg 


ccc 


egg 


aac 


Gly Asp 


Ala 


Leu 


Ser 


Gin 


Ala 


Asp 


Val 


Asn 


Leu 


Lys 


Met 


Pro 


Arg 


Asn 



205 



253 



301 



349 
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80 85 . 90 

aac cag ctg ctg cac tt 366 
Asn Gin Leu Leu His 
95 



<210> 610 
<211> 498 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 209. .496 



<400> 610 

atgatgccag tgaatgcgta gtcccagcct cagcacaggg gagccacctt gaagctctca 60 
aatatcactg ttgtgaatac agagagggaa aaccaactgt aacgtgccac ccaaataaag 120 
ctcattacta gtcctgtcca gcaacgtgcc tctcctggcc ctagagttct tggaaatagc 180 
ccaggccaaa gagaaggcct ttctcccc atg gtc age cac acg ttc cac atg 232 

Met Val Ser His Thr Phe His Met 

1 5 



ego 


aca 


gag 


gag 


tet 


gat 


gee 


tea 


cag 


gag gge gat gae 


eta 


ccc 


aag 


280 


Arg 


Thr 


Glu 


Glu 


Ser 


Asp 


Ala 


Ser 


Gin 


Glu Gly Asp Asp Leu 


Pro 


Lys 






10 










15 






20 










tec 


tea 


gca 


aac 


acc 


age 


cat 


ccc 


aag 


cag gat gae age 


ccc 


aag 


tec 


328 


Ser 


Ser 


Ala 


Asn 


Thr 


Ser 


His 


Pro 


Lys 


Gin Asp Asp Ser 


Pro 


Lys 


Ser 




25 










30 








35 






40 




tea 


gaa 


gaa 


acc 


ate 


cag 


ccc 


aag 


gag 


ggt gae ate ccc 


aag 


gcc 


cea 


376 


Ser 


Glu 


Glu 


Thr 


He 


Gin 


Pro 


Lys 


Glu 


Gly Asp He Pro 


Lys 


Ala 


Pro 












45 










50 




55 






gaa 


gaa 


acc 


ate 


caa 


tec 


aag 


aag 


gag 


gae etc ccc aag 


tec 


tea 


gaa 


424 


Glu 


Glu 


Thr 


He 


Gin 


Ser 


Lys 


Lys 


Glu 


Asp Leu Pro Lys 


Ser 


Ser 


Glu 










60 










65 




70 








aaa 


gcc 


ate 


cag 


ccc 


aaa 


gag 


agt 


aac 


ate ccc aag tec 


tea 


gca 


aaa 


472 


Lys 


Ala 


He 


Gin 


Pro 


Lys 


Glu 


Ser 


Asn 


He Pro Lys Ser 


Ser 


Ala 


Lys 








75 










80 




85 










ccc 


ate 


cag 


ccc 


aag 


ctg 


gea 


ata 


tt 










498 


Pro 


lie 


Gin 


Pro 


Lys 


Leu 


Ala 


lie 
















90 










95 

















<210> 611 
<211> 504 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 214. .504 



<400> 611 

aagaeagaca cggaagtgct gggaggegcc gggagcecgt tcggttgcgg gtgtctctgg 
ccctgcggtc agccctggga aegtccegga gagctagatt cetagaggcc egattccget 
agcccggaac agacaaagec agegctcccg cccgctcccc gacttaggat ccgatgccgg 
cagcgtcctg gggcccccgt agcgggctgg acc atg age ctg ctg gae ggc etc 

Met Ser Leu Leu Asp Gly Leu 



get tec teg ccg egg get 
Ala Ser Ser Pro Arg Ala 
10 



ccg ctg cag tec age aag gcc 



5 

agg 



atg aaa 



Pro Leu Gin Ser Ser Lys Ala Arg Met Lys 



15 



20 



aag 

Lys 



etc ccg aag aag age cag aat gag aag tac egg ctg aag tac ctg 
Leu Pro Lys Lys Ser Gin Asn Glu Lys Tyr Arg Leu Lys Tyr Leu 



25 



30 



35 



egg ctg egc aaa gcg gcc aag gcc acg gtg ttt gaa aat get get att 



60 
120 
180 
234 



282 



330 



378 



wo 02/083898 



Arg Leu Arg Lys Ala Ala 

40 45 

tgt gat gaa att get cgt 

Cys Asp Glu He Ala Arg 
60 

gaa aga agg tac ttg eta 

Glu Arg Arg Tyr Leu Leu 
75 

gaa ggg gta agt aca ggc 

Glu Gly Val Ser Thr Gly 
90 



334 

Lys Ala Thr Val Phe Glu 
50 

ctt gag gaa aaa ttt ett 
Leu Glu Glu Lys Phe Leu 
65 

aag aag cte etc eag ett 
Lys Lys Leu Leu Gin Leu 
80 

mtg eag etc ctt 
Xaa Gin Leu Leu 
95 
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Asn Ala Ala He 
55 

aaa gea aaa gaa 426 
Lys Ala Lys Glu 
70 

eag get eta act 474 
Gin Ala Leu Thr 
85 

504 



<210> 612 
<211> 483 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 161.. 481 



<400> 612 

gtgggcggtt cggeggctct gggcgctgtt gtttggtctt taggcctgcg gaggggcgtt 60 
atctggaggg ccgcgggtgc aggccgcagt gacagggccg ctcgccccgc tagtcctgcc 120 
tgtctcccgg tgcagctgtg ttcgcggcct gcaggccaac atg gcg cag gag gtg 175 

Met Ala Gin Glu Val 

1 5 



teg 


gag 


tac 


etg 


age cag 


aae eeg 


egg 


gtg gea 


gee 


tgg gtg gag 


gcg 


223 


Ser 


Glu 


Tyr 


Leu 


Ser Gin Asn Pro Arg Val Ala 


Ala 


Trp Val Glu 


Ala 












10 






15 




20 






etg 


egc 


tgc 


gae 


ggc gag 


act gae 


aaa 


cac tgg 


egc 


cac egc egg 


gat 


271 


Leu 


Arg 


Cys 


Asp 


Gly Glu Thr Asp 


Lys 


His Trp 


Arg 


His Arg Arg 


Asp 










25 






30 






35 






ttt 


ttg 


ctt 


egc 


aae gee 


ggg gae 


etg 


gee cce 


get 


ggc ggc get 


gee 


319 


Phe 


Leu 


Leu 


Arg 


Asn Ala 


Gly Asp 


Leu 


Ala Pro 


Ala 


Gly Gly Ala 


Ala 








40 






45 








50 






tec 


get 


age 


aeg 


gat gaa 


get gee 


gae 


gee gag 


age 


ggg ace cga 


aae 


367 


Ser 


Ala 


Ser 


Thr 


Asp Glu 


Ala Ala Asp Ala Glu 


Ser 


Gly Thr Arg 


Asn 






55 








60 






65 








egg 


cag 


etg 


cag 


cag etc 


ate tee 


ttt 


tec atg 


gee 


tgg gcg aae 


cac 


415 


Arg 


Gin 


Leu 


Gin 


Gin Leu 


He Ser 


Phe 


Ser Met 


Ala 


Trp Ala Asn 


His 




70 








75 






80 






85 




gtc 


ttc 


etc 


ggg 


tgc cga 


tae eet 


eaa 


aaa gtt 


atg 


gat aaa ata 


ett 


4 63 


Val 


Phe 


Leu 


Gly 


Cys Arg 


Tyr Pro 


Gin 


Lys Val 


Met 


Asp Lys He 


Leu 












90 






95 




100 






agt 


atg 


get 


gaa 


ggc ate 


aa 












483 


Ser 


Met 


Ala 


Glu 


Gly He 






















105 



















<210> 613 
<211> 473 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 277.. 471 



<400> 613 

aaaetgtcct tgecaccaea attatttaat ggacecaaca gaaagtaacc ccggaaatta 60 

ggacaectca tcecaaaaga cctttaaata ggggaagtcc acttgtgcac ggctgctcct 120 

tgetatagaa gacetgggac agaggactgc tgtetgccct ctetggtcac cctgcctagc 180 

tagaggatct gtgacceeag eeatgaggae ectegccatc cttgetgcsa ttctectggt 240 



wo 02/083898 PCT/IBO 1/009 14 

335 



ggccctgcag gccagagcca ctccaggcaa gagctg atg 


agg 


ttg 


ctg 


cag 


ccc 


294 












Met 
1 


Arg 


Leu 


Leu 


Gin 

5 


Pro 




egg age aga ttg 


cag 


egg aca 


tec 


cag 


aag tgg 


ttg 


ttt 


esc 


ttg 


cat 


342 


Arg Ser Arg Leu 


Gin Arg Thr Ser Gin 


Lys Trp 


Leu 


Phe 


Xaa 


Leu 


His 




10 








15 








20 








ggc gac gaa age 


ttg 


get eca 


aag 


cat 


cca QQC 

www 


tea 


agg 


aaa 


aae 


atg 


390 


Gly Asp Glu Ser 


Leu 


Ala Pro 


Lys 


His 


Pro Gly 


Ser Arg 


Lys Asn Met 




25 






30 








35 










gac tgc tat tgg 


cag 


aat ace 


age 


gtg 


cat tgc 


agg 


aga 


acg 


teg 


eta 


438 


Asp Cys Tyr Trp 


Gin 


Asn Thr 


Ser 


Val 


His Cys 


Arg Arg 


Thr 


Ser 


Leu 




40 




45 








50 












tgg aae ctg cat 


eta 


cca ggg 


aag 


act 


ctg ggc 


at 










473 


Trp Asn Leu His 


Leu 


Pro Gly 


Lys 


Thr 


Leu Gly 














55 




60 






65 















<210> 614 
<211> 522 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 279.. 521 

<400> 614 

attctyaatt gsgccmagct tccggcctgg ecacccgcea geaggtctea cccggttcgc 60 
ateagtcccg ggcccccggc gcccctcagt agtagcaaat tategcatec etgagcttag 120 
gatccccett tagtttcaca tatgececca cccccgttte eactttgatg gcggcttcca 180 
aggagcatec ettagtaaga acegtcgeaa atategtgat ttggatceae caaggettgg 240 
ggcgageegt egaatgtgtg tggtagggga gcgggetg atg tta gca cgt ggt ctt 296 

Met Leu Ala Arg Gly Leu 
1 5 
kgg ggr aat agg ags eek gtg ttc acg att agt 34 4 

Xaa Gly Asn Arg Xaa Pro Val Phe Thr lie Ser 

15 20 
act tgg aae ctt tac ate tct cca age ctg agt 392 
Thr Trp Asn Leu Tyr lie Ser Pro Ser Leu Ser 

30 35 
aca get gcg aat ace ggt ttc cag aga gta gtt 440 
Thr Ala Ala Asn Thr Gly Phe Gin Arg Val Val 

45 50 
cac ctt ctt cag ctt cct teg acg caa aae aga 488 
His Leu Leu Gin Leu Pro Ser Thr Gin Asn Arg 
60 65 , 70 

aga aga agg gat ggc aga g 522 
Arg Arg Arg Asp Gly Arg 
80 

<210> 615 
<211> 322 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 171.. 320 

<400> 615 

agtggttcgg ccgeggcgac cecactccgg cggcgegtcc cccgagcttg gtacggctea 60 
geccgtctec cccgaagccg cgcgcccgcg ccegcgcccc teagtcggtg gagcccgcag 120 
cceeecttgt ggeccgegge agctccccgc ecgetcggce cgcgccegcc atg gtc 176 

Met Val 



gtt 


arg 


sey tgr 


aat 


Val 


Xaa 


Xaa Xaa 


Asn 






10 




ttc 


cac 


cac eta 


cca 


Phe 


His 


His Leu 


Pro 






25 




ttc 


ctt 


ate tgt 


gag 


Phe 


Leu 


He Cys 


Glu 




40 






aag 


aae 


cct tgt 


gaa 


Lys 


Asn 


Pro Cys 


Glu 


55 








ctt 


ttt 


cat tgg 


agg 


Leu 


Phe 


His Trp 


Arg 








75 



wo 02/083898 
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cgt ccg cgc cgt gcc ccg tac cgc tec 
Arg Pro Arg Arg Ala Pro Tyr Arg Ser 

5 10 
ggt cgc ggc cgy cct ccg egg ccc etc 
Gly Arg Gly Arg Pro Pro Arg Pro Leu 

20 25 
cgc tec tgg cct gcc age ccy ccg agg 
Arg Ser Trp Pro Ala Ser Pro Pro Arg 
35 40 

gg 



336 

1 

ggc gcc ggg ggc ccc ctm ggg 224 
Gly Ala Gly Gly Pro Leu Gly 
15 

gtg gtg cgc gcc gtc cgc teg 272 
Val Val Arg Ala Val Arg Ser 
30 

ccc gca gcc tyc gcg gat ccg 320 
Pro Ala Ala Xaa Ala Asp Pro 
45 50 

322 



<210> 616 
<211> 524 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 269.-523 



<400> 616 

attttatact aatggagaga gagagaaaga catagaagte teactgactt gcacagttta 60 

cttcccactg actagatett ggataaggac ttttttgtct tattcataaa aggcgattgt 120 

gttcaattat gttatgctca gaggtggaat atataatact gacttctatt gcttaaaaaa 180 

ggtgattatg aatggaggaa gaagtctgtg ggacaggaaa tttaaaataa aattttagag 240 

ttagatagat tatgagaaga aaaaatca atg cca agt ttt gtt gac cgc ate 292 

Met Pro Ser Phe Val Asp Arg lie 

1 5 



ttt 


ggt 


ggt 


gaa 


eta 


act 


agt 


atg 


ate 


atg 


tat 


gat 


caa 


tgc 


aga 


act 


340 


Phe 


Gly 


Gly Glu 


Leu 


Thr 


Ser 


Met 


lie 


Met 


Tyr Asp 


Gin 


Cys 


Arg Thr 






10 










15 










20 












gtc 


tec 


ttg 


gtt 


cat 


gaa 


tet 


ttc 


ctt 


gat 


ttg 


tec 


etc 


cca 


gtt 


tta 


388 


Val 


Ser 


Leu 


Val 


His 


Glu 


Ser 


Phe 


Leu Asp 


Leu 


Ser 


Leu 


Pro 


Val 


Leu 




25 










30 










35 










40 




gat 


gat 


cag 


agt 


ggt 


aag 


aaa 


agt 


gta 


aat 


gat 


aaa 


aat 


ctg 


aaa 


aag 


436 


Asp Asp 


Gin 


Ser 


Gly 


Lys 


Lys 


Ser 


Val 


Asn 


Asp 


Lys 


Asn 


Leu 


Lys 


Lys 












45 










50 










55 






aca 


gtg 


gag 


gat 


gaa 


gat 


caa 


gat 


agt 


gag 


gaa 


gaa 


aaa 


gat 


aac 


gac 


484 


Thr 


Val 


Glu Asp 


Glu Asp Gin Asp Ser Glu 


Glu 


Glu 


Lys 


Asp Asn Asp 










60 










65 










70 








agt 


tac 


ata 


aaa 


gag 


aga 


agt 


gat 


att 


cct 


tet 


gga 


aca 


a 






524 


Ser 


Tyr 


lie 


Lys 


Glu Arg 


Ser Asp 


He 


Pro 


Ser Gly Thr 














75 










80 










85 











<210> 617 
<211> 274 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 75. .272 



<400> 617 

atacacttct tggctgtgtg egcteagcag gacgtgggag gctccggctt caagacactc 60 

atccaagagg aagg atg gcc agt ate ttt tet aag ttg eta act ggc cgc 110 

Met Ala Ser He Phe Ser Lys Leu Leu Thr Gly Arg 

15 10 

aat get tet ctg ctg ttt get ace atg ggc acc agt gtc ctg ace acc 158 

Asn Ala Ser Leu Leu Phe Ala Thr Met Gly Thr Ser Val Leu Thr Thr 

15 20 25 

ggg tac ctg ctg aac egg eag aaa gtg tgt gee gag gtc egg gag cag 206 



wo 02/083898 

Gly Tyr Leu Leu Asn Arg Gin Lys Val 

30 35 

cct agg eta ttt cct cca age gea gac 

Pro Arg Leu Phe Pro Pro Ser Ala Asp 

45 50 

aca act gea tgg ccg agt ge 

Thr Thr Ala Trp Pro Ser 
65 
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337 

Cys Ala Glu Val Arg Glu Gin 
40 

tae cea gac ctg cgc aas aca 254 
Tyr Pro Asp Leu Arg Xaa Thr 
55 60 

274 



<210> 618 
<211> 243 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 21. .242 



<400> 618 

agggaagagg ctgtegggaa atg gcg gee gcg gee ggg ctg ggg ctt cag egg 53 

Met Ala Ala Ala Ala Gly Leu Gly Leu Gin Arg 
1 5 10 



gag 


gea 


gea 


gag 


ggg 


aag 


tgg 


tea 


gcg 


tgg 


cga 


atg 


acg 


gaa 


gaa 


act 


Glu 


Ala 


Ala 


Glu 


Gly Lys 


Trp 


Ser 


Ala 


Trp Arg 


Met 


Thr 


Glu 


Glu 


Thr 








15 










20 










25 






cgc 


att 


gte tac 


tgg 


ate 


aag 


gac 


aga 


cag 


etc 


acc 


aac 


egt 


gac 


age 


Arg 


He 


Val 


Tyr 


Trp 


He 


Lys 


Asp Arg 


Gin 


Leu 


Thr 


Asn 


Arg Asp 


Ser 






30 










35 










40 








ace 


ata 


ctg 


gaa 


ctt 


caa 


aaa 


gtt 


ctg 


aaa 


aca 


tgt 


tgg 


ttc 


gtt 


tgt 


Thr 


He 


Leu 


Glu 


Leu 


Gin 


Lys 


Val 


Leu 


Lys 


Thr 


Cys 


Trp 


Phe 


Val 


Cys 




45 










50 










55 










tgg 


agt 


att 


ttt 


caa 


ctt 


aaa 


tte 


ctt 


ttc 


ctt 


ttt 


ttt 


ttt 


ttt 


k 


Trp 


Ser 


lie 


Phe 


Gin 


Leu 


Lys 


Phe 


Leu 


Phe 


Leu 


Phe 


Phe 


Phe 


Phe 




60 










65 










70 













<210> 619 
<211> 499 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 295.. 498 



<400> 619 

aggaagttga teaatcccga ggetgetgag agacggtggc gcgattggga cagtcgecag 60 

ggatggetga gegtgaagat gcagcgggtg tcegggetge tetcetggae gctgageaga 120 

gtcctgtggc tctccggcct ctctgagccg ggagctgccc ggcagccceg gatcatggaa 180 

gagaaagcct agaggtttat gatttgatta gaactatccg ggacccagaa aagcecaata 240 

ctttagaaga actggaagtg gteteggaaa gttgtgtgga agttcaggag ataa atg 297 

Met 



aag 


aag 


aat 


ate 


tgg 


tta 


tta 


tea 


ggt 


tea 


cgc 


caa 


cag 


tac 


etc 


att 


345 


Lys 


Lys Asn 


He 


Trp 


Leu 


Leu 


Ser 


Gly 


Ser 


Arg 


Gin 


Gin 


Tyr 


Leu 


He 










5 










10 










15 








get 


ctt 


tgg 


cga 


etc 


tta 


ttg 


ttg 


gaa 


ate 


tac 


att 


tet 


gaa 


gga 


acc 


393 


Ala 


Leu 


Trp 


Arg 


Leu 


Leu 


Leu 


Leu 


Glu 


He 


Tyr 


He 


Ser 


Glu 


Gly 


Thr 








20 










25 










30 










cac 


tea 


aca 


gaa 


gaa 


gac 


ate 


aat 


aag 


cag 


ata 


aat 


gac 


aaa 


gag 


cga 


441 


His 


Ser 


Thr 


Glu 


Glu Asp 


He 


Asn 


Lys 


Gin 


He 


Asn 


Asp 


Lys 


Glu 


Arg 






35 










40 










45 












gtg 


gea 


get 


gea 


atg 


gaa 


aac 


cec 


aac 


tta 


egg 


gaa 


att 


gtg 


gac 


agt 


489 


Val 


Ala 


Ala 


Ala 


Met 


Glu 


Asn 


Pro 


Asn 


Leu 


Arg 


Glu 


He 


Val 


Asp 


Ser 
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338 

50 55 60 65 

gtg tec ttg a 499 

Val Ser Leu 

<210> 620 
<211> 771 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 613.. 771 

<400> 620 

caggttcagg gcacagaacc caggcttgta ccatggtggt gggagaaaat gaccactggc 60 

caagaggact gctgacctgt gcaccaggct agtacttatg actacaaatt cttactgctt 120 

ctctaatcaa ctctgaggga agagggcatc tgatcattac aaaagggagg gcttataagt 180 

gatctcccaa gaaggcagtg atctgctagt gcctttggct ctgtacctct gctgggcatc 240 

tctccaaggt ctaaggtaac atattaaatg tttttgtcag ctaatgcagg ctcagtgact 300 

ttaagtctgt aagttaccca ggaagaagga ttataggaaa aataactcag taagtttaaa 360 

accaaacaca tttccattta gtgacaggaa tttaagcaag gacctgaagt agaatcaact 420 

gattcacaca gtagtaaata caaagtagaa caatgatctt ggcttcgctg tctggttcag 480 

tggtctgctg gaatgcaata cacaagttaa gtcacactgc agactgtttt ctagctgtgg 540 

ccgctggatg ccacttctag catagtagaa ctatgttagg aggaatggga aaagtgagca 600 

ccacttctca cc atg ttc ccc cct cct get gcc agt etc tgc tec cat gtt 651 
Met Phe Pro Pro Pro Ala Ala Ser Leu Cys Ser His Val 

699 



747 



771 

I Trp Thr Pro 
50 

<210> 621 
<211> 658 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 440.. 658 

<400> 621 

gctttgtaat tggtgcagct ctgagaacac acgcatgcat tccaggcggt gctgctgggg 60 

ctcggcgggg ccgtgtctgg tctgcaaggc gcgcgggctc gtggggggtt ggctggggag 120 

cccacgctgc ctggcgactc gggccaccga atgtgagacc gagtcccttt atgtcaccag 180 

cgcaeacgct gatttgaaec ctgcttcgae gtgtgtgtca tggcttaaaa atagctgcta 240 

atctgtcaac ctgtcttggg cagaaacagc ggcggcgaca gagcaggagc gtcatggccg 300 

tggcgctgtc tgcgcsggcg atccgccttt cggactgagg cccagcgcag cgcttgcaaa 360 

gagcagcagc tacctggcaa ctgaacccat catcaccaca gccaetcctg cagctgccac 420 

ggtttctgcc acctctaag atg tgc cct ggt aac tgg ctt tgg get tct atg 472 

Met Cys Pro Gly Asn Trp Leu Trp Ala Ser Met 

15 10 

act ttt atg gcc cgc ttc tec egg agt age tea agg tct cct gtt ega 520 

Thr Phe Met Ala Arg Phe Ser Arg Ser Ser Ser Arg Ser Pro Val Arg 

15 20 25 

act ega ggg acc ctg gag gag atg cca acc gtt caa cat cct ttc etc 568 

Thr Arg Gly Thr Leu Glu Glu Met Pro Thr Val Gin His Pro Phe Leu 

30 35 40 









1 








5 






10 




gga 


tgc 


age 


aga 


gat 


eac 


cca 


cca 


gtt ggc cca 


gga 


cag acc aat 


agg 


Gly 


Cys 


Ser 


Arg Asp 


His 


Pro Pro Val Gly Pro Gly Gin Thr Asn Arg 




15 










20 






25 






aag 


ggt 


cca 


ate 


act 


eta 


act 


aca 


gcc gaa etc 


acc 


tec aca aca 


gtc 


Lys 


Gly 


Pro 


lie 


Thr 


Leu 


Thr 


Thr 


Ala Glu Leu 


Thr 


Ser Thr Thr 


Val 


30 










35 






40 






45 


tct 


gtg 


get 


eta 


gcc 


tgg 


act 


cct 










Ser 


Val 


Ala 


Leu 


Ala 


Trp 


Thr 


Pro 
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aat gtc ttc gag ttg 
Asn Val Phe Glu Leu 
45 

aac cat gcc gac gag 
Asn His Ala Asp Glu 
60 

<210> 622 
<211> 460 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 32. .460 

<400> 622 

aatttttttg tgacttcttc agaccacttt c atg act tct gga age aaa tgt 52 

Met Thr Ser Gly Ser Lys Cys 

1 5 



cct 


agt 


aca 


gac 


tea 


gga 


aaa 


gaa 


gaa 


tat 


att 


gcc 


acg 


ttc 


aaa 


gga 


100 


Pro 


Ser 


Thr Asp 


Ser 


Gly 


Lys 


Glu 


Glu 


Tyr 


He 


Ala 


Thr 


Phe 


Lys 


Gly 








10 










15 










20 










tct 


gaa 


tac 


ttc 


tgc 


tac 


gac 


ttg 


tct 


caa 


aac 


ccc 


att 


caa 


age 


age 


148 


Ser 


Glu 
25 


Tyr 


Phe 


Cys 


Tyr 


Asp 
30 


Leu 


Ser 


Gin 


Asn 


Pro 
35 


He 


Gin 


Ser 


Ser 




agt 


gat 


gaa 


ata 


act 


ctg 


tea 


ttt 


aaa 


acc 


ctt 


cag 


agg 


aat 


gga 


ctg 


196 


Ser Asp 


Glu 


He 


Thr 


Leu 


Ser 


Phe 


Lys 


Thr 


Leu 


Gin 


Arg 


Asn 


Gly 


Leu 




40 










45 










50 










55 




atg 


ctt 


cac 


act 


ggg 


aaa 


teg 


get 


gat 


tat 


gtc 


aat 


ctt 


gcc 


ctg 


aaa 


244 


Met 


Leu 


His 


Thr 


Gly 
60 


Lys 


Ser 


Ala 


Asp 


Tyr 
65 


Val 


Asn 


Leu 


Ala 


Leu 
70 


Lys 




aat 


gga 


get 


gtc 


tct 


ctg 


gtc 


att 


aat 


ttg 


gga 


tea 


ggg 


gcc 


ttt 


gaa 


292 


Asn Gly 


Ala 


Val 


Ser 


Leu 


Val 


He 


Asn 


Leu 


Gly 


Ser 


Gly 


Ala 


Phe 


Glu 










75 










80 










85 








gca 


eta 


gtg 


gag 


cct 


gtg 


aat 


gga 


aag 


ttt 


aat 


gat 


aat 


gcc 


tgg 


cat 


340 


Ala 


Leu 


Val 
90 


Glu 


Pro 


Val 


Asn 


Gly 
95 


Lys 


Phe 


Asn 


Asp 


Asn 
100 


Ala 


Trp 


His 




gat 


gtg 


aaa 


gtc 


acc 


arg 


aat 


ctg 


cgt 


cag 


gtg 


aca 


ata 


tea 


gtg 


gat 


388 


Asp Val 


Lys 


Val 


Thr 


Xaa 


Asn 


Leu 


Arg 


Gin 


Val 


Thr 


He 


Ser 


Val 


Asp 






105 










110 










115 












ggg 


att 


ctt 


acc 


aca 


acg 


ggc 


tac 


acg 


caa 


gaa 


gat 


tat 


ace 


atg 


ctg 


436 


Gly 


lie 


Leu 


Thr 


Thr 


Thr 


Gly 


Tyr 


Thr 


Gin 


Glu 


Asp 


Tyr 


Thr 


Met 


Leu 




120 










125 










130 










135 




ggg 


tct 


gat 


gac 


ttt 


ttc 


tat 


gtt 


















460 


Gly Ser 


Asp Asp 


Phe 


Phe 


Tyr 


Val 





















140 



<210> 623 
<211> 481 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 145. ,480 

<400> 623 

tttttcacct tcttctcctg 
ctcagctgaa tacaaggcct 
ctctcttggc geaaggaagg 



339 

gag egg etc etc tac aca ggc 
Glu Arg Leu Leu Tyr Thr Gly 

50 55 
gtc tgg cca ggc ttc tat etc 
Val Trp Pro Gly Phe Tyr Leu 
65 70 



PCT/IBOl/00914 

aag aca gcc tgt 616 
Lys Thr Ala Cys 

gga gac 658 
Gly Asp 



gacagecggg gccaccattc cactggcttc attcttgctg 60 
cagtggtttg ctgggctttg gtgagatctt aaaggcaaac 120 
agcc atg cag ttg tat ctg tac teg atg tec 171 

Met Gin Leu Tyr Leu Tyr Ser Met Ser 

1 5 



wo 02/083898 
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340 



aca 


gcc 


ate 


atg 


ggg 


eta 


gat 


gaa cag 


get 


aga 


acc 


etg 


gaa 


agg 


gaa 


219 


Thr 


Ala 


He 


Met 


Gly 


Leu 


Asp 


Glu Gin 


Ala 


Arg 


Thr 


Leu 


Glu 


Arg 


Glu 




10 










15 








20 










25 




gag 


aag 


aaa 


gaa 


aae 


agt 


gga 


gca ate 


aga 


tea 


gac 


aga 


tgt 


gta 


aag 


267 


Glu 


Lys 


Lys 


Glu 


Asn 


Ser 


Gly 


Ala He 


Arg Ser 


Asp 


Arg 


Cys 


Val 


Lys 












30 








35 










40 






gca 


tea 


tte 


eag 


eag 


gga 


etg 


gca aae 


aat 


att 


tec 


ttt 


tat 


eta 


tge 


315 


Ala 


Ser 


Phe 


Gin 
45 


Gin 


Gly 


Leu 


Ala Asn 
50 


Asn 


He 


Ser 


Phe 


Tyr 
55 


Leu 


Cys 




cca 


cct 


gge 


tac 


agt 


eca 


egt 


atg etc 


aga 


tte 


tct 


tct 


gcc 


tct 


gee 


363 


Pro 


Pro 


Gly Tyr 


Ser 


Pro 


Arg 


Met Leu 


Arg 


Phe 


Ser 


Ser 


Ala 


Ser 


Ala 








60 










65 








70 










gcc 


ctg 


aga 


cag 


eaa 


gac 


eaa 


cec ctt 


etc 


etc 


etc 


tte 


etc 


etc 


etc 


411 


Ala 


Leu 
75 


Arg 


Gin 


Gin 


Asp 


Gin 
80 


Pro Leu 


Leu 


Leu 


Leu 
85 


Phe 


Leu 


Leu 


Leu 




age 


eta 


etc 


aae 


gtg 


aag 


agg 


atg aga 


atg 


aag 


acc 


ttt 


atg 


atg 


ace 


459 


Ser 


Leu 


Leu 


Asn 


Val 


Lys 


Arg 


Met Arg 


Met 


Lys 


Thr 


Phe 


Met 


Met 


Thr 




90 










95 








100 










105 




cgc 


ate 


cac 


tta 


atg 


aat 


agt 


a 
















481 


Arg 


He 


His 


Leu 


Met 


Asn 


Ser 





















110 



<210> 624 
<211> 456 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 295. .456 



<400> 624 

aggcggtgct 

gtgttteeea 

tcgtcatttc 

aceagcacag 

cagetgaaac 



gectgtgteg 
gaeegtaacg 
ctatatatgt 
ggccttgact 
ttcacttgag 



tcgtccctgt 
ttecaaaggt 
gttcaatggt 
cetaacaaca 
accttcteat 



ttgttcttct 
agaaacaeag 
ttggaatagt 
atcacacgaa 
tggecateat 



tcaaacctca 
atgacagatt 
tcaeeagtga 
cttggaagag 
egaeatctgg 



ggatttctcc 
tttggtcttt 
agteatctca 
gatccttccc 
attc atg 

Met 

1 



60 
120 
180 
240 
297 



ata 


cag 


aga 


aae 


ttg 


agt 


aat 


gat 


gac 


aat 


age 


agt 


cca 


gcc 


tge 


atg 


345 


He 


Gin 


Arg 


Asn 


Leu 


Ser 


Asn 


Asp Asp Asn 


Ser 


Ser 


Pro 


Ala 


Cys 


Met 










5 










10 










15 








gat 


get 


gac 


atg 


gat 


get 


gte 


tec 


cat 


tct 


tac 


cac 


tgg 


atg 


tea 


aga 


393 


Asp 


Ala 


Asp 


Met 


Asp Ala 


Val 


Ser 


His 


Ser 


Tyr 


His 


Trp 


Met 


Ser 


Arg 








20 










25 










30 










aat 


get 


aga 


tge 


gat 


cea 


gcc 


agt 


etg 


tac 


teg 


act 


etc 


agt 


tec 


ate 


441 


Asn 


Ala 
35 


Arg 


Cys 


Asp 


Pro 


Ala 
40 


Ser 


Leu 


Tyr 


Ser 


Thr 
45 


Leu 


Ser 


Ser 


He 




tte 


tea 


act 


cac 


agt 
























456 


Phe 
50 


Ser 


Thr 


His 


Ser 



























<210> 625 
<211> 551 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 137.. 550 



<400> 625 

aaaaaaaaec agetgagtet ttgtgecagg aagaetgcgt geagaaggtg aetgtetcag 



60 
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341 

tggagctggg tcatctcagg ccttggctcc ttgaactttt ggccgccatg tgcttcccga 120 
agtcctctct gatgac atg aag aag ctg aag gcc cga atg cac cag gcc ata 172 
Met Lys Lys Leu Lys Ala Arg Met His Gin Ala lie 
15 10 



gaa 


aga 


ttt 


tat 


gat 


aaa 


atg 


caa 


aat 


gca 


gaa 


tea 


gga 


cgt 


gga cag 


220 


Glu 


Arg 


Phe 


Tyr 


Asp Lys 


Met 


Gin 


Asn 


Ala 


Glu 


Ser 


Gly 


Arg 


Gly Gin 








15 










20 










25 








gtg 


atg 


teg 


age 


ctg 


gca 


gag 


ctg 


gag 


gac 


gac 


ttc 


aaa 


gag 


ggc tac 


268 


Val 


Met 


Ser 


Ser 


Leu 


Ala 


Glu 


Leu 


Glu 


Asp 


Asp 


Phe 


Lys 


Glu 


Gly Tyr 






30 










35 










40 










ctg 


gag 


aca 


gtg 


gcg 


get 


tat 


tat 


gag 


gag 


cag 


cac 


cca 


gag 


etc act 


316 


Leu 


Glu 


Thr 


Val 


Ala 


Ala 


Tyr 


Tyr 


Glu 


Glu 


Gin 


His 


Pro 


Glu 


Leu Thr 




45 










50 










55 








60 




cct 


eta 


ctt 


gaa 


aaa 


gaa 


aga 


gat 


gga 


tta 


egg 


tgc 


cga 


ggc 


aac aga 


364 


Pro 


Leu 


Leu 


Glu 


Lys 


Glu 


Arg 


Asp 


Gly 


Leu 


Arg 


Cys 


Arg 


Gly 


Asn Arg 












65 










70 










75 




tec 


cct 


gte 


ecg 


gat 


gtt 


gag 


gat 


ccc 


gca 


acc 


gag 


gag 


cct 


ggg gag 


412 


Ser 


Pro 


Val 


Pro 


Asp Val 


Glu 


Asp 


Pro 


Ala 


Thr 


Glu 


Glu 


Pro 


Gly Glu 










80 










85 










90 






age 


ttt 


tgt 


gac 


aag 


gtc 


atg 


aga 


tgg 


ttc 


cag 


gcc 


atg 


ctg 


cag egg 


460 


Ser 


Phe 


Cys Asp 


Lys 


Val 


Met 


Arg 


Trp 


Phe 


Gin 


Ala 


Met 


Leu 


Gin Arg 








95 










100 










105 








ctg 


cag 


acc 


tgg 


tgg 


cac 


ggg 


gtt 


ctg 


gcc 


tgg 


gtg 


aag 


gag 


aag gtg 


508 


Leu 


Gin 


Thr 


Trp 


Trp 


His 


Gly 


Val 


Leu 


Ala 


Trp 


Val 


Lys 


Glu 


Lys Val 






110 










115 










120 










gtg 


gcc 


ctg 


gtc 


cat 


gca 


gtg 


cag 


gcc 


etc 


tgg 


aaa 


cag 


ttc 


c 


551 


Val 


Ala 


Leu 


Val 


His 


Ala 


Val 


Gin 


Ala 


Leu 


Trp 


Lys 


Gin 


Phe 







125 130 135 



<210> 626 
<211> 494 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 283. .492 



<400> 626 

gactttgcga aggcatggcg gggacactgt gaatgtcagc ccagaaggtg atcagagect 60 
gttaattaaa atggaaagaa gacagaaggg aaggtagaca tcaggttctc cctggagact 120 
tttcgttttc atttacgctg cggaaactga cgtttttgce taacacccca tgtaatgtaa 180 
acgtataggc ttgagtacgt gtccggccgc atgtgtagtg aaccctaaag ctttcctaat 240 
tgtagttagc ategtcccta agcggaacga ttttecgtga ac atg att tgt act 294 

Met lie Cys Thr 

1 

ttt eta cga gcc gta cag tat aeg gag aag ctg cac agg tec teg gca 342 

Phe Leu Arg Ala Val Gin Tyr Thr Glu Lys Leu His Arg Ser Ser Ala 

5 10 15 20 

aag cga ttg ctt ttg cca tac ate gtg ctt aac aaa gcg tgc ttg aag 390 

Lys Arg Leu Leu Leu Pro Tyr lie Val Leu Asn Lys Ala Cys Leu Lys 

25 30 35 

act gag ccc agt ttg aga tgt ggg ctt caa tat caa aag aaa aeg ctg 438 
Thr Glu Pro Ser Leu Arg Cys Gly Leu Gin Tyr Gin Lys Lys Thr Leu 

40 45 50 

cga cct aga tgt att ctt gga gtc acc cag aaa acc ate tgg aeg cag 486 
Arg Pro Arg Cys He Leu Gly Val Thr Gin Lys Thr He Trp Thr Gin 

55 60 65 

gga ccg ag 494 
Gly Pro 
70 



<210> 627 



wo 02/083898 



PCT/IBOl/00914 



342 

<211> 585 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 128.. 583 



<400> 627 

attttccggg aagatggcgg cgcacaagtc aggtccggca catgtttccg cggagcccca 60 

gcaatgacgg atgatatcac ctcttcttct ctggtgagag tctgaggata gagacttttt 120 

tctcacc atg aat gtc acc cca gag gtc aag agt cgt ggg atg aag ttt 169 

Met Asn Val Thr Pro Glu Val Lys Ser Arg Gly Met Lys Phe 

15 10 

get gag gag cag ctg eta aag cat gga tgg act caa ggc aaa ggc etc 217 

Ala Glu Glu Gin Leu Leu Lys His Gly Trp Thr Gin Gly Lys Gly Leu 
15 20 25 30 

ggc egg aag gag aat ggt ate act cag get etc agg gtg aca ctg aag 265 

Gly Arg Lys Glu Asn Gly He Thr Gin Ala Leu Arg Val Thr Leu Lys 

35 40 45 

caa gac act cat ggg gta gga cat gac cet gee aag gag ttc aca aac 313 

Gin Asp Thr His Gly Val Gly His Asp Pro Ala Lys Glu Phe Thr Asn 

50 55 60 

cae tgg tgg aat gag etc tte aac aag act geg gee aae ttg gta gtg 361 

His Trp Trp Asn Glu Leu Phe Asn Lys Thr Ala Ala Asn Leu Val Val 

65 70 75 

gaa act ggg cag gat gga gta cag ata agg age ctt tet aag gag acc 409 

Glu Thr Gly Gin Asp Gly Val Gin He Arg Ser Leu Ser Lys Glu Thr 

80 85 90 

aee cgt tat aat cat eec aag cec aae ttg ctg tat cag aag ttt gtg 457 

Thr Arg Tyr Asn His Pro Lys Pro Asn Leu Leu Tyr Gin Lys Phe Val 
95 100 105 110 

aag atg get aca ttg act tea ggt gga gag aag cca aac aaa gac ttg 505 

Lys Met Ala Thr Leu Thr Ser Gly Gly Glu Lys Pro Asn Lys Asp Leu 

115 120 125 

gag age tge agt gat gac gae aae eag ggg tea agt cec caa aga tte 553 

Glu Ser Cys Ser Asp Asp Asp Asn Gin Gly Ser Ser Pro Gin Arg Phe 

130 135 140 

tta ctg atg aga tge tgc tec aag cet gtg ar 585 
Leu Leu Met Arg Cys Cys Ser Lys Pro Val 

145 150 



<210> 628 
<211> 560 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 201. .560 



<400> 628 

ateytawteg aaaagctctt tccttcttcc tcttgttcct cetcctgcct etcttcgett 60 
cgcctgcaaa cgcggtgggg gctgctcggc ggtcaggagc agcaagagac agagcgacat 120 
gagagattgg accgeggget geactggaga atttactggt aggataattc atccctaaag 180 
agattgaagt gagettcaga atg gca aaa gag gag eec eag agt ate tea agg 233 

Met Ala Lys Glu Glu Pro Gin Ser He Ser Arg 
15 10 
gae ttg eag gaa ctg eag aag aag ctg tet ctg ctg ata gae tec tte 281 
Asp Leu Gin Glu Leu Gin Lys Lys Leu Ser Leu Leu He Asp Ser Phe 

15 20 25 

cag aat aac tea aag gtg gtg gcc ttt atg aag tet cca gtg ggt cag 329 
Gin Asn Asn Ser Lys Val Val Ala Phe Met Lys Ser Pro Val Gly Gin 



wo 02/083898 

30 

tac ttg gac age cat ccg 
Tyr Leu Asp Ser His Pro 
45 

gtc atg teg gee gtt cot 
Val Met Ser Ala Val Pro 
60 65 
acc acc ctg get get ctg 
Thr Thr Leu Ala Ala Leu 
80 

ate tct gtg ggt ggc ttc 
He Ser Val Gly Gly Phe 
95 

ttc gta tea etc gee atg 
Phe Val Ser Leu Ala Met 
110 

<210> 629 
<211> 728 

<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 420. .728 

<400> 629 

tttttggagg gctcttagca acggccctgg ttgagccccc tcagccatga gaaaatcaaa 60 

tcaatgtgcc attcttccag gcgcgaggca gcagcggctg cagttcaaca tgaaaggagg 120 

cttcctccct gcctgctaat tacctgctet teecgatcte ategtttctg cctttgeaaa 180 

gtgctactga gaagsgggaa gaaacgtccg ccacccatec cccttgctgc ctgggggttc 240 

agacttgatt agatggctaa eaggggcccg agctatggct taagccgaga ggtgcaggag 300 

aagatcgagc agaagtatga tgcggacctg gagaacaage tggtggactg gatcatcctg 360 

eagtgcgeeg aggacataga gcacccgccc cceggcagge ccatttteag aaatggtta 419 



atg gac 


ggg 


acg 


gtc 


ctg 


tgc 


aag 


ctg 


ata 


aat 


agt 


tta tac eea 


cca 


467 


Met Asp 


Gly 


Thr 


Val 


Leu 


Cys 


Lys 


Leu 


He 


Asn 


Ser 


Leu Tyr Pro 


Pro 




1 






5 










10 






15 






gga caa 


gag 


ccc 


ata 


ccc 


aag 


ate 


tea 


gag 


tea 


aag 


atg get ttt 


aag 


515 


Gly Gin 


Glu 


Pro 
20 


He 


Pro 


Lys 


He 


Ser 
25 


Glu 


Ser 


Lys 


Met Ala Phe 
30 


Lys 




cag atg 


gag 


eaa 


ate 


tec 


cag 


ttc 


eta 


aaa 


get 


geg 


gag acc tat 


ggt 


563 


Gin Met 


Glu 

35 


Gin 


He 


Ser 


Gin 


Phe 
40 


Leu 


Lys 


Ala 


Ala 


Glu Thr Tyr 
45 


Gly 




gtc aga 


acc 


acc 


gac 


ate 


ttt 


cag 


acg 


gtg 


gat 


eta 


tgg gaa ggg 


aag 


611 


Val Arg 


Thr 


Thr 


Asp 


He 


Phe 


Gin 


Thr 


Val 


Asp 


Leu 


Trp Glu Gly Lys 




50 










55 










60 








gac atg 


gea 


get 


gtg 


cag 


agg 


ace 


ctg 


atg 


get 


tta 


ggc age gtt 


gca 


659 


Asp Met 


Ala 


Ala 


Val 


Gin 


Arg 


Thr 


Leu 


Met 


Ala 


Leu Gly Ser Val Ala 




65 








70 








75 






80 




gtc acc 


aag 


gat 


gat 


ggc 


tgc 


tat 


egg 


gga 


gag 


cca 


tec tgg ttt 


eac 


707 


Val Thr 


Lys 


Asp 


Asp 
85 


Gly 


Cys 


Tyr 


Arg 


Gly 
90 


Glu 


Pro 


Ser Trp Phe 
95 


His 




agg aaa 


gee 


cag 


cag 


aat 


egg 
















728 


Arg Lys 


Ala 


Gin 
100 


Gin 


Asn 


Arg 



















<210> 630 
<211> 496 
<212> DNA 

<213> Homo sapiens 



35 




ttt ctg 


gee 


Phe Leu 


Ala 


50 




gtt gga 


ttc 


Val Gly 


Phe 


ctg ggg 


gtc 


Leu Gly 


Val 


tea ctg 


etc 


Ser Leu 


Leu 




100 


teg ggg 


atg 


Ser Gly 


Met 


115 





343 

ttc ace ttg 
Phe Thr Leu 
55 

ttc ctg etc 
Phe Leu Leu 
70 

ata ata ttg 
He He Leu 
85 

tgc ate etc 
Cys He Leu 

atg ata gca 
Met He Ala 



40 

ctg gtg ttc 
Leu Val Phe 

ate gtg gtg 
He Val Val 

gaa gga ttg 
Glu Gly Leu 
90 

tgt ggk ttg 
Cys Gly Leu 
105 

tct 
Ser 
120 



PCT/IBOl/00914 



att 377 
He 

ctt 425 

Leu 

75 

gtc 473 
Val 

age 521 
Ser 

560 



<220> 
<221> CDS 



wo 02/083898 
<222> 261.. 494 
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<400> 630 

ttcgcggtta gggattccat agttcacggg ttacggcaca cacgcgttac gtatatatcg 60 

aggttaatct gcgcaastct tcacttttgt ctgaatgtat tttagcggtg aattactcct 120 

tctggctaag agcttcacat cttcagtgtt aatggtggtg gttttcgcat gtcttgcaca 180 

catttctaga tcttgcactg tcgaaaagtc agctccgaaa ttgctgcaat gataagtttg 240 

ctgaactgca tctctgccca atg cca ctt cct cgc aaa gac aac cca cag tgt 293 

Met Pro Leu Pro Arg Lys Asp Asn Pro Gin Cys 











1 








5 








10 




agt cca 


ctg 


ctg 


cct 


cct 


ttg 


gcc 


tct 


gtt 


ggt 


aag 


aaa 


atg get aag 


341 


Ser Pro 


Leu 


Leu 


Pro 


Pro 


Leu 


Ala 


Ser 


Val 


Gly Lys 


Lys 


Met Ala Lys 








15 










20 










25 




aaa ate 


get 


get 


get 


cct 


eca 


cct 


cca 


cct 


cct 


cct 


cca 


tea gaa egg 


389 


Lys lie 


Ala 
30 


Ala 


Ala 


Pro 


Pro 


Pro 
35 


Pro 


Pro 


Pro 


Pro 


Pro 
40 


Ser Glu Arg 




cga ggc 


aaa 


cge 


gca 


gaa 


cag 


aaa 


ggg 


get 


gtg 


ggg 


cag 


egg eca gat 


437 


Arg Gly 


Lys 


Arg 


Ala 


Glu 


Gin 


Lys 


Gly Ala 


Val 


Gly 


Gin 


Arg Pro Asp 




45 










50 










55 








tgg att 


ttc 


tct 


ccg 


cag 


cce 


caa 


rgg 


atg 


ttt 


eta 


atg 


acc caa gga 


485 


Trp lie 


Phe 


Ser 


Pro 


Gin 


Pro 


Gin 


Xaa 


Met 


Phe 


Leu 


Met 


Thr Gin Gly 




60 








65 










70 






75 




aaw tta 


gtg 


gt 






















496 


Xaa Leu 


Val 



























<210> 631 
<211> 500 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 91,. 498 

<400> 631 

wgaaeatcct aategaaaaa agattctaag aeaagactte taatacggca tetcagttta 60 
ttgttctttg gggtttgttt tcagatcetc atg tea gaa cca ggc aaa etc age 114 

Met Ser Glu Pro Gly Lys Leu Ser 

1 5 



cag 


aaa 


ate 


aaa 


gtt 


tgg 


ctt 


cag 


gag 


tac 


tgg 


aac 


ate 


aca 


gat 


etc 


162 


Gin 


Lys 


He 


Lys 


Val 


Trp 


Leu 


Gin 


Glu 


Tyr 


Trp 


Asn 


He 


Thr 


Asp 


Leu 






10 










15 










20 












gtg 


gee 


att 


tee 


aea 


tte 


atg 


att 


gga 


gea 


att 


ett 


cge 


eta 


cag 


aac 


210 


Val 


Ala 


He 


Ser 


Thr 


Phe 


Met 


He 


Gly 


Ala 


He 


Leu 


Arg 


Leu 


Gin 


Asn 




25 










30 










35 










40 




cag 


cce 


tac 


atg 


ggc 


tat 


ggc 


egg 


gtg 


ate 


tac 


tgt 


gtg 


gat 


ate 


ate 


258 


Gin 


Pro 


Tyr 


Met 


Gly 


Tyr 


Gly 


Arg 


Val 


He 


Tyr 


Cys 


Val 


Asp 


He 


He 












45 










50 










55 






tte 


tgg 


tac 


ate 


cgt 


gtc 


ctg 


gac 


ate 


ttt 


ggt 


gtc 


aac 


aag 


tat 


ctg 


306 


Phe 


Trp 


Tyr 


He 


Arg 


Val 


Leu 


Asp 


He 


Phe 


Gly 


Val 


Asn 


Lys 


Tyr 


Leu 










60 










65 










70 








ggg 


cea 


tac 


gtg 


atg 


atg 


att 


gga 


aag 


atg 


atg 


ate 


gac 


atg 


ctg 


tac 


354 


Gly Pro 


Tyr 


Val 


Met 


Met 


He 


Gly 


Lys 


Met 


Met 


He 


Asp 


Met 


Leu 


Tyr 








75 










80 










85 










ttt 


gtg 


gtc 


ate 


atg 


ctg 


gtc 


gtg 


etc 


atg 


agt 


ttc 


gga 


gta 


gcc 


cgt 


402 


Phe 


Val 


Val 


He 


Met 


Leu 


Val 


Val 


Leu 


Met 


Ser 


Phe 


Gly 


Val 


Ala 


Arg 






90 










95 










100 












caa 


gee 


att 


ctg 


cat 


cea 


gag 


gag 


aag 


cce 


tct 


tgg 


aaa 


ctg 


gcc 


cga 


450 


Gin 


Ala 


He 


Leu 


His 


Pro 


Glu 


Glu 


Lys 


Pro 


Ser 


Trp 


Lys 


Leu 


Ala 


Arg 




105 










110 










115 










120 




aac 


ate 


ttc 


tac 


atg 


cce 


tac 


tgg 


atg 


ate 


tat 


gga 


gag 


gtg 


ttg 


cag 


498 


Asn 


lie 


Phe 


Tyr 


Met 


Pro 


Tyr 


Trp 


Met 


He 


Tyr 


Gly 


Glu 


Val 


Leu 


Gin 





125 130 135 
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<210> 632 
<211> 524 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 170.. 523 

<400> 632 

tttttccggt cttactcacg ttgcggcctt cctcgcgtca cagccgggat gaagccgatc 60 
ctactgcagg gccatgagcg gtccattacg cagattaagt ataaccgcga aggagacctc 120 
ctctttactg tggccaagga ccctatcgtc aatgtatggt actctgtga atg gtg aga 178 

Met Val Arg 
1 

ggc tgg gca cct aca tgg gcc ata ccg gag ctg tgt ggt gtg tgg acg 226 
Gly Trp Ala Pro Thr Trp Ala He Pro Glu Leu Cys Gly Val Trp Thr 

5 10 15 

ctg act ggg aca cca age atg tec tea ctg get eag ctg aca aca get 274 
Leu Thr Gly Thr Pro Ser Met Ser Ser Leu Ala Gin Leu Thr Thr Ala 
20 25 30 35 

gtc gte tet ggg act gtg aaa eag tct gga gag gtg ttg gtg aat gtt 322 
Val Val Ser Gly Thr Val Lys Gin Ser Gly Glu Val Leu Val Asn Val 

40 45 50 

aag gag cac tec egg cag ate aac gac ate eag tta tee agg gae atg 370 
Lys Glu His Ser Arg Gin He Asn Asp He Gin Leu Ser Arg Asp Met 

55 60 65 

ace atg ttt gtg ace gcg tec aag gac aac aca gee aag ctt ttt gae 418 
Thr Met Phe Val Thr Ala Ser Lys Asp Asn Thr Ala Lys Leu Phe Asp 

70 75 80 

tec aca act ctt gaa cat eag aag act tte egg aca gaa cgt cct gte 4 66 

Ser Thr Thr Leu Glu His Gin Lys Thr Phe Arg Thr Glu Arg Pro Val 

85 90 95 

aac tea get gcc etc tec ecc aac tat gac cat gtg gtc ctg ggc ggt 514 
Asn Ser Ala Ala Leu Ser Pro Asn Tyr Asp His Val Val Leu Gly Gly 
100 105 110 115 

ggt cag gaa g 524 
Gly Gin Glu 

<210> 633 

<211> 508 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 141. .506 

<400> 633 

attggcttct ctcccagttt ccaceacact ceccatctgg gtaacctaag caggttactc 60 

gcecctgtga ggaggttctc ttctetgtac aatggaaatg gttatccaea cetatctctc 120 
agggaggctc tgatatcatc atg aga ccc age tgt get cct gga tgg ttt tac 173 

Met Arg Pro Ser Cys Ala Pro Gly Trp Phe Tyr 
15 10 

cac aag tec aat tgc tat ggt tac ttc agg aag ctg agg aac tgg tct 221 

His Lys Ser Asn Cys Tyr Gly Tyr Phe Arg Lys Leu Arg Asn Trp Ser 

15 20 25 

gat gee gag etc gag tgt eag tct tac gga aac gga gcc cac ctg gca 269 

Asp Ala Glu Leu Glu Cys Gin Ser Tyr Gly Asn Gly Ala His Leu Ala 

30 35 40 

tct ate ctg agt tta aag gaa gcc age ace ata gca gag tac ata agt 317 



wo 02/083898 PCT/IBO 1/009 14 

346 



Ser 


lie 


Leu 


Ser 


Leu 


Lys 


Glu 


Ala 


Ser 


Thr 


He 


Ala 


Glu 


Tyr 


He 


Ser 






45 








50 










55 












ggc 


tat 


cag 


aga 


age 


cag 


ccg 


ata 


tgg 


att 


ggc 


ctg 


cac 


gac 


cca 


cag 


365 


Gly Tyr 


Gin 


Arg 


Ser 


Gin 


Pro 


lie 


Trp 


He 


Gly Leu 


His 


Asp 


Pro 


Gin 




60 










65 










70 










75 




aag 


agg 


cag 


cag 


tgg 


cag 


tgg 


att 


gat 


ggg 


gee 


atg 


tat 


ctg 


tac 


aga 


413 


Lys 


Arg 


Gin 


Gin 


Trp Gin 


Trp 


He 


Asp 


Gly 


Ala 


Met 


Tyr 


Leu 


Tyr 


Arg 












80 










85 










90 






tec 


tgg 


tct 


ggc 


aag 


tec 


atg 


ggt 


ggg 


aac 


aag 


cac 


tgt 


get 


gag 


atg 


461 


Ser 


Trp 


Ser 


Gly 


Lys 


Ser 


Met Gly 


Gly 


Asn 


Lys 


His 


Cys 


Ala 


Glu 


Met 










95 










100 










105 








age 


tec 


aat 


aac 


aac 


ttt 


tta 


act 


tgg 


age 


age 


aac 


gaa 


tgc 


aac 


aa 


508 


Ser 


Ser 


Asn 
110 


Asn 


Asn 


Phe 


Leu 


Thr 
115 


Trp 


Ser 


Ser 


Asn 


Glu 
120 


Cys 


Asn 







<210> 634 
<211> 464 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 92. .412 

<220> 

<221> sig_peptide 
<222> 92. .160 
<223> Von Heijne matrix 
score 12 

seq RLLLLSLVSGALG/LG 

<220> 

<221> misc_feature 
<222> 325 

<223> n=a, g, c or t 
<400> 634 

tgctgccttc tctgatggct tctccagcgg cctcagtacc accttagcgg tcttctgcca 60 
tgagctgccc cactaactgg gtgactttgc c atg ctg etc cag tea ggg ctg 112 

Met Leu Leu Gin Ser Gly Leu 
-20 



tec 


ttt 


egg 


egg 


ctg 


ctg 


ctg 


ctg 


age 


etc 


gtg 


tct 


gga 


gee 


ctg 


gga 


160 


Ser 


Phe 
-15 


Arg 


Arg 


Leu 


Leu 


Leu 
-10 


Leu 


Ser 


Leu 


Val 


Ser 
-5 


Gly 


Ala 


Leu 


Gly 




ttg 


ggg 


ggt 


gca 


gtc 


ctg 


ggg 


gtg 


ggg 


etc 


age 


ctg 


ggc 


ect 


gtc 


cec 


208 


Leu Gly 


Gly 


Ala 


Val 


Leu 


Gly 


Val 


Gly 


Leu 


Ser 


Leu 


Gly 


Pro 


Val 


Pro 




1 








5 










10 










15 






etc 


act 


cec 


tgg 


gtg 


ttt 


ggg 


gtc 


act 


get 


ggg 


gtc 


ttc 


etc 


tat 


gtg 


256 


Leu 


Thr 


Pro 


Trp 
20 


Val 


Phe 


Gly 


Val 


Thr 

25 


Ala 


Gly 


Val 


Phe 


Leu 
30 


Tyr 


Val 




gee 


ett 


gtg 


gac 


atg 


eta 


cca 


gee 


ctg 


ett 


cgt 


ect 


ccg 


gag 


cec 


ctg 


304 


Ala 


Leu 


Val 
35 


Asp 


Met 


Leu 


Pro 


Ala 
40 


Leu 


Leu 


Arg 


Pro 


Pro 
45 


Glu 


Pro 


Leu 




ect 


acg 


cec 


cat 


gtg 


etc 


ctn 


cag 


ggg 


ctg 


ggg 


ctg 


ctg 


ctg 


ggg 


ggc 


352 


Pro 


Thr 
50 


Pro 


His 


Val 


Leu 


Leu 
55 


Gin 


Gly 


Leu 


Gly 


Leu 
60 


Leu 


Leu 


Gly 


Gly 




ggc 


etc 


atg 


ett 


gee 


ata 


acc 


ctg 


ctg 


gag 


gag 


egg 


eta 


ctg 


cec 


gtg 


400 


Gly 


Leu 


Met 


Leu 


Ala 


He 


Thr 


Leu 


Leu 


Glu 


Glu 


Arg 


Leu 


Leu 


Pro 


Val 




65 










70 










75 










80 





ace act gag ggc tgatggggcc aattggaaag gggtcgggtt gcccttcett 452 
Thr Thr Glu Gly 

ccccccaacc ac 4 64 
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<210> 635 
<211> 506 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 193. .468 



<220> 

<221> sig_peptide 

<222> 193.. 264 

<223> Von Heijne matrix 

score 11.6999998092651 
seq FVLGLGLTPPTLA/QD 



<400> 635 

acacacactc acacaaggac gcsaacccca cctagatgca aagcaggatt caaaagaaca 60 
tctttgcgtt ttctaccggc tccccatcat cgtactaggg aggaagaagc gggtgagaaa 120 
caaaacttct ttccattgtc ctgcccgttt ctgcggactt gttctgaggc cgaggagcct 180 
gtgttggaag ag atg gtg atg ggc ctg ggc gtt ttg ttg ttg gtc ttc gtg 231 
Met Val Met Gly Leu Gly Val Leu Leu Leu Val Phe Val 
-20 -15 



ctg 


ggt 


ctg 


ggt 


ctg 


acc 


cca 


ccg 


acc 


ctg 


get 


cag 


gat 


aac 


tec 


agg 


279 


Leu Gly 


Leu 


Gly 


Leu 


Thr 


Pro 


Pro 


Thr 


Leu 


Ala 


Gin Asp Asn 


Ser Arg 






-10 










-5 










1 








5 




tac 


aca 


cac 


ttc 


ctg 


acc 


cag 


cac 


tat 


gat 


gee 


aaa 


cca 


cag 


ggc 


egg 


327 


Tyr 


Thr 


His 


Phe 


Leu 


Thr 


Gin 


His 


Tyr 


Asp 


Ala 


Lys 


Pro 


Gin 


Gly Arg 












10 










15 










20 






gat 


gac 


aga 


tac 


tgt 


gaa 


age 


ate 


atg 


agg 


aga 


egg 


ggc 


ctg 


ace 


tea 


375 


Asp Asp 


Arg 


Tyr 


Cys 


Glu 


Ser 


He 


Met 


Arg 


Arg 


Arg 


Gly Leu 


Thr 


Ser 










25 










30 










35 








ccc 


tgc 


aaa 


gac 


ate 


aac 


aca 


ttt 


att 


cat 


ggc 


aac 


aag 


cgc 


asa 


tea 


423 


Pro 


Cys 


Lys 


Asp 


He 


Asn 


Thr 


Phe 


He 


His 


Gly 


Asn 


Lys 


Arg 


Xaa 


Ser 








40 










45 










50 










agg 


cca 


tct 


gtg 


aaa 


aca 


aga 


atg 


gaa 


acc 


etc 


aca 


gag 


aaa 


ace 




468 


Arg 


Pro 


Ser 


Val 


Lys 


Thr 


Arg 


Met 


Glu 


Thr 


Leu 


Thr 


Glu 


Lys 


Thr 








55 










60 










65 













taagaataag caagtcttct ttecaggtea ecaettgc 506 



<210> 636 
<211> 566 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 268.. 507 



<220> 

<221> sig_peptide 

<222> 268. .363 

<223> Von Heijne matrix 

score 8.89999961853027 
seq LGLFFGFLLVIFA/IE 



<400> 636 

actttagctc gectaggatt tgagetgggt gtatgtctgc teccagctca agtecctccg 60 

agtgccagag aggaaggcag ggagaagcgg asaeccctct ttgggccaag gccaaggagg 120 

actgtgettg ccgggattgc tgtccttgee attggactat ggctccgatt cgactctcag 180 

aceaagagea tettcgagca agaaactaat aataataatt ccagcttcta caeaggagtc 240 

tatattctga tcggagccgg cgccctc atg atg ctg gtg ggc ttc ctg ggc tgc 294 

Met Met Leu Val Gly Phe Leu Gly Cys 
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-30 -25 



tgc 


ggg 


get 


gtg 


eag 


gag 


tec 


eag 


tgc 


atg ctg 


gga 


ctg 


ttc 


ttc 


ggc 


342 


Cys 


Gly 


Ala 


Val 


Gin 


Glu 


Ser 


Gin 


Cys 


Met Leu 


Gly 


Leu 


Phe 


Phe 


Gly 










-20 










-15 








-10 








ttc 


etc 


ttg 


gtg 


ata 


ttc 


gee 


att 


gaa 


ata get 


gcg 


gee 


ate 


tgg 


gga 


390 


Phe 


Leu 


Leu 
-5 


Val 


He 


Phe 


Ala 


He 
1 


Glu 


He Ala 


Ala 
5 


Ala 


He 


Trp 


Gly 




tat 


tec 


cac 


aag 


gat 


gag 


gtg 


att 


aag 


gaa gtc 


eag 


gag 


ttt 


tac 


aag 


438 


Tyr 


Ser 


His 


Lys 


Asp 


Glu 


Val 


He 


Lys 


Glu Val 


Gin 


Glu 


Phe 


Tyr 


Lys 




10 










15 








20 










25 




gac 


ace 


tac 


aac 


aag 


ctg 


aaa 


ace 


aag 


gat gag 


cec 


eag 


egg 


gaa 


aeg 


486 


Asp 


Thr 


Tyr 


Asn 


Lys 


Leu 


Lys 


Thr 


Lys 


Asp Glu 


Pro 


Gin 


Arg 


Glu 


Thr 












30 










35 








40 






ctg 


aaa 


sea 


tec 


act 


atg 


cgt 


tgaactgctg tggtttgget gggggcgtgg 


537 


Leu 


Lys 


Xaa 


Ser 


Thr 


Met 


Arg 





















45 

aaeagtttat eteagaeatc tgcceaaga 566 



<210> 637 
<211> 568 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 30.. 527 



<220> 

<221> sig_peptide 

<222> 30. .74 

<223> Von Heijne matrix 

score 8.60000038146973 
seq PLLIICLLPAIEG/KN 



<400> 637 



actggggcac agtaggagga acccagaag 


atg 


ctg 


cct 


etc 


ctg 


ate 


ate 


tgt 


53 


















Met 


Leu 


Pro 


Leu 


Leu 


He 


He 


Cys 




















-15 










-10 








etc 


ctg 


cet 


gee 


att 


gaa 


ggg 


aag 


aac 


tgc 


etc 


ege 


tgc 


tgg 


cca 


gaa 


101 


Leu 


Leu 


Pro 
-5 


Ala 


He 


Glu 


Gly 


Lys 
1 


Asn 


Cys 


Leu 


Arg 
5 


Cys 


Trp 


Pro 


Glu 




ctg 


tct 


gee 


ttg 


ata 


gac 


tat 


gac 


ctg 


eag 


ate 


etc 


tgg 


gtg 


ace 


cca 


149 


Leu 


Ser 


Ala 


Leu 


He 


Asp 


Tyr 


Asp 


Leu 


Gin 


He 


Leu 


Trp 


Val 


Thr 


Pro 




10 










15 










20 










25 




ggg 


cca 


cec 


aca 


gaa 


ctt 


tct 


caa 


aat 


cgt 


gac 


eat 


ttg 


gaa 


gaa 


gaa 


197 


Gly 


Pro 


Pro 


Thr 


Glu 


Leu 


Ser 


Gin 


Asn 


Arg 


Asp 


His 


Leu 


Glu 


Glu 


Glu 












30 










35 










40 






aca 


gee 


aaa 


ttc 


ttc 


act 


caa 


gta 


cac 


caa 


gcc 


att 


aaa 


aeg 


tta 


cga 


245 


Thr 


Ala 


Lys 


Phe 


Phe 


Thr 


Gin 


Val 


His 


Gin 


Ala 


He 


Lys 


Thr 


Leu Arg 










45 










50 










55 








gat 


gat 


aaa 


aca 


gta 


ctt 


ctg 


gaa 


gag 


ate 


tac 


acg 


cac 


aag 


aat 


etc 


293 


Asp Asp 


Lys 


Thr 


Val 


Leu 


Leu 


Glu 


Glu 


He 


Tyr 


Thr 


His 


Lys 


Asn 


Leu 








60 










65 










70 










ttt 


act 


gag 


agg 


ctg 


aat 


aag 


ata 


tct 


gat 


ggg 


ctg 


aag 


gag 


aag 


gac 


341 


Phe 


Thr 


Glu 


Arg 


Leu 


Asn 


Lys 


He 


Ser 


Asp 


Gly 


Leu 


Lys 


Glu 


Lys 


Asp 






75 










80 










85 












ata 


cag 


tec 


aca 


ctg 


aag 


gtc 


ace 


age 


tgt 


get 


gac 


tgc 


agg 


act 


cac 


389 


He 


Gin 


Ser 


Thr 


Leu 


Lys 


Val 


Thr 


Ser 


Cys 


Ala 


Asp 


Cys 


Arg 


Thr 


His 




90 










95 










100 










105 




ttc 


etc 


tec 


tgc 


aat 


gac 


cec 


act 


ttc 


tgc 


cca 


gee 


agg 


aac 


egg 


egg 


4 37 


Phe 


Leu 


Ser 


Cys 


Asn 


Asp 


Pro 


Thr 


Phe 


Cys 


Pro 


Ala 


Arg 


Asn 


Arg Arg 












110 










115 










120 






ace 


tec 


ctg 


tgg 


get 


gtg 


agt 


etc 


age 


agt 


get 


eta 


etc 


ctg 


gee 


ata 


485 
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Thr Ser Leu Trp Ala Val Ser Leu Ser Ser Ala Leu Leu Leu Ala lie 

125 130 135 

get gga gat gtb wet ttt act gge aaa gga aga agg agg cag 527 
Ala Gly Asp Val Xaa Phe Thr Gly Lys Gly Arg Arg Arg Gin 

140 145 150 

taaagcagga acagggcagc ccgcatgtct tccagaagtg a 568 

<210> 638 
<211> 420 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 56. .217 

<220> 

<221> sig_peptide 

<222> 56. .133 

<223> Von Heijne matrix 

seore 8.39999961853027 

seq AAILLLTVPVAEG/VT 



<400> 638 

aaaactgaaa acagaacaaa cacaaacaca aacagagaat caacgaagaa gaaaa atg 58 

Met 



gtg 


aag 


gtg 


atg 


tgg 


gtt tec gtt tta get ctg 


gcg 


gcg 


gcg 


att 


etc 


106 


Val 


Lys 


Val 


Met 


Trp 


Val Ser Val Leu Ala Leu 


Ala 


Ala 


Ala 


He 


Leu 




-25 










-20 -15 










-10 




ctt 


ttg 


acg 


gte 


eeg 


gtg gea gaa ggg gtg acg 


tgc 


teg 


cct 


atg 


cag 


154 


Leu 


Leu 


Thr 


Val 


Pro 
-5 


Val Ala Glu Gly Val Thr 
1 


Cys 


Ser 


Pro 
5 


Met 


Gin 




ctg 


get 


tea 


tgt 


gcg 


gcg gcg atg acg tea tct 


teg 


ccg 


cca 


teg 


gag 


202 


Leu 


Ala 


Ser 


Cys 


Ala 


Ala Ala Met Thr Ser Ser 


Ser 


Pro 


Pro 


Ser 


Glu 








10 






15 




20 










get 


gtt 


gea 


eaa 


age 


tgagagagca gcagccatgc ctttgtgggt acatgaggaa 


257 


Ala 


Val 


Ala 


Gin 


Ser 

















25 

ccctaccetc cgecaatacg ttagcteccc taacgcaagg aaagtctcca acagttgcaa 317 

gattccttcc ccaagctgtt aaggaaatgt taatcatgat taattagtga caagtttcgc 377 

tgattatagt ggttaatgct ggtcttatct tcgteagtac tac 420 



<210> 639 
<211> 526 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 276. .440 

<220> 

<221> sig_j)eptide 

<222> 276.-353 

<223> Von Heijne matrix 

score 8.30000019073486 

seq SVLLLAIVLISWG/CI 

<400> 639 

tatctcacgc gcggcagttc aagaacatca gacacgetea acaaacactt gttgcttgac 60 

tgatacatta tggcgggegg gcaaggcccc gateacgtga taccgcceec caactccggg 120 

cggaagaggc cagcgacagc ttgcaegaag cgagtgggcg ggcgetegtg cccggctaaa 180 

agatggctge tggcgcetgg cagccaccgc ctgggaggtt aetgtaaggc ccgcagetcc 240 



wo 02/083898 PCT/IBO 1/009 14 

350 

cgccagctcc cgcggactgc tgccgcctcc ttacc atg aag cca gta agt cgt 293 

Met Lys Pro Val Ser Arg 
-25 

cgc acg ctg gac tgg att tat tea gtg ttg ctg ctt gcc ate gtt tta 341 

Arg Thr Leu Asp Trp lie Tyr Ser Val Leu Leu Leu Ala He Val Leu 

-20 -15 -10 -5 

ate tec tgg ggc tgc ate ate tat get teg atg gtg tet gea aga cga 389 

He Ser Trp Gly Cys lie He Tyr Ala Ser Met Val Ser Ala Arg Arg 

15 10 
cag eta agg aag aaa tac cca gac aaa ate ttt ggg acg aat gaa aat 437 
Gin Leu Arg Lys Lys Tyr Pro Asp Lys He Phe Gly Thr Asn Glu Asn 

15 20 25 

ttg taactcttct ggatttaatt atctgaaaat acagttcttt ccctcatgct 490 
Leu 

tatgtagata taaaaataaa attcataatg caaagt 526 



<210> 640 
<211> 543 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 152. .454 



<220> 

<221> sig_peptide 

<222> 152. .217 

<223> Von Heijne matrix 

score 7.90000009536743 
seq ILLVALQARLSHS/RQ 



<400> 640 

aggaaattag gacacctcat cecaaaagac ctttaaatag gggaagtcea cttgtgcacg 60 
gctgctcctt gctatagaag acctgggaca gaggactgct gtctgccctc tctggtcacc 120 
ctgcctagct agaggatctg tgaccccagc c atg agg acc etc gcc ate ctt 172 

Met Arg Thr Leu Ala He Leu 
-20 

get gee att etc ctg gtg gcc ctg cag gcc agg ctg age cac tec agg 220 

Ala Ala He Leu Leu Val Ala Leu Gin Ala Arg Leu Ser His Ser Arg 

-15 -10 -5 1 

caa gag ctg atg agg ttg ctg cag cec egg age aga ttg cag egg aca 268 

Gin Glu Leu Met Arg Leu Leu Gin Pro Arg Ser Arg Leu Gin Arg Thr 

5 10 15 

tec cag aag tgg ttg ttt ccc ttg cat ggg acg aaa get tgg etc caa 316 
Ser Gin Lys Trp Leu Phe Pro Leu His Gly Thr Lys Ala Trp Leu Gin 

20 25 30 

age ate cag get caa gga aaa aca tgg cet get att gca gaa tac cag 364 
Ser He Gin Ala Gin Gly Lys Thr Trp Pro Ala He Ala Glu Tyr Gin 

35 40 45 

cgt gca ttg cag gag aac gtc get atg gaa cet gca tet acc agg gaa 412 
Arg Ala Leu Gin Glu Asn Val Ala Met Glu Pro Ala Ser Thr Arg Glu 
50 55 60 65 

gac tet ggg cat tet get get gag ctt gea gaa aaa gaa aaa 454 
Asp Ser Gly His Ser Ala Ala Glu Leu Ala Glu Lys Glu Lys 

70 75 
tgagetcaaa atttgctttg agagctacag ggaattgeta ttactcmtgt accttctgct 514 
caatttcctt tectcatcte aaataaatg 543 



<210> 641 
<211> 336 
<212> DNA 
<213> Homo sapiens 



wo 02/083898 
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351 



<220> 
<221> CDS 
<222> 77. .247 



<220> 

<221> sig_pepticie 

<222> 77. . 139 

<223> Von Heijne matrix 

score 7.80000019073486 
seq SVLIFCLLNLAIS/DR 

<220> 

<221> misc_feature 
<222> 301 

<223> n==a, g, c or t 



<400> 641 

atctgagcta gggaagaatg tgtattctgt tgttggatga agtagtctac agatgtcagt 60 
tatatccagt tgagta atg gtg ttg ttg agt tea aca atg tct gta ctg att 112 
Met Val Leu Leu Ser Ser Thr Met Ser Val Leu lie 
-20 -15 -10 

ttc tgc ctg eta aat ctg gcc att tct gat aga agg gta atg aag tct 160 
Phe Cys Leu Leu Asn Leu Ala lie Ser Asp Arg Arg Val Met Lys Ser 

-5 15 
cca act ata ata gtg att cat eta ttt ttt cct cac agt tct acc agt 208 
Pro Thr He He Val He His Leu Phe Phe Pro His Ser Ser Thr Ser 

10 15 20 

att tgc att gag aat ttg gtc act ctg tta tta ggc aca taagaatcct 257 
He Cys He Glu Asn Leu Val Thr Leu Leu Leu Gly Thr 

25 30 35 

cagctgcatt ttgcattatt tccagagttc ttggttgtaa tcantggata tacaggctgt 317 
ggaagactta ccccatttc 336 



<210> 642 
<211> 507 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 110.. 496 



<220> 

<221> sig_peptide 

<222> 110.. 187 

<223> Von Heijne matrix 

score 6.59999990463257 
seq LFWLASGWTPAFA/YS 



<400> 642 

aagatgcgga ggagggtgac gcactagetc tccagttcgc ccgttcctgg cctgaccccc 60 
accaaggcce ataccgcagt aggctcctcg ggctgcccct cggttgaca atg gtc tec 118 

Met Val Ser 
-25 

agg atg gtc tct acc atg eta tct ggc eta ctg ttt tgg ctg gea tct 166 
Arg Met Val Ser Thr Met Leu Ser Gly Leu Leu Phe Trp Leu Ala Ser 

-20 -15 -10 

gga tgg act cca gca ttt get tac age cec egg ace cct gae egg gtc 214 
Gly Trp Thr Pro Ala Phe Ala Tyr Ser Pro Arg Thr Pro Asp Arg Val 

-5 15 
tea gaa gca gat ate cag agg ctg ctt cat ggt gtt atg gag caa ttg 262 
Ser Glu Ala Asp He Gin Arg Leu Leu His Gly Val Met Glu Gin Leu 
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10 








15 








20 






25 




ggc 


att 


gcc 


agg 


ccc cga 


gtg 


gaa 


tat 


eca get 


cac cag gee 


atg 


aat 


310 


Gly 


lie 


Ala 


Arg 


Pro Arg 


Val 


Glu 


Tyr 


Pro Ala 


His Gin Ala 


Met 


Asn 












30 








35 




40 






ctt 


gtg 


ggc 


ccc 


cag age 


att 


gaa 


ggt 


gga get 


cat gaa gga 


ctt 


cag 


358 


Leu 


Val 


Gly 


Pro 


Gin Ser 


He 


Glu 


Gly Gly Ala His Glu Gly Leu Gin 










45 








50 




55 








cat 


ttg 


ggt 


cct 


ttt ggc 


aac 


ate 


ccc 


aac ate 


gtg gca gag 


ttg 


act 


406 


His 


Leu 


Gly 


Pro 


Phe Gly Asn 


He 


Pro 


Asn He 


Val Ala Glu 


Leu 


Thr 








60 








65 






70 








gga 


gac 


aac 


att 


cct aag 


gac 


ttt 


agt 


gag gat 


cag ggg tac 


cag 


ace 


454 


Gly 


Asp 


Asn 


He 


Pro Lys 


Asp 


Phe 


Ser Glu Asp Gin Gly Tyr Gin Thr 






75 








80 








85 








etc 


caa 


ate 


cct 


gtc Gtg 


ttg 


gaa 


aaa 


cag atg 


atg gat gtc 






496 


Leu 


Gin 


lie 


Pro 


Val Leu 


Leu 


Glu 


Lys 


Gin Met 


Met Asp Val 









90 95 100 

tagaaaacac c 507 

<210> 643 
<211> 466 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 14.. 310 

<220> 

<221> sig_peptide 

<222> 14. .136 

<223> Von Heijne matrix 

score 5.40000009536743 

seq WLCPGLRLPGSLA/GR 

<400> 643 

gtaaacagac aac atg gcg gcc gcg gtg kss gcg gca cct ggg gcc ttg 49 
Met Ala Ala Ala Val Xaa Ala Ala Pro Gly Ala Leu 
-40 -35 -30 

gga tec etg cat get ggc ggc gcc cgc etg gtg gcc get tge agt gcg 97 
Gly Ser Leu His Ala Gly Gly Ala Arg Leu Val Ala Ala Cys Ser Ala 

-25 -20 -15 

tgg etc tgc ccg ggg ttg agg etg ccc ggc teg ttg gca ggc egg cga 145 
Trp Leu Cys Pro Gly Leu Arg Leu Pro Gly Ser Leu Ala Gly Arg Arg 

-10 -5 1 

gcc ggc ccg gcg ate tgg gcc cag ggc tgg gta cct gcg gcc ggg ggt 193 
Ala Gly Pro Ala He Trp Ala Gin Gly Trp Val Pro Ala Ala Gly Gly 

5 10 15 

ccc gcc ccg aaa agg ggc tac age tct gag atg aag acg gag gac gag 241 
Pro Ala Pro Lys Arg Gly Tyr Ser Ser Glu Met Lys Thr Glu Asp Glu 
20 25 30 35 

etg egg gtg egg cac etg gag gag gag aac cga gac aga gac aag aac 289 
Leu Arg Val Arg His Leu Glu Glu Glu Asn Arg Asp Arg Asp Lys Asn 

40 45 50 

atg gtc aaa ggc atg gag atg tgaacatgtg tggtgtattt gggacgtaac 340 
Met Val Lys Gly Met Glu Met 
55 

ataaggaatt gtggtgettg gaataaacag agcttatggc aaaaattcac tcagtaaaaa 400 
tcttataaaa atgctatcaa aagctgtgga tgctttgaaa tetgataaga aagtaeggae 4 60 
cataat 466 



<210> 644 
<211> 502 
<212> DNA 



wo 02/083898 
<213> Homo sapiens 



353 



PCT/IBOl/00914 



<220> 
<221> CDS 
<222> 19. .342 

<220> 

<221> sig_peptide 

<222> 19. .75 

<223> Von Heijne matrix 

score 4.19999980926514 

seq AVAASAASGQAEG/KK 

<400> 644 

gcgcgctgct cttctaag atg get gcc get acc ggt gcg gtg gca gcc teg 51 
Met Ala Ala Ala Thr Gly Ala Val Ala Ala Ser 
-15 -10 
gcc gcc teg ggt cag gcg gaa ggt aaa aag ate acc gat etg egg gtc 99 
Ala Ala Ser Gly Gin Ala Glu Gly Lys Lys lie Thr Asp Leu Arg Val 

-5 15 
ate gat etg aag tee gag etg aag egg egg aae tta gac ate acc gga 147 
lie Asp Leu Lys Ser Glu Leu Lys Arg Arg Asn Leu Asp He Thr Gly 

10 15 20 

gte aag acc gtg etc ate tee cga etc aag eag get att gaa gag gaa 195 
Val Lys Thr Val Leu He Ser Arg Leu Lys Gin Ala He Glu Glu Glu 
25 30 35 40 

gga gge gat cea gat aat att gaa tta act gtt tea act gat act cca 243 
Gly Gly Asp Pro Asp Asn He Glu Leu Thr Val Ser Thr Asp Thr Pro 

45 50 55 

aae aag aaa cca act aaa gge aaa ggt aaa aaa eat gaa gca gat gag 291 
Asn Lys Lys Pro Thr Lys Gly Lys Gly Lys Lys His Glu Ala Asp Glu 

60 65 70 

ttg agt gga gat get tct gtg gaa gat gat get ttt ate aag gta aag 339 
Leu Ser Gly Asp Ala Ser Val Glu Asp Asp Ala Phe He Lys Val Lys 

75 80 85 

tgt taattaetca ggttaagatg cttattttaa gectagtatt ttgcatcatg 392 
Cys 

aattttgtgc tttttgttga aeaatggtca ctgaagtatg taggttaaag akdaagacct 452 
atgtaatatt gacaaegttt aatattgtca atctcattaa ctetcataat 502 

<210> 645 
<211> 493 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 264.. 443 

<220> 

<221> sig_peptide 

<222> 264. .386 

<223> Von Heijne matrix 

score 4.19999980926514 

seq VFWMSFYLSRVNC/TT 

<400> 645 

ttcgtgaaat ttgttggttt tgaggactgt aaaagtgatt tcatactetg aatataaaae 60 

tggataatag ggtaatgttt taaaatttat tatgctatta ttcagaatgc caaagtatta 120 

ttttttttec caaaatcagt ctggaeattt actacttttt agactttttg acgttgaact 180 

ttctgtataa aaattggctg ggttttgagc ttttggtaag aaataaaagc cgattaagca 240 

ctggccgccc cgcgtggtac cca atg ecc gag tea etg tgg cag cat teg cac 293 

Met Pro Glu Ser Leu Trp Gin His Ser His 
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-40 -35 
tgg tgt ggg gag tec ttt caa etc aag gag get ggg ttt ctg ggc ace 341 
Trp Cys Gly Glu Ser Phe Gin Leu Lys Glu Ala Gly Phe Leu Gly Thr 

-30 -25 -20 

ctt gaa gtt ttc tgg atg tct ttt tat ctt tct cgt gtg aac tgc act 389 
Leu Glu Val Phe Trp Met Ser Phe Tyr Leu Ser Arg Val Asn Cys Thr 
-15 -10 -5 1 

aca aaa gag ace age cca ctt ccc aag cca gcc aga cac ctg.ggt ctt 437 
Thr Lys Glu Thr Ser Pro Leu Pro Lys Pro Ala Arg His Leu Gly Leu 

5 10 15 

gag cca taaactggcg tagttaagct ttgcagcttc cagtgtattt tatttattct .4 93 

Glu Pro 

<210> 646 
<211> 540 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 175. .354 

<220> 

<221> sig__peptide 
<222> 175.. 309 
<223> Von Heijne matrix 
score 4 

seq ENFLSLLSKSCSA/DP 
<400> 646 

attttaaaat ggaaattggg gcgaggggtt ggcggctggg cgaagggagc attcaaaagc 60 
ggagaatgtc actttacceg agaaattcac tacgatgggc cattggtttc tgctcggctc 120 
gcaggcgcac tgcacgagtg gaggtgtggc tagtggctgt gatgagataa atcc atg 177 

Met 
-45 



cat 


age 


ctt 


ttc 


att 


geg 


age 


ttg 


aaa 


gtt 


ctt 


ttt tat 


tac agt ttt 


225 


His 


Ser 


Leu 


Phe 


lie 
-40 


Ala 


Ser 


Leu 


Lys 


Val 
-35 


Leu 


Phe Tyr 


Tyr Ser Phe 
-30 




age 


ttt 


agg 


ttt 


aat 


tgg 


ttc 


gac 


tgc 


ctt 


etc 


cac aat 


ttg ggc gag 


273 


Ser 


Phe Arg Phe 


Asn 


Trp 


Phe Asp Cys 


Leu 


Leu 


His Asn 


Leu Gly Glu 










-25 










-20 








-15 




aat 


ttc 


ctt 


age 


ctt 


etc 


age 


aaa 


agt 


tgt 


tct 


geg gac 


ceg tct ggg 


321 


Asn 


Phe 


Leu 
-10 


Ser 


Leu 


Leu 


Ser 


Lys 
-5 


Ser 


Cys 


Ser 


Ala Asp 
1 


Pro Ser Gly 




tea 


act 


ttc 


atg 


agg 


gac 


att 


gag 


aca 


aac 


aaa 


tgaaatatgg gttaaagtac 


374 


Ser 


Thr 


Phe 


Met 


Arg 


Asp 


He 


Glu 


Thr 


Asn 


Lys 








5 










10 










15 









tctgagcage tacaaaaaga agaecagtct atectgetgg agacagtggc cacgtgaaga 434 
aagagctctt geagtatgaa gaecacatgg aaagagaggc cacatggaac caacagtcag 4 94 
catcttggtt tcggacacgt gaagaattca tctcagactg tgtatc 540 

<210> 647 
<211> 505 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 100.. 498 

<220> 

<221> sig_peptide 
<222> 100. .138 



wo 02/083898 
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355 

<223> Von Heijne matrix 

score 3.79999995231628 
seq MAHCVTLVQLSIS/CD 



<400> 647 

attttgtgaa gcggcgaagg aggtggtggc tgcgttgggc tccgggaagc cgttcgggct 60 
ggggctgtcg gccgcggggc ggaggcactc gcgcggggg atg gcc cac tgc gtg 114 

Met Ala His Cys Val 





























"10 






acc 


ttg 


gtt 


eag 


ctg 


tec 


att 


tec 


tgt 


gae cat 


etc 


att 


gae 


aag 


gae 


162 


Thr 


Leu 


Val 


Gin 
-5 


Leu 


Ser 


He 


Ser 


Cys 
1 


Asp His 


Leu 


He 
5 


Asp 


Lys 


Asp 




ate 


ggc 


tec 


aag 


tet 


gae 


cca 


etc 


tgc 


gtc ett 


tta 


cag 


gat 


gtg 


gga 


210 


He 


Gly 


Ser 


Lys 


Ser 


Asp 


Pro 


Leu 


Cys 


Val Leu 


Leu 


Gin 


ASD 


Val 


Glv 






10 










15 








20 












ggg 


ggc 


age 


tgg 


get 


gag 


ett 


ggc 


egg 


act gaa 


egg 


gtg 


egg 


aac 


tgc 


258 


Gly Gly 


Ser Trp 


Ala 


Glu 


Leu 


Gly Arg 


Thr Glu 


Ara 


Val 


Ara 


Asn 


Cvs 




25 










30 








35 










40 




tea 


age 


cct 


gag 


ttc 


tec 


aag 


act 


eta 


cag ett 


gag 


tac 


cgc 


ttt 


gag 


306 


Ser 


Ser 


Pro 


Glu 


Phe 


Ser 


Lys 


Thr 


Leu 


Gin Leu 


Glu 


Tyr 


Arg 


Phe 


Glu 












45 










50 








55 






aca 


gtc 


cag 


aag 


eta 


cge 


ttt 


gga 


ate 


tat gae 


ata 


gae 


aac 


aag 


aeg 


354 


Thr 


Val 


Gin 


Lys 


Leu 


Arg 


Phe 


Gly 


He 


Tyr Asp 


He 


Asp 


Asn 


Lys 


Thr 










60 










65 








70 








cca 


gag 


ctg agg 


gat 


gat 


gae 


ttc 


eta 


ggg ggt 


get 


gag 


tgt 


tec 


eta 


402 


Pro 


Glu 


Leu Arg 


Asp Asp 


Asp 


Phe 


Leu 


Gly Gly 


Ala 


Glu 


Cys 


Ser 


Leu 








75 










80 








85 










gga 


cag 


att 


gtg 


tec 


age 


cag 


gta 


ctg 


act etc 


eee 


ttg 


atg 


ctg 


aag 


450 


Gly 


Gin 


He 


Val 


Ser 


Ser 


Gin 


Val 


Leu 


Thr Leu 


Pro 


Leu 


Met 


Leu 


Lys 






90 










95 








100 












cct 


gga 


aaa 


cct 


get 


ggg 


egg 


ggg 


aca 


tea egg 


tet 


eag 


etc 


agg 


aat 


498 


Pro Gly 


Lys 


Pro 


Ala 


Gly 


Arg 


Gly 


Thr 


Ser Arg 


Ser 


Gin 


Leu 


Arg 


Asn 




105 










110 








115 










120 




taaagga 




























505 



<210> 648 

<211> 472 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 78.. 281 



<220> 

<221> sig peptide 

<222> 78.7218 

<223> Von Heijne matrix 

score 3.70000004768372 
seq PLLSLHSRGGSSS/ES 



<400> 648 

attttgegaa cggcgagcag cggcgcggcg cggagagacs agcggaggtt ttcctggttt 60 
cggaccccag cggccgg atg gtg aaa tec tec ctg cag egg ate etc aat 110 
Met Val Lys Ser Ser Leu Gin Arg He Leu Asn 
-45 -40 
age cae tgc ttc gee aga gag aag gaa ggg gat aaa eee age gee ace 158 
Ser His Cys Phe Ala Arg Glu Lys Glu Gly Asp Lys Pro Ser Ala Thr 

-35 -30 -25 

ate cac gee age cgc acc atg ceg etc eta age ctg cac age cgc ggc 206 
He His Ala Ser Arg Thr Met Pro Leu Leu Ser Leu His Ser Arg Gly 
-20 -15 -10 -5 

ggc age age agt gag agt tec agg gtc tec etc cac tgc tgt agt aac 254 
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Gly Ser Ser Ser Glu Ser Ser Arg Val Ser Leu His Cys Cys Ser Asn 

15 10 
ccg ggt ccg ggg cct egg tgg tgc tec tgatgcccct cacccacccc 301 
Pro Gly Pro Gly Pro Arg Trp Cys Ser 

15 20 
tgaagatccc aggtgggcga gggaatagtc agagggatca caatctttca gctttttggg 361 
ctttgagatt gtgagaccgg ggcatcccct tatccccaag agacccgacg cttgcttcat 421 
ggcctacacg ttcgagagag agtcttcggg agaggaggag gagtagggcc g 472 

<210> 649 
<211> 458 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 173.. 373 

<220> 

<221> sig_peptide 

<222> 173. .265 

<223> Von Heijne matrix 

score 3.70000004768372 

seq ASWALGRWGSCRA/LD 

<400> 649 

agaatctgac attccaggac tccactaggc ctgttttcct ctgtgggaac tagagccaag 60 
gcgagagacc cgtgccagcc ccgaggctcc cggggcccat gggcaggcgg ggctggtaca 120 
gggccccggg gctctctcag cccatctgtc acccctcccc cccaacaccc ag atg tec 178 

Met Ser 
-30 



cgt ctt 


ggc 


agt 


get 


gtc 


ccg 


gae 


tgc 


cac 


att 


aat 


get 


cca 


gaa ccc 


226 


Arg Leu 


Gly 


Ser 


Ala 


Val 


Pro 


Asp 


Cys 


His 


lie 


Asn 


Ala 


Pro 


Glu Pro 










-25 










-20 










-15 




gcc age 


tgg 


gcc 


ctg 


ggg 


egg 


tgg 


ggc 


tec 


tgc 


aga 


gee 


ctg 


gae aaa 


274 


Ala Ser 


Trp 


Ala 


Leu 


Gly 


Arg 


Trp 


Gly 


Ser 


Cys 


Arg Ala 


Leu 


Asp Lys 








-10 










-5 










1 






ggc gag 


gga 


ggg 


agg 


gaa 


ggg 


gat 


cct 


aag 


cac 


ecc 


tee 


etc 


cct ggc 


322 


Gly Glu 


Gly 


Gly 


Arg 


Glu 


Gly 


Asp 


Pro 


Lys 


His 


Pro 


Ser 


Leu 


Pro Gly 




5 










10 










15 










ect agg 


agg 


cag 


aca 


tgc 


ccg 


atg 


aga 


tea 


aca 


ttg 


atg 


aat 


tgt tgg 


370 


Pro Arg 


Arg 


Gin 


Thr 


Cys 


Pro 


Met 


Arg 


Ser 


Thr 


Leu 


Met 


Asn 


Cys Trp 




20 








25 










30 








35 





agt tagagagtga agaggagaga ageeggaaaa tecagggact cetgaagtea 423 
Ser 

tgtgggaaac ctgtcgagga cttcatccag ggagc 458 



<210> 650 
<211> 482 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 8. .250 

<220> 

<221> sig_peptide 
<222> 8. .64 

<223> Von Heijne matrix 
score 3.5 

seq RLLRASVARHVSA/IP 
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<400> 650 



actgaag 


ata 


gcg 


get 


get 


gta 


gga 


egg 


ttg 


etc 


caa ocQ 


teg gtt 


ace 


49 






Met 


Ala 


Ala 


Ala 


Val 


Gly Arg Leu 


Leu 


Arg Ala 


Ser 


Val 


Ala 
















-15 










-10 










caa 


cat 


ato 


agt 


gcc 


att 


cct 


tgg 


ggc 


att 


tct 


gcc act 


gca 


gee 


etc 


97 


Arg 


His 


Val 


Ser 


Ala 


He 


Pro 


TrD 


Glv 


He 


Ser 


Ala Thr 


Ala 


Ala 


Leu 




-5 










1 








5 








10 






aoa 


cct 


act 


gca 


tgt 


gga 


aga 


acg 


age 


ttg 


aca 


aat tta 


ttg 


tgt 


tct 


145 


Arg 


Pro 


Ala 


Ala Cys Gly Arg 


Thr 


Ser 


Leu 


Thr 


Asn Leu 


Leu Cys 


Ser 










15 










20 








25 








ggt 


tec 


agt 


caa 


gca 


aaa 


att 


att 


tct 


tct 


ate 


ctt tgg 


att 


tea 


cet 


193 


Gly 


Ser 


Ser 

30 


Gin 


Ala 


Lys 


lie 


He 
35 


Ser 


Ser 


He 


Leu Trp 
40 


He 


Ser 


Pro 




ttg 


tgt 


gtc 


eta 


cag 


aaa 


ttg 


ttg 


ctt 


tta 


gtg 


aca aag 


eta 


acg 


aat 


241 


Leu 


Cys 
45 


Val 


Leu 


Gin 


Lys 


Leu 
50 


Leu 


Leu 


Leu 


Val 


Thr -Lys 
55 


Leu 


Thr 


Asn 




ttc 


acg 


atg 


tgaactgtga agttgtcgca gtctcagtgg attcccactt 






290 


Phe 
60 


Thr 


Met 





























tagccatctt gcctggataa atacaccaag aaagaatggt ggtttgggcc acatgaacat 350 

cgcactcttg tcagacttaa etaagcagat ttcccgagae tacggtgtgc tgttagaagg 410 

ttctggtett gcactaagag gtetcttcat aattgaeect aatggagtea teaageattt 470 

gagegteaac ga 482 

<210> 651 
<211> 291 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 130.. 288 

<220> 

<221> sig^peptide 
<222> 130. .234 
<223> Von Heijne matrix 
score 3.5 

seq GLVEVVAVGVVRS/DQ 
<400> 651 

agtctctgtc ggagtctgtt tcccagtgca cccaacttgt gatagaaggt atatgttctg 60 
ctgaaaaaga gaaaaaggaa aaaaaagaat aaaagcagag agcagagaag gacagggtct 120 
ggctgccgc atg gag ggg agg age agg age agg aaa gee tgt ctg gag gtt 171 
Met Glu Gly Arg Ser Arg Ser Arg Lys Ala Cys Leu Glu Val 
-35 -30 -25 

get aca gta aac cag gtc ttc gtg ggg ttg gtc gag gtg gtg gca gtg 219 
Ala Thr Val Asn Gin Val Phe Val Gly Leu Val Glu Val Val Ala Val 

-20 -15 -10 

ggg gtg gtg aga agt gat cag att aag aat atg tat tta aaa tac age 267 
Gly Val Val Arg Ser Asp Gin He Lys Asn Met Tyr Leu Lys Tyr Ser 
-5 15 10 

cag cat gat ctg ctg atg gac tag 291 
Gin His Asp Leu Leu Met Asp 
15 

<210> 652 
<211> 766 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 



wo 02/083898 
<222> 166. .333 
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<400> 652 

aaaaggagta gctattagcc aattcggcag ggcccgcttt ttagaagctt gatttcyttt 60 
gaagatgaaa gactagcgga agctctgcct ctttccccag tggcgaggga actcggggcg 120 
attggctggg aactgtatcc acccaaatgt caccgatttc ttcct atg cag gaa atg 177 

Met Gin Glu Met 

1 

age aga ccc ate aat aag aaa ttt etc age etg gee gaa aat ggt tgg 225 

Ser Arg Pro lie Asn Lys Lys Phe Leu Ser Leu Ala Glu Asn Gly Trp 

5 10 15 20 

cce cac gaa gee acg aca act gga ggc aaa gag ggt tgc tea acg ccc 273 

Pro His Glu Ala Thr Thr Thr Gly Gly Lys Glu Gly Cys Ser Thr Pro 

25 30 35 

cgc cte att gga aaa cca aat eag ate tgg gac eta tat age gtg gcg 321 
Arg Leu lie Gly Lys Pro Asn Gin He Trp Asp Leu Tyr Ser Val Ala 

40 45 50 

gag geg ggg cga tgattgtcge gctcgcaecc aetgeagctg egcaeagteg 373 
Glu Ala Gly Arg 
55 

catttctttc cccgcccctg 
ggtttgagcg ctcgccgtct 
gteegetggg aatctagett 
ggcccceaga accgaecaca 
tgagaagaac ceagactccc 
gegacttgtc ttcttctttg 
ctatctgcet gaetacagga 

<210> 653 
<211> 488 
<212> DNA 
<213> Homo sapiens 



agaccctgca gcaccatctg tcatggcgge tgggctgttt 433 

tttggcggea gcggcgacgc gagggctcee ggccgcccgc 4 93 

ctccaggaet gtggtcgeee cgteegctgt ggeggaaagc 553 

eegtgcaaga ggacccagaa cccgaggaeg aaaacttgta 613 

atggttatga caaggacecc gttttggaeg tetggaaeat 673 

gcgtctceat catcctggtc cttggcagca cetttgtggc 733 

tgaaagagtg ggt 766 



<220> 
<221> CDS 
<222> 199.. 357 



<400> 653 

cagtaggata eatagetttt cttccagtga aacaaagtte atatcatcca ttgtttttea 60 
agcaegtgac accagcetca aagtaaatga eatgaceagt ggttgaacag tetaatttte 120 
aaatttaata tagageatat aacttctgat ttgatagtat ttattttaaa aaattatgtt 180 

231 



279 



ttcateatte atttgaaa 


atg 


aaa 


aag 


cec 


caa 


agt 


gag 


aac 


ttt ggg gga 








Met 


Lys 


Lys 


Pro 


Gin 


Ser 


Glu 


Asn 


Phe Gly Gly 








1 








5 








10 


ggg 


cct aga 


aca tgg 


ata 


gat 


etc 


tta 


gtg 


gtc 


ttt 


cca 


aaa gta cat 


Gly 


Pro Arg 


Thr Trp 


He 


Asp 


Leu 


Leu 


Val 


Val 


Phe 


Pro 


Lys Val His 






15 








20 










25 


gta 


ett gaa 


ata ttt 


tea 


tta 


tea 


tac 


tat 


tet 


ttg 


aaa 


aaa aag atg 


Val 


Leu Glu 


He Phe 


Ser 


Leu 


Ser 


Tyr 


Tyr 


Ser 


Leu 


Lys 


Lys Lys Met 




30 








35 










40 




ett 


act gta 


tac ttg 


ttt 


tea 


age 


ate 


etc 


taaaateaaa ggttttgate 


Leu 


Thr Val 


Tyr Leu 


Phe 


Ser 


Ser 


He 


Leu 












45 






50 

















327 



377 



acaatatgea gatttctctt gatagatact taaataggct atttctctcc tcttcttggg 437 
caatgccttg ttttcteetc tgaatatttg eatttgaaag gattgettce t 488 



<210> 654 
<211> 489 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 



wo 02/083898 
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<400> 654 

aacctcacga ggccttacgt gcctagagcg gttcctctcc gtggcgccct accgccttcg 60 
gcgcgaaggt ggctggtgcg taccgtcccg aaccctagga gccgagatcc cacccgcagg 120 
accccaaaac ccaatgatcc tgcagcagcc cttgcagcga ggcccccagg gaggggccag 180 
cgcctcccgc gggccgcctt gggggtgact tggggcctgg acgccagctc csctctscga 240 
ggagctgtgc cc atg age acc aag egg cgc ctg gag gag gag cag gag cct 291 
Met Ser Thr Lys Arg Arg Leu Glu Glu Glu Gin Glu Pro 
15 10 
ctg cgc aag cag ttt ctg tct gag gag aac atg gcc acc cac ttc tct 339 
Leu Arg Lys Gin Phe Leu Ser Glu Glu Asn Met Ala Thr His Phe Ser 

15 20 25 

caa etc age ctg cac aat gae caa ecc eta ctg cag cec cac eat gac 387 
Gin Leu Ser Leu His Asn Asp Gin Pro Leu Leu Gin Pro His His Asp 
30 35 40 45 

ctt etc ecc age cct gcc cec act cag gag cec ttg etc tgagctgctt 4 36 

Leu Leu Pro Ser Pro Ala Pro Thr Gin Glu Pro Leu Leu 

50 55 
ctctggcgct atcctggcag cctcatcctg aggcctccgt ctgctgaggc tgg 4 89 

<210> 655 
<211> 616 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 205. .387 



<400> 655 

acttccgggg gagcggcgcg gcggcgggga ggatctctca ccccatcact cagggtggcg 60 
caatcacgac tcattggctc actgcagcct agacctccca gctggagcaa ttctcctgcc 120 
tcagccttct gagtagctgg gactacagtt ggttctaaag agtggtgagt cagaagagae 180 
gtcaggcage aagcgaettg ggee atg gcc tct gac eta gac ttc tea cct 231 

Met Ala Ser Asp Leu Asp Phe Ser Pro 

1 5 

ccg gag tge ceg age cca ctt tec tgg aga ace tge tac ggt acg gac 279 

Pro Glu Cys Pro Ser Pro Leu Ser Trp Arg Thr Cys Tyr Gly Thr Asp 

10 15 20 25 

tct tec tgg gag cca tct tec age tea tct gtg tge tgg cca tea teg 327 

Ser Ser Trp Glu Pro Ser Ser Ser Ser Ser Val Cys Trp Pro Ser Ser 

30 35 40 

tac cca ttc cca agt cec acg agg egg agg ctg aac cgt ctg age cca 375 
Tyr Pro Phe Pro Ser Pro Thr Arg Arg Arg Leu Asn Arg Leu Ser Pro 

45 50 55 

gaa gtg ctg agg tgacgaggaa geccaagget gctgttectt ctgtgaacaa 427 
Glu Val Leu Arg 
60 

gaggcecaag aaagagacta agaagaagcg gtagaagagg aggcctgagg actgggcgkg 487 
cagggagagg gtcttkgggg acagcetcct gggaatctac attgtgtttc ecccgeatty 547 
ceaggcteag gttctgarga rgctgtgacg cectatgace gcagagatct agacagtegt 607 
aacagtccc 616 



<210> 656 
<211> 508 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 37. .219 
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360 

<400> 656 



ctttcccctg ccggtgtcct gctcgccgtc cccgcc 


atg 


etg 


tct 


eta 


gae 


ttt 


54 
















Met 
1 


Leu 


Ser 


Leu 


Asp 
5 


Phe 




ttg 


gac 


gat 


gtg 


egg 


egg 


atg aae aag egg 


eag 


etc 


tat 


tat 


caa 


gtc 


102 


Leu 


Asp 


Asp 


Val 


Arg 


Arg 


Met Asn Lys Arg 


Gin 


Leu 


Tyr 


Tyr 


Gin 


Val 










10 






15 








20 








eta 


aat 


ttt 


gga 


atg 


att 


gtc tea teg gea 


eta 


atg 


ate 


tgg 


aag 


ggg 


150 


Leu 


Asn 


Phe 


Gly 


Met 


He 


Val Ser Ser Ala 


Leu 


Met 


He 


Trp 


Lys 


Gly 








25 








30 






35 










tta 


atg 


gta 


ata 


act 


gga 


agt gaa agt eeg 


att 


gta 


gtg 


gte 


tea 


ggc 


198 


Leu 


Met 


Val 


lie 


Thr 


Gly 


Ser Glu Ser Pro 


He 


Val 


Val 


Val 


Ser 


Gly 





40 45 50 

aaa atg ggc ata tea agt ttt tgaccaaagg agataataat gcggttgatg 249 
Lys Met Gly He Ser Ser Phe 
55 60 



accgaggcct ctataaacaa ggacaacatt ggctagagaa aaaagatgtt gtggggagag 309 

ecaggggatt tgttccttat attggaattg tgacgatect eatgaatgac tatectaaat 369 

ttaagtatge agttetcttt ttgctgggtt tattcgtgct ggttcatcgt gagtaagaag 429 

cetgccttgc tgttectggg aagatgccat agttttcgtt actggatgtt tggagtagat 489 

actggtctgt gattggtgg 508 



<210> 657 
<211> 500 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 114. .452 



<400> 657 

ttttcccctg ccggtgtcct gctcgccgtc cccgccatgc tgtctctaga ctttttggac 60 
gatgtgcggc ggatgaacaa gcggcastct attatcaagt cctaaatttt gga atg 116 

Met 
1 



att 


gtc 


tea 


teg 


gca 


eta 


atg 


ate 


tgg 


aag 


ggg 


tta 


atg 


gta 


ata 


act 


164 


He 


Val 


Ser 


Ser 


Ala 


Leu 


Met 


He 


Trp 


Lys 


Gly 


Leu 


Met 


Val 


He 


Thr 










5 










10 










15 








gga 


agt 


gaa 


agt 


ccg 


att 


gta 


gtg 


gtg 


etc 


agt 


ggc 


age 


atg 


gaa 


cct 


212 


Gly Ser 


Glu 


Ser 


Pro 


He 


Val 


Val 


Val 


Leu 


Ser 


Gly 


Ser 


Met 


Glu 


Pro 








20 










25 










30 










gca 


ttt 


cat 


aga 


gga 


gat 


ctt 


etc 


ttt 


eta 


aca 


aat 


cga 


gtt 


gaa 


gat 


260 


Ala 


Phe 


His 


Arg 


Gly 


Asp 


Leu 


Leu 


Phe 


Leu 


Thr 


Asn 


Arg 


Val 


Glu 


Asp 






35 










40 










45 












eee 


ata 


cga 


gtg 


gga 


gaa 


att 


gtt 


gtt 


ttt 


agg 


ata 


gaa 


gga 


aga 


gag 


308 


Pro 


He 


Arg 


Val 


Gly 


Glu 


He 


Val 


Val 


Phe 


Arg 


He 


Glu 


Gly Arg 


Glu 




50 










55 










60 










65 




att 


cct 


ata 


gtt 


cac 


cga 


gte 


ttg 


aag 


att 


cat 


gaa 


aaa 


ttt 


gtt 


cct 


356 


He 


Pro 


He 


Val 


His 


Arg 


Val 


Leu 


Lys 


He 


His 


Glu 


Lys 


Phe 


Val 


Pro 












70 










75 










80 






tat 


att 


gga 


att 


gtg 


acg 


ate 


etc 


atg 


aat 


gac 


tat 


eet 


aaa 


ttt 


aag 


404 


Tyr 


He 


Gly 


He 


Val 


Thr 


He 


Leu 


Met 


Asn 


Asp 


Tyr 


Pro 


Lys 


Phe 


Lys 










85 










90 










95 








tat 


gca 


gtt 


etc 


ttt 


ttg 


ctg 


ggt 


tta 


ttc 


gtg 


ctg 


gtt 


cat 


cgt 


gag 


452 


Tyr 


Ala 


Val 


Leu 


Phe 


Leu 


Leu 


Gly 


Leu 


Phe 


Val 


Leu 


Val 


His 


Arg 


Glu 





100 105 110 



taagaagcct geettgetgt teetggggaa gatgecatag ttttcgtt 500 



<210> 658 
<211> 411 
<212> DNA 
<213> Homo sapiens 



wo 02/083898 



361 



PCT/IBOl/00914 



<220> 
<221> CDS 
<222> 90. .305 

<220> 

<221> misc_feature 

<222> 138,315,326,343,360,377 

<223> n=a, g, c or t 

<400> 658 

ttgggcagaa aaattcaagc aggagattgt atttcttttg gagttgtacg atttccttta 60 



ttatttgaac tgcagtaaag aaagctggg 


atg 


gge 


tec 


tct 


agg 


gat 


act 


tec 


113 














Met 
1 


Gly 


Ser 


Ser 


Arg 

5 


Asp 


Thr 


Ser 




aga 


tec 


ctg gge 


ggt tgt 


age 


cet 


ngc 


tec 


tet 


tta 


aat 


gga 


ttt 


ggt 


161 


Arg 


Ser 


Leu Gly 


Gly Cys 


Ser 


Pro 


Xaa 


Ser 


Ser 


Leu 


Asn 


Gly 


Phe 


Gly 






10 






15 










20 












ttc 


aaa 


gac gat 


cat etc 


egt 


ctt 


etc 


gga 


tgt 


cat 


agt 


gee 


act 


gat 


209 


Phe 


Lys 


Asp Asp 


His Leu 


Arg 


Leu 


Leu 


Gly 


Cys 


His 


Ser 


Ala 


Thr 


Asp 




25 






30 










35 










40 




cat 


etc 


cag etc 


ctk gee 


ace 


ctg 


gge 


ttt 


etc 


cac 


ttt 


tge 


etc 


tat 


257 


His 


Leu 


Gin Leu 


Leu Ala 


Thr 


Leu 


Gly 


Phe 


Leu 


His 


Phe 


Cys 


Leu 


Tyr 










45 








50 










55 






gtt 


ttg 


ctt etc 


cam egt 


ctt 


age 


cac 


gat 


ate 


tac 


etc 


tct 


gtc 


atg 


305 


Val 


Leu 


Leu Leu 


Xaa Arg 


Leu 


Ser 


His 


Asp 


He 


Tyr 


Leu 


Ser 


Val 


Met 








60 








65 










70 









tgatgtgaen ettgtttttt naaeeaagag ttgecctnaa gstecettaa aaaanagetg 365 
atcttaetgg snttettttg taaaretcet gtgatagatg cagatt 411 



<210> 659 
<211> 529 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 235.. 444 

<400> 659 

atttccagcg gttgctggtt etgacgggtt gtagtctgcc aggacaatga gttatgacta 60 
ccateagaae tggggecgtg atgggggtee ecgcagctcc ggtgggggct atggaggggg 120 
gecagcaggg ggteatggag gtaaccgagg etccggagga greggeggeg gcggaggggg 180 
tggtegagge ggeaggggee ggcateeegg gcacetgaaa gecgcgaaat egge atg 237 

Met 
1 



tgg 


tac 


geg 


aaa 


aaa 


cag 


ggg 


cag 


aag 


aac 


aag 


gaa 


gcg 


gag 


agg 


caa 


285 


Trp Tyr 


Ala 


Lys 


Lys 


Gin 


Gly 


Gin 


Lys 


Asn 


Lys 


Glu 


Ala 


Glu 


Arg 


Gin 










5 










10 










15 








gag 


aga 


get 


gta 


gta 


cac 


atg 


gat 


gaa 


ega 


ega 


gaa 


gaa 


caa 


att 


gta 


333 


Glu 


Arg 


Ala 
20 


Val 


Val 


His 


Met 


Asp 
25 


Glu 


Arg 


Arg 


Glu 


Glu 
30 


Gin 


He 


Val 




cag 


tta 


ctg 


aat 


tet 


gtt 


caa 


gcg 


aag 


aat 


gat 


aaa 


gag 


tea 


gaa 


gca 


381 


Gin 


Leu 

35 


Leu 


Asn 


Ser 


Val 


Gin 
40 


Ala 


Lys 


Asn 


Asp 


Lys 
45 


Glu 


Ser 


Glu 


Ala 




cag 


ata 


tee 


ace 


cca 


gac 


ttg 


get 


gtc 


egc 


tct 


gcg 


cet 


teg 


gge 


cag 


429 


Gin 


He 


Ser 


Thr 


Pro 


Asp 


Leu 


Ala 


Val 


Arg 


Ser 


Ala 


Pro 


Ser 


Gly 


Gin 




50 










55 










60 










65 





cec ttt egg get tge tgagcctgga ggcegakgag aatgcaette cgggttttge 4 84 

Pro Phe Arg Ala Cys 
70 

tgaggctctg aggagctace aggaggctgc ggctgcaggc acctt 529 
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<210> 660 
<211> 478 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 128., 286 

<400> 660 

gttctttggc cctgtgacac gtagcaacgg ggctggttca gggtctgaaa cagagtttgg 60 
gggttgtttg ggattagtga agctactgcc tttgccgcca gcgcagctca gagtttgatt 120 



atttgca 


atg 


tea ggc 


ttt 


gaa 


aac 


tta 


aae 


aeg 


gat tte tac eag aca 


169 




Met 


Ser Gly 


Phe 


Glu 


Asn 


Leu 


Asn 


Thr 


Asp Phe Tyr Gin Thr 






1 






5 










10 




agt tac 


age 


ate gat 


gat 


eag 


tea 


eag 


eag 


tee 


tat gat tat gga gga 


217 


Ser Tyr 


Ser 


lie Asp 


Asp 


Gin 


Ser 


Gin 


Gin 


Ser 


Tyr Asp Tyr Gly Gly 




15 






20 










25 


30 




agt gga 


gga 


ccc tat 


aac 


agt 


atg 


ctg 


get 


atg 


act att cgc age aag 


265 


Ser Gly 


Gly 


Pro Tyr 
35 


Asn 


Ser 


Met 


Leu 


Ala 
40 


Met 


Thr lie Arg Ser Lys 
45 




gca gat 


ttg 


tec etc 


eag 


aea 


tgatgcagcc acaacagcca tacaccgggc 


316 


Ala Asp 


Leu 


Ser Leu 
50 


Gin 


Thr 















agatttacca gccaactcag gcatatactc cagctteacc teagccttte tatggaaaca 376 
actttgagga tgagccacct ttattagaag agttaggtat eaattttgac cacatetggc 4 36 
aaaaaacact aacagtatta catccgttaa aagtageaga tg 4 78 



<210> 661 
<211> 456 
<212> DNA 
<213> HQmo sapiens 

<220> 
<221> CDS 
<222> 110.. 277 

<400> 661 

tacggtttta ttattatatc gatttatttg tgaggaggga atctgttcat gtagatttta 60 
aaagtgcaag agtgtccagc gtcatctgcc tctcctgtcc cctgaccac atg att ttc 118 

Met He Phe 
1 



tgt 


tec 


cat 


aaa 


eta 


gca 


ctg 


ttg 


agt 


ata 


ttt 


cea gag 


cat 


act 


cea 


166 


Cys 


Ser 
5 


His 


Lys 


Leu 


Ala 


Leu 
10 


Leu 


Ser 


He 


Phe 


Pro Glu 
15 


His 


Thr 


Pro 




ggc 


gtg 


tgt 


aaa 


tac 


age 


ctt 


eca 


etc 


ctg 


cac 


aca cac 


cgt 


gtt 


ttt 


214 


Gly Val 


Cys 


Lys 


Tyr 


Ser 


Leu 


Pro 


Leu 


Leu 


His 


Thr His 


Arg 


Val 


Phe 




20 










25 










30 








35 




aca 


tea 


gta 


aca 


ttc 


tge 


gca 


tat 


tgt 


tea 


gca 


cea tgc 


ctt 


ttc 


cac 


262 


Thr 


Ser 


Val 


Thr 


Phe 
40 


Cys 


Ala 


Tyr 


Cys 


Ser 
45 


Ala 


Pro Cys 


Leu 


Phe 
50 


His 




tta 


aca 


gta 


ata 


att 


taaaggtcat ' 


tccatttcag tacataaaga gcgtcctcag 


317 


Leu 


Thr 


Val 


He 
55 


He 

























gccgggcgcg gtggetcgca cetgtaatec tagcactttg ggaggctgag geaggtggat 377 
cacctgaggt tggaagttga agaecagcet ggceaaaagg gegaaaccet gtetctaetg 4 37 
aaaatagaaa aattagctc 456 



<210> 662 
<211> 506 
<212> DNA 
<213> Homo sapiens 
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363 

<220> 
<221> CDS 
<222> 71.. 379 



<400> 662 

acagctgccc gggactccag tgatcgccgc ggctcgctcg cgccccggaa actgcccctt 60 
ctcgggggtc atg atg ggc age aag atg gcg tct get agt agg gtc gtt 109 

Met Met Gly Ser Lys Met Ala Ser Ala Ser Arg Val Val 

15 10 



cag 


gta 


gtc 


aaa 


cca 


cac 


act 


cca 


tta 


ata 


agg 


ttt 


cct 


gac 


aga 


aga 


157 


Gin 


Val 
15 


Val 


Lys 


Pro 


His 


Thr 
20 


Pro 


Leu 


He 


Arg 


Phe 
25 


Pro 


Asp 


Arg 


Arg 




gac 


aat 


cct 


aaa 


ccc 


aat 


gta 


tea 


gaa 


get 


ttg 


aga 


tea 


gca 


ggg 


eta 


205 


Asp Asn 


Pro 


Lys 


Pro 


Asn 


Val 


Ser 


Glu 


Ala 


Leu 


Arg 


Ser 


Ala 


Gly 


Leu 




30 










35 










40 










45 




cca 


tct 


cac 


tct 


tct 


gta 


att 


tea 


caa 


eat 


tct 


aaa 


gga 


agt 


aaa 


tea 


253 


Pro 


Ser 


His 


Ser 


Ser 
50 


Val 


He 


Ser 


Gin 


His 
55 


Ser 


Lys 


Gly 


Ser 


Lys 
60 


Ser 




cca 


gat 


ttg 


ctg 


atg 


tat 


cag 


ggt 


cca 


cca 


gac 


act 


gca 


gaa 


ata 


ata 


301 


Pro Asp 


Leu 


Leu 


Met 


Tyr 


Gin 


Gly 


Pro 


Pro 


Asp 


Thr 


Ala 


Glu 


He 


He 










65 










70 










75 








aaa 


aca 


tta 


cct 


cag 


aaa 


tac 


aga 


agg 


aaa 


ett 


gtg 


tct 


caa 


gaa 


gaa 


349 


Lys 


Thr 


Leu 
80 


Pro 


Gin 


Lys 


Tyr 


Arg 
85 


Arg 


Lys 


Leu 


Val 


Ser 
90 


Gin 


Glu 


Glu 




atg 


gaa 


ttt 


ate 


caa 


cgt 


gga 


ggt 


cct 


gaa 


taaccatggt ggctgctgtt 


399 


Met 


Glu 


Phe 


He 


Gin 


Arg 


Gly 


Gly 


Pro 


Glu 

















95 100 



tgtcatcaga caatagaatt gtctttacaa taaaggactt ccaaaatgac agatgagaaa 459 
ctgtatatta aacaccttta ataaatatta tgaaaaaaam haawaaa 506 



<210> 663 
<211> 595 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 207., 578 



<220> 

<221> misc feature 
<222> 19 

<223> n=a, g, c or t 



<400> 663 

atctcgcgtc eccaaeggne ccccgggtcg gtttecgegg tggeeatgac tgcggccgtg 60 

ttcttcggct gcgcttcatt gccttcgggc ctgcgctcgc cctttatgtc ttcaccatcg 120 

ccaycgagcc gttgcgtatc atcttcctca tcgccggagc tttcttctgg ttggtgtctc 180 

tactgatttc gtcccttgtt tggttc atg gca aga gtc att att gac aac aaa 233 

Met Ala Arg Val He He Asp Asn Lys 



gat 


gga 


cca 


aca 


cag 


aaa 


tat 


1 

ctg 


ctg 


ate 


ttt 


5 

gga 


gcg 


ttt 


gtc 


tct 


281 


Asp Gly 


Pro 


Thr 


Gin 


Lys 


Tyr 


Leu 


Leu 


He 


Phe 


Gly 


Ala 


Phe 


Val 


Ser 




10 










15 










20 










25 




gtc 


tat 


ate 


caa 


gaa 


atg 


ttc 


cga 


ttt 


gca 


tat 


tat 


aaa 


etc 


tta 


aaa 


329 


Val 


Tyr 


He 


Gin 


Glu 


Met 


Phe 


Arg 


Phe 


Ala 


Tyr 


Tyr 


Lys 


Leu 


Leu 


Lys 












30 










35 










40 






aaa 


gee 


agt 


gaa 


ggt 


ttg 


aag 


agt 


ata 


aac 


cca 


ggt 


gag 


aca 


gca 


ccc 


377 


Lys 


Ala 


Ser 


Glu 


Gly 


Leu 


Lys 


Ser 


He 


Asn 


Pro 


Gly 


Glu 


Thr 


Ala 


Pro 










45 










50 










55 








tct 


atg 


cga 


Ctg 


ctg 


gee 


tat 


gtt 


tct 


ggc 


ttg 


ggc 


ttt 


gga 


ate 


atg 


425 


Ser 


Met 


Arg 


Leu 


Leu 


Ala 


Tyr 


Val 


Ser 


Gly 


Leu 


Gly 


Phe 


Gly He 


Met 





60 65 70 
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aqt qqa 


qta 


ttt tec 


ttt gtg aat 


ace 


Ser Gly 


Val 


Phe Ser 


Phe Val Asn 


Thr 


75 






80 




ggc aca 


cftg 

TS ^ 


ggc att 


cat gga gat 


tct 


Gly Thr 


Val 


Gly He 


His Gly Asp Ser 


90 






95 




age ttt 


cat 


gac get 


ggt cat tat 


ett 


Ser Phe 


His 


Asp Ala 


Gly His Tyr 


Leu 






110 






tgt att 


ttt 


tgatggctgt gagaaga 




Cys He 


Phe 









364 

eta tct gac tec ttg ggg cea 473 
Leu Ser Asp Ser Leu Gly Pro 
85 

cct caa ttc ttc ctt tta ttc 521 
Pro Gin Phe Phe Leu Leu Phe 
100 105 
get gca tgt att ctg ggg cat 569 
Ala Ala Cys lie Leu Gly His 
115 120 

595 



<210> 664 
<211> 487 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 41. .226 



<400> 664 

gatacggttc ctccaccgag gcccatgcga agtttccact atg get tec age act 55 

Met Ala Ser Ser Thr 
1 5 

gtc ccg gtg age get get ggc teg get aat gaa act ecc gaa ata ceg 103 

Val Pro Val Ser Ala Ala Gly Ser Ala Asn Glu Thr Pro Glu He Pro 

10 15 20 

gac aac gtg gga gat tgg ctt egg ggc gtc tac cct ttg cca ctg ata 151 

Asp Asn Val Gly Asp Trp Leu Arg Gly Val Tyr Pro Leu Pro Leu He 

25 30 35 

gga atg act tee gga gaa ctt gat act aaa ttt ggg act ett tge tge 199 

Gly Met Thr Ser Gly Glu Leu Asp Thr Lys Phe Gly Thr Leu Cys Cys 

40 45 50 

ggg agt ttg get ggc cag gaa ctt gag tgacattgae ctcatggcac 246 
Gly Ser Leu Ala Gly Gin Glu Leu Glu 

55 60 

etcageeagg ggtgtagcca agtagacaaa tggaatcctg tgctgaaccc gaatcttcca 306 

aaaaaeagcc tacaatctgt gaecaccaca agatgtgcce tgatggeagc tgaagtttga 366 

ttcagatggg cacttttctt ceccttcect gectagtttc ettttgttee ttgagtccac 426 

geagaattcc attctetggt cageagaeag gcttaagcta aagtattgea tctattetgt 486 
a 487 



<210> 665 
<211> 415 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 54 . .248 

<400> 665 

aettctgacc tgcggcecgt agtccgagce ggggaeggeg gcgtcggtgg gtc atg 56 

Met 
1 

etc egg cac tec ccc teg ctg tgg gag ctg gtg gag gag cac gtt ccg 104 
Leu Arg His Ser Pro Ser Leu Trp Glu Leu Val Glu Glu His Val Pro 

5 10 15 

etc egg gag cga egc gaa gtg aag agg att ctg ggg gga ggc ggc ggt 152 
Leu Arg Glu Arg Arg Glu Val Lys Arg He Leu Gly Gly Gly Gly Gly 

20 25 30 

gga eet gag cct gga get gcg ggc gga ggt ggc gat gtt aeg ggc act 200 



wo 02/083898 PCT/IBO 1/009 14 

365 

Gly Pro Glu Pro Gly Ala Ala Gly Gly Gly Gly Asp Val Thr Gly Thr 

35 40 45 

get cca aga ggc teg ate etc tea age ccc eag ete ceg eec eat etc 248 

Ala Pro Arg Gly Ser lie Leu Ser Ser Pro Gin Leu Pro Pro His Leu 
50 55 60 65 

tgaeceetct tetettetgg eaecaeegcc teteetaaag gaeetettge geeaggaget 308 

ceggcagttg etccagggtc tccgceacaa agceatetgt gagggcaggg aceaggseea 368 

agcttgggtc eagtatagcc cccarggtcc tgcactttgc cttggag 415 

<210> 666 
<211> 487 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 13. .432 

<400> 666 

agtgaggaaa tc atg ggt cag aat cga gag atg ctt eeg ttt tgg atg aae 51 

Met Gly Gin Asn Arg Glu Met Leu Pro Phe Trp Met Asn 
15 10 



age 


ace 


ggc 


agg 


egg 


gaa 


ggt 


tgg 


cag 


egt 


gga 


tgg 


cae 


ggc 


tac 


gac 


99 


Ser 


Thr 


Gly 


Arg 


TVrg 


Glu 


Gly 


Trp 


Gin 


Arg 


Gly 


Trp 


His 


Gly 


Tyr 


Asp 






15 










20 










25 












aae 


gag 


etc 


atg 


gac 


atg 


egg 


ggc 


ate 


tte 


ctg 


gee 


tte 


gga 


cct 


gat 


147 


Asn 


Glu 


Leu 


Met 


Asp 


Met 


Arg 


Gly 


He 


Phe 


Leu 


Ala 


Phe 


Gly 


Pro 


Asp 




30 










35 










40 










45 




tte 


aaa 


tee 


aae 


tte 


aga 


get 


get 


cct 


ate 


agg 


teg 


gtg 


gac 


gtc 


tac 


195 


Phe 


Lys 


Ser 


Asn 


Phe 


Arg 


Ala 


Ala 


Pro 


He 


Arg 


Ser 


Val 


Asp 


Val 


Tyr 












50 










55 










60 






aat 


gtc 


atg 


tgc 


aat 


gtg 


gtg 


ggc 


ate 


acc 


ccg 


ctg 


ccc 


aac 


aac 


gga 


243 


Asn 


Val 


Met 


Cys 


Asn 


Val 


Val 


Gly 


He 


Thr 


Pro 


Leu 


Pro 


Asn 


Asn 


Gly 










65 










70 










75 








tee 


tgg 


tee 


agg 


gtg 


atg 


tgc 


atg 


ctg 


aag 


ggc 


cgc 


gcg 


gca 


ctg 


ccc 


291 


Ser 


Trp 


Ser 


Arg 


Val 


Met 


Cys 


Met 


Leu 


Lys 


Gly 


Arg 


Ala 


Ala 


Leu 


Pro 








80 










85 










90 










cgc 


ctg 


tct 


ggc 


eca 


gee 


act 


gtg 


eec 


tgg 


cat 


gat 


tct 


tct 


ctt 


cct 


339 


Arg 


Leu 


Ser 


Gly 


Pro 


Ala 


Thr 


Val 


Pro 


Trp 


His 


Asp 


Ser 


Ser 


Leu 


Pro 






95 










100 










105 












get 


tgc 


ata 


act 


gat 


cat 


att 


get 


tgt 


etc 


aga 


aaa 


aaa 


cac 


cat 


cag 


387 


Ala 


Cys 


lie 


Thr 


Asp 


His 


He 


Ala 


Cys 


Leu 


Arg 


Lys 


Lys 


His 


His 


Gin 




110 










115 










120 










125 




eaa 


agt 


ggg 


cet 


cca 


aag 


eca 


gat 


gat 


ttt 


cat 


ttt 


atg 


tgt 


gaa 




4 32 


Gin 


Ser 


Gly 


Pro 


Pro 


Lys 


Pro 


Asp 


Asp 


Phe 


His 


Phe 


Met 


Cys 


Glu 







130 135 140 



taatagcttc attaacacaa tcaagaccat gcacattgta aatacattat tcttg 4 87 

<210> 667 
<211> 476 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 33. .218 

<400> 667 

gaacaagcgt ccctgatcca gaaggtgttc ag atg gag atg gcg agt tct get 53 

Met Glu Met Ala Ser Ser Ala 

1 5 

ggc tec tgg etc tct ggc tgc etc ate cct etc gtc tte etc egg ctg 101 

Gly Ser Trp Leu Ser Gly Cys Leu He Pro Leu Val Phe Leu Arg Leu 



wo 02/083898 PCT/IBO 1/009 14 

366 

10 15 20 

tct gtg cat gtg tea ggc cac gca ggg gat gcc ggc aag ttc cac gtg 149 
Ser Val His Val Ser Gly His Ala Gly Asp Ala Gly Lys Phe His Val 

25 30 35 

gcc eta eta ggg ggc aca gcc gag ctg etc tgc cct etc tec etc tgg 197 
Ala Leu Leu Gly Gly Thr Ala Glu Leu Leu Cys Pro Leu Ser Leu Trp 
40 45 50 55 

ccc ggg acg tac cca agg agg tgaggtggct gcggtcccca ttcccgcagc 248 
Pro Gly Thr Tyr Pro Arg Arg 
60 

gctcccaggc tgttcacata ttccgggatg ggaaggacca ggatgaagat ctgatgccgg 308 
aatataaggg gaggacggtg ctagtgagag atgcccaaga gggaagtgtc actctgcaga 368 
tccttgacgt gcgccttgag gaccaagggt cttaccgatg tctgatccaa gttggaaatc 428 
tgagtaaaga ggacaccgtg atcctgcagg ttgcagcccc atctgtgg 476 

<210> 668 
<211> 475 
<212> DNA 
<213> Homo sapiens 



<220> 
<221> CDS 
<222> 90. .431 

<400> 668 

gtattatgat gtcgtataat gtaacattct gaaaatacaa catgtccacc taaatttcaa 60 



agggetgaaa aaaatttctt gaaagatga 


atg 


gac 


att 


gtg 


gcc 


cat 


tgt 


gag 


113 


















Met 
1 


Asp 


He 


Val 


Ala 

5 


His 


Cys 


Glu 




aaa 


gcc 


atg 


gca 


gag 


tea 


gtt 


aag 


aaa 


eat 


cag 


ttt 


age 


act 


tct 


gaa 


161 


Lys 


Ala 


Met 


Ala 


Glu 


Ser 


Val 


Lys 


Lys 


His 


Gin 


Phe 


Ser 


Thr 


Ser 


Glu 






10 










15 










20 












tct 


tat 


ctt 


cca 


gca 


act 


tta 


ata 


ctg 


aac 


age 


ata 


tec 


ttt 


aat 


act 


209 


Ser 


Tyr 


Leu 


Pro 


Ala 


Thr 


Leu 


He 


Leu 


Asn 


Ser 


He 


Ser 


Phe 


Asn 


Thr 




25 










30 










35 










40 




gat 


ggc 


tec 


ata 


cag 


tat 


aaa 


gaa 


aag 


ctg 


tat 


ttc 


tct 


gcc 


tec 


gaa 


257 


Asp 


Gly 


Ser 


He 


Gin 


Tyr 


Lys 


Glu 


Lys 


Leu 


Tyr 


Phe 


Ser 


Ala 


Ser 


Glu 












45 










50 










55 






gcc 


eta 


gac 


get 


tac 


att 


gat 


gat 


ttt 


cac 


tta 


aac 


tat 


gag 


cct 


cct 


305 


Ala 


Leu 


Asp 


Ala 


Tyr 


He 


Asp Asp 


Phe 


His 


Leu 


Asn 


Tyr 


Glu 


Pro 


Pro 










60 










65 










70 








gat 


att 


gat 


aca 


aag 


gtt 


aac 


ctg 


gat 


cag 


agt 


cct 


ctt 


gaa 


ttt 


ctt 


353 


Asp 


He 


Asp 


Thr 


Lys 


Val 


Asn 


Leu 


Asp 


Gin 


Ser 


Pro 


Leu 


Glu 


Phe 


Leu 








75 










80 










85 










get 


aag 


tea 


aat 


agt 


ggg 


gtc 


cat 


gga 


ate 


gca 


cca 


ggc 


caa 


agt 


ggc 


401 


Ala 


Lys 


Ser 


Asn 


Ser 


Gly 


Val 


His 


Gly 


He 


Ala 


Pro 


Gly 


Gin 


Ser 


Gly 






90 










95 










100 












atg 


gag 


aag 


ate 


ata 


cag 


tgg 


tat 


get 


gca 


taacaaggtt tctgtcagca 


451 


Met 


Glu 


Lys 


He 


He 


Gin 


Trp 


Tyr 


Ala 


Ala 

















105 110 

acaaattgct tatatgatgg tgge 475 



<210> 669 
<211> 578 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 411.. 575 



<400> 669 

gagcaagcce tggtggcagc gcagggtcca gtgcagcccc tccccacagc atgctggggg 



60 



wo 02/083898 
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367 

ctaattctga tgtcatcttt ctgcagaaaa ccattagacc atccctccag actgccaccc 120 

tcaaagccgt ctgcccaggc cccatctgac actcttgaca tctgcaggtc ccagacccta 180 

tgatgtgtcc actctggagg ctcctcatct tcctcgggtt gctggccttg cccttggcac 240 

cacacaagca gccttggcct ggcctggccc aagcccacag agacaacaaa tccaccctgg 300 

caagaagtcc ttcgataaca acatcgtaaa gatgtgtgca catatgagca tcgttgtgga 360 

gttctggctg gagaaagacg agtttggccg gaggatctgg tgataggcaa atg cga 416 

Met Arg 
1 



tgc aga 


ccc 


age 


agt 


gtc 


cat 


gtg 


gee 


ate 


etc 


act 


gar 


get 


ate 


cca 


464 


Cys Arg 


Pro 
5 


Ser 


Ser 


Val 


His 


Val 
10 


Ala 


He 


Leu 


Thr 


Glu 
15 


Ala 


He 


Pro 




cca aag 


atg 


ate 


agt 


ttc 


tct 


aca 


ame 


tea 


aag 


aga 


ate 


ttc 


aaa 


aak 


512 


Pro Lys 


Met 


He 


Ser 


Phe 


Ser 


Thr 


Xaa 


Ser 


Lys Arg 


He 


Phe 


Lys 


Xaa 




20 










25 










30 












tct cca 


a am 


atg 


kta 


aaa 


agt 


cag 


gta 


tgt 


cct 


ctg 


ate 


ggt 


gaa 


ate 


560 


Ser Pro 


Xaa 


Met 


Xaa 


Lys 


Ser 


Gin 


Val 


Cys 


Pro 


Leu 


He 


Gly Glu 


He 




35 








40 










45 










50 




teg ggc 


age 


tgg 


atg 


tga 






















578 


Ser Gly 


Ser 


Trp 


Met 
55 



























<210> 670 
<211> 467 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 62. .430 

<400> 670 

atttecggtt ecggtgtcag ttcgaggcgc cgccgccgcc gccgcagccg ccggagcegc 
a atg cct aaa gga gga aga aag gga ggc cac aaa ggc egg gcg agg cag 
Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg Gin 
15 10 15 



tat 


aca 


age 


cct 


gag 


gag ate 


gac 


gcg 


cas 


tgc 


agg 


ctg 


aga 


age 


aga 


157 


Tyr 


Thr 


Ser 


Pro 
20 


Glu 


Glu He 


Asp 


Ala 
25 


Xaa 


Cys 


Arg 


Leu 


Arg 
30 


Ser 


Arg 




agg 


cca 


ggg 


aag 


aag 


agg age 


aaa 


aaa 


ggt 


gga 


gat 


ggg 


get 


gea 


ggt 


205 


Arg 


Pro 


Gly 
35 


Lys 


Lys 


Arg Ser 


Lys 
40 


Lys 


Gly 


Gly 


Asp 


Gly 
45 


Ala 


Ala 


Gly 




gae 


ccc 


aaa 


aag 


gag 


aag aaa 


tct 


eta 


gac 


tea 


gat 


gag 


agt 


gag 


gat 


253 


Asp 


Pro 
50 


Lys 


Lys 


Glu 


Lys Lys 
55 


Ser 


Leu 


Asp 


Ser 


Asp 
60 


Glu 


Ser 


Glu 


Asp 




gaa 


gaa 


gat 


gac 


tac 


cag caa 


aag 


cgc 


aaa 


ggc 


gtt 


gaa 


ggg 


etc 


ate 


301 


Glu 


Glu 


Asp 


Asp 


Tyr 


Gin Gin 


Lys 


Arg 


Lys 


Gly 


Val 


Glu 


Gly 


Leu 


He 




65 










70 








75 










80 




gac 


ate 


gag 


aac 


ccc 


aac egg 


gtg 


gea 


cag 


aca 


ace 


aaa 


aag 


gtc 


aca 


349 


Asp 


He 


Glu 


Asn 


Pro Asn Arg 


Val 


Ala 


Gin 


Thr 


Thr 


Lys 


Lys 


Val 


Thr 










85 








90 










95 






caa 


ctg 


gat 


ctg 


gac 


ggg cca 


agg 


age 


ttt 


cga 


gga 


gag 


aac 


gag 


aag 


397 


Gin 


Leu 


Asp 


Leu 


Asp Gly Pro 


Arg 


Ser 


Phe 


Arg 


Gly 


Glu 


Asn 


Glu 


Lys 










100 








105 










110 








aga 


ttg 


aga 


age 


aga 


agg caa 


aag 


age 


gtt 


aca 


tgaaaatgea cttggccggg 


450 


Arg 


Leu 


Arg 


Ser 


Arg Arg Gin 


Lys 


Ser 


Val 


Thr 


















115 








120 





















aagaeagagc aagecaa 

<210> 671 
<211> 468 
<212> DNA 

<213> Homo sapiens 
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368 

<220> 
<221> CDS 
<222> 63. .431 

<400> 671 

agaacatcct aatcrwaaaa gttcgaggcg ccgccgccgc cgccgcagcc gccggagccg 60 

ca atg cct aaa gga gga aga aag gga ggc cac aaa ggc egg gcg agg 107 

Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg 



1 






5 








10 








15 




cag tat 
Gin Tyr 


aca 
Thr 


age 
Ser 


cct 
Pro 
20 


gag 
Glu 


gag 
Glu 


ate 
He 


gac 
Asp 


gcg 
Ala 
25 


cas 
Xaa 


tgc 
Cys 


agg 
Arg 


ctg 
Leu 


aga 
Arg 
30 


age 
Ser 


155 


aga agg 
Arg Arg 


oca 
Pro 


ggg 

Gly 
35 


aag 
Lys 


aag 
Lys 


agg 
Arg 


age 
Ser 


aaa 
Lys 
40 


aaa 
Lvs 


ggt 
Glv 


gga 
Gly 


gat 
Asp 


ggg 

Gly 
45 


get 
Ala 


gca 
Ala 


203 


ggt gae 
Gly Asp 


ccc 
Pro 
50 


aaa 
Lys 


aag 
Lys 


gag 
Glu 


aag 
Lys 


aaa 
Lys 
55 


tct 
Ser 


eta 
Leu 


gac 
Asp 


tea 
Ser 


gat 
Asp 
60 


gag 
Glu 


agt 
Ser 


gag 
Glu 


251 


gat gaa 
Asp Glu 

65 


gaa 
Glu 


gat 
Asp 


gac 
Asp 


tac 
Tyr 


cag 

Gin 
70 


caa 
Gin 


aag 
Lys 


cgc 
Arg 


aaa 
Lys 


ggc 
Gly 
75 


gtt 
Val 


gaa 
Glu 


ggg etc 
Gly Leu 


299 


ate gac 
He Asp 
80 


ate 
He 


gag 
Glu 


aac 
Asn 


ccc 
Pro 
85 


aac 
Asn 


egg 
Arg 


gtg 
Val 


gca 
Ala 


cag 
Gin 
90 


aca 
Thr 


acc 
Thr 


aaa 
Lys 


aag 
Lys 


gtc 
Val 
95 


347 


aca caa 
Thr Gin 


ctg 
Leu 


gat 
Asp 


ctg 
Leu 

100 


gac 
Asp 


ggg 

Gly 


cca 
Pro 


agg 
Arg 


age 
Ser 

105 


ttt 
Phe 


cga 
Arg 


gga 
Gly 


gag 
Glu 


aac 
Asn 
110 


gag 
Glu 


395 


aag aga 
Lys Arg 


ttg 
Leu 


aga 
Arg 
115 


age 
Ser 


aga 
Arg 


agg 
Arg 


caa 
Gin 


aag 
Lys 
120 


age 
Ser 


gtt 

Val 


aca 
Thr 


tgaaaatgca 




441 



cttggccggg aagacagagc aagccaa 4 68 

<210> 672 
<211> 527 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 85. .327 

<400> 672 

ettttctatg teaattttet ggcaacaaag tctgtggeea agaaagaage tacegaetcc 60 
ttcgaceaea aaaagttett ecaa atg gtc ggc ctg aag aaa aag agt gcg 111 

Met Val Gly Leu Lys Lys Lys Ser Ala 



gat 


gat 


gtg 


aag 


aag 


gtg 


1 

ttt cac 


atg 


ctg 


5 

gac aag 


gac 


aaa 


agt 


ggc 


159 


Asp Asp 


Val 


Lys 


Lys 


Val 


Phe 


His 


Met 


Leu 


Asp Lys 


Asp 


Lys 


Ser 


Gly 




10 










15 










20 








25 




ttc 


ate 


gag 


gag 


gat 


gag 


ctg 


gga 


ttc 


ate 


eta aaa 


ggc 


ttc 


tec 


eea 


207 


Phe 


He 


Glu 


Glu 


Asp 


Glu 


Leu 


Gly 


Phe 


He 


Leu Lys 


Gly 


Phe 


Ser 


Pro 












30 










35 








40 






gat 


gee 


aga 


gac 


ctg 


tct 


get 


aaa 


gaa 


acc 


aag atg 


ctg 


atg 


get 


get 


255 


Asp 


Ala 


Arg 


Asp 


Leu 


Ser 


Ala 


Lys 


Glu 


Thr 


Lys Met 


Leu 


Met 


Ala 


Ala 










45 










50 








55 








gga 


gac 


aaa 


gat 


ggg 


gac 


ggc 


aaa 


att 


ggg 


gtt gac 


gaa 


ttc 


tec 


act 


303 


Gly Asp 


Lys 


Asp 


Gly 


Asp 


Gly 


Lys 


He 


Gly 


Val Asp 


Glu 


Phe 


Ser 


Thr 








60 










65 








70 










ctg 


gtg 


get 


gaa 


ast 


aag 


aag 


cac 


tgactgcccc tggtcttcca cctctctgcc 


357 


Leu 


Val 


Ala 


Glu 


Xaa 


Lys 


Lys 


His 




















75 










80 





















etgaacaccc aatctcggcc cctcttgeca cectcetgea tttctgttca gttcgtttat 417 
gttatttttt actcccccat cccctgtgge cetctaatga caccattctt ctggaaaatg 477 



wo 02/083898 PCT/TBO 1/009 14 

369 

ctggagaagc aataaaggtt gtaccagtca aaaaaaaaaa aaaatgcgaa 527 



<210> 673 
<211> 594 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 161. .589 



<400> 673 

aaaaggatat acaaagaatg attttgccat tcatacacta aaaaataact ttgaagcaga 60 
tgcttatttt gttaaagtta ttgttcttag tggtctgtgt tctaatgact gtcctcgtaa 120 
gataacaccc tttggagtga atcaacctgg accttatatc atg tat aca act gta 175 

Met Tyr Thr Thr Val 
1 5 



gat 


gca 


aat 


ggg 


tat 


ctg 


aaa 


aat 


gga 


tea 


get 


ggc 


caa 


etc 


age 


caa 


223 


Asp Ala 


Asn Gly Tyr 


Leu 


Lys 


Asn 


Gly 


Ser 


Ala 


Gly 


Gin 


Leu 


Ser 


Gin 












10 










15 










20 






tec 


tea 


cat 


tta 


get 


etc 


caa 


eta 


cca 


tac 


aac 


gtg 


ctt 


ggt 


tta 


ggt 


271 


Ser 


Ser 


His 


Leu 
25 


Ala 


Leu 


Gin 


Leu 


Pro 

30 


Tyr 


Asn 


Val 


Leu 


Gly 
35 


Leu 


Gly 




egg 


age 


gca 


aat 


ttt 


ctt 


gae 


eat 


etc 


tac 


gtt 


ggt 


att 


ccc 


egt 


cea 


319 


Arg Ser 


Ala 


Asn 


Phe 


Leu Asp 


His 


Leu 


Tyr 


Val 


Gly 


He 


Pro 


Arg 


Pro 








40 










45 










50 










tot 


gga 


gaa 


aaa 


tet 


ata 


ega 


aaa 


caa 


gag 


tgg 


act 


gca 


ate 


att 


cca 


367 


Ser 


Gly 
55 


Glu 


Lys 


Ser 


He 


Arg 
60 


Lys 


Gin 


Glu 


Trp 


Thr 

65 


Ala 


He 


He 


Pro 




aat 


tec 


cag 


eta 


att 


gtc 


att 


cea 


tac 


ect 


cac 


aat 


gtc 


ect 


ega 


agt 


415 


Asn 


Ser 


Gin 


Leu 


He 


Val 


He 


Pro 


Tyr 


Pro 


His 


Asn 


Val 


Pro 


Arg 


Ser 




70 










75 










80 










85 




tgg 


agt 


gcc 


aaa 


ctg 


tat 


ctt 


aca 


cca 


agt 


aat 


att 


gtt 


ctg 


ctt 


act 


463 


Trp 


Ser 


Ala 


Lys 


Leu 
90 


Tyr 


Leu 


Thr 


Pro 


Ser 
95 


Asn 


He 


Val 


Leu 


Leu 
100 


Thr 




get 


ata 


get 


ete 


ate 


ggt 


gtc 


tgt 


gtt 


tte 


ate 


ttg 


gea 


ata 


att 


ggc 


511 


Ala 


He 


Ala 


Leu 
105 


He 


Gly 


Val 


Cys 


Val 
110 


Phe 


He 


Leu 


Ala 


He 
115 


He 


Gly 




att 


tta 


cat 


tgg 


cag 


gaa 


aag 


aaa 


gca 


gat 


gat 


aga 


gaa 


aaa 


ega 


caa 


559 


lie 


Leu 


His 


Trp Gin 


Glu 


Lys 


Lys 


Ala 


Asp 


Asp 


Arg 


Glu 


Lys 


Arg 


Gin 








120 










125 










130 










gaa 


gee 


cac 


egg 


ttt 


cat 


ttt 


gat 


get 


atg 


tgact 










594 


Glu 


Ala 


His Arg 


Phe 


His 


Phe 


Asp 


Ala 


Met 

















135 140 



<210> 674 
<211> 545 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 297.. 500 



<400> 674 

atgaggttee 

teceettcca 

cataatgeat 

aaaaaggcaa 

ggtaagtctg 



atgcggccaa 
aataetttaa 
gggteeagte 
gagtaatatt 
tgtgtgccag 



caagagaggg 
gttcceagtt 
etgttctttg 
caaaatactg 
cetgeagcag 



agatagaaag 
taacetatgt 
ggttgcaeaa 
aagagctatg 
gtggtctcag 



atg gae ace atg gat ccc tte tgg ttt aat gea 



gaaagggece cagatgtgge 60 

gatcetttca atattteeae 120 

aataattage acaagcatac 180 

gtacactaaa gtgecctggg 240 

ccetggaagg aacage atg 299 

Met 

1 

tte aaa agg act aat 347 
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Met 


Asp 


Thr 


Met 
5 


Asp 


Pro 


Phe 


Trp 


Phe 
10 


Asn 


Ala 


Phe 


Lys Arg 
15 


Thr 


Asn 




acc 


ata 


ctg 


cac 


cat 


ttq 


aga 


atg 


tec 


aag 


cac 


aea 


gat gca 


gca 


gaa 


395 


Thr 


He 


Leu 


His 


His 


Leu 


Arg Met 


Ser 


Lys 


His 


Thr Asp Ala 


Ala 


Glu 








20 










25 










30 








gag 


gtg 


eta 


ttg 


gaa 


aaa 


aaa 


ggt 


tgc 


a eg 


gga 


gtc 


ata aea 


eta 


aac 


443 


Glu 


Val 


Leu 


Leu 


Glu 


Lys 


Lys 


Gly Cys 


Thr Gly 


Val 


He Thr 


Leu 


Asn 






35 










40 










45 










aga 


cca 


aag 


ttc 


etc 


aat 


gca 


ctg 


act 


ett 


aat 


atg 


att egg 


cag 


att 


491 


Arg 


Pro 


Lys 


Phe 


Leu 


Asn 


Ala 


Leu 


Thr 


Leu 


Asn 


Met 


He Arg 


Gin 


He 




50 










55 










60 








65 





tat cac age taaagaagtt ggaacaagat cetgaaactt tectgatcat tataa 545 
Tyr His Ser 



<210> 675 
<211> 534 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 13. .234 



<400> 675 

tttttttcca et atg get tec age act gtc ccg gtg age get get ggc teg 51 
Met Ala Ser Ser Thr Val Pro Val Ser Ala Ala Gly Ser 









1 








5 










10 






get 


aat 


gaa 


act 


ecc 


gaa 


ata 


ccg 


gac 


aac 


gtg 


gga 


gat tgg ett 


egg 


99 


Ala 


Asn 
15 


Glu 


Thr 


Pro 


Glu 


He 
20 


Pro 


Asp 


Asn 


Val 


Gly 
25 


Asp Trp Leu 


Arg 




ggc 


gtc 


tac 


egc 


ttt 


gee 


act 


gat 


agg 


aat 


gac 


ttc 


egg agg aac 


ttg 


147 


Gly Val 


Tyr 


Arg 


Phe 


Ala 


Thr 


Asp 


Arg 


Asn 


Asp 


Phe 


Arg Arg Asn 


Leu 




30 










35 










40 






45 




ata 


eta 


aat 


ttg 


gga 


etc 


ttt 


get 


gcg 


gga 


gtt 


tgg 


ctg gcc agg 


aac 


195 


He 


Leu 


Asn 


Leu 


Gly 
50 


Leu 


Phe 


Ala 


Ala 


Gly 
55 


Val 


Trp 


Leu Ala Arg 
60 


Asn 




ttg 


agt 


gac 


att 


gac 


etc 


atg 


gca 


cet 


cag 


cca 


ggg 


gtg tagceaagta 


244 


Leu 


Ser 


Asp 


He 
65 


Asp 


Leu 


Met 


Ala 


Pro 
70 


Gin 


Pro 


Gly 


Val 







gaeaaatgga atcctgtget gaaeecgaat etteeaaaaa acageetaea atctgtgace 304 

accacaagat gtgcectgat ggcagctgaa gtttgattea gatgggcact tttetteece 364 

ttcectgeet agtttccttt tgttccttga gtceacgcag aattecattc tctggteage 424 

agacaggctt aagetaaagt attgcctcta ttctgtaaag ttctgtacat agttceeaag 484 

cttetgcagg gggtgatttt tgctettgtc etgagaaata acagtgetgt 534 



<210> 676 
<211> 509 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 125.. 367 



<220> 

<221> mise_feature 

<222> 91,461 

<223> n=a, g, e or t 



<400> 676 

agccgagcet gcggaaggeg geggeggegg caectgcgat eageggctgg ggeaggttat 60 

ggtagtgcgg aetgcggtgt gagcagageg necaegggge ecgecatgeg eeggcggccc 120 

tgac atg ggc gee acg ggt cca aag etc ggg gee tgt ggc ett egc etc 169 
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Met Gly Ala Thr Gly Pro Lys Leu Gly Ala Cys Gly Leu Arg Leu 





1 






5 








10 










15 




ggc 


ggc 


egg ags 




ggc 


tea 


gag 


gca gca 


gga 


get 


gag 


caa 


get 


ttg 


217 


Gly Gly 


Arg Xaa 


Gly 


Gly 


Ser 


Glu 


Ala Ala 


Gly 


Ala 


Glu 


Gin 


Ala 


Leu 










20 








25 










30 






gtr 


egg 


cet egg 


gge 


cga 


get 


gtr 


cce cee 


ttc 


gta 


ttc 


acg 


egc 


egc 


265 


Val 


Arg 


Pro Arg 


Gly 


Arg 


Ala 


Val 


Pro Pro 


Phe 


Val 


Phe 


Thr 


Arg 


Arg 








35 










40 








45 








ggc 


tct 


atg ttc 


tat 


gat 


gag 


gat 


ggg gat 


ctr 


get 


eac 


gag 


ttc 


tat 


313 


Gly Ser 


Met Phe 


Tyr 


Asp 


Glu 


Asp 


Gly Asp 


Leu 


Ala 


His 


Glu 


Phe 


Tyr 








50 








55 








60 










gag 


gag 


aca ate 


gtc 


acc 


aag 


aae 


ggg cag 


aak 


ekg 


gcc 


aag 


ctg 


agg 


361 


Glu 


Glu 


Thr He 


Val 


Thr 


Lys 


Asn 


Gly Gin 


Xaa 


Xaa 


Ala 


Lys 


Leu 


Arg 






65 








70 








75 












gca 


teg 


tgaagetgga teasccccgc atceacgtgg atttccctgt 


gatcctctat 


417 



Ala Ser 
80 

gaggtgtgae ccetgggagg tggcagaeag aageaccccc tgcnceggca agaaactccc 477 
aggctcaatc aaggtgtggc ttccatttga gg 509 



<210> 677 
<211> 591 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 101.. 304 

<400> 677 

gcggaagtgc tgcttactgg gtcctgaatc ggctgcgtge tgttetaggc atctttcacc 60 



tgg gaa gaa oca etc tac gtt ate acc ttc 
Trp Glu Glu Pro Leu Tyr Val He Thr Phe 

10 15 
gcc aaa tgg aca gaa age ccg aga caa cat 
Ala Lys Trp Thr Glu Ser Pro Arg Gin His 

25 30 
att ttg gag aaa ttg act tac cat acc act 
He Leu Glu Lys Leu Thr Tyr His Thr Thr 

40 45 50 

cet gtc tat gtc caa tea gca gaa tgt etc gga cea ect aaa aag 304 
Pro Val Tyr Val Gin Ser Ala Glu Cys Leu Gly Pro Pro Lys Lys 

55 60 65 

taaaagaagg agaetgaaat aatagcatct ttgatgaaaa etatctggaa gacaagttgt 364 
taacaattet ggggatettg gtgattacag agttcttaat ccctctgtcc ataggtgatg 424 
acaattacag getgeetata ggtectatag tgcteacaea cctccagccc ttccecatgg 484 
tgtacacaca cttgeagtat attcatctet ttgtcttatt tgagagtagg gctgggtgtg 544 
tgtacaaact aatgacaaat aettgaeagt caeacagcag tgataea 591 



jatt 


atg 


tgg 


get 


ttt 


cea 


115 




Met 


Trp 


Ala 


Phe 


Pro 






1 








5 




tac 


ata 


gta 


gca 


gtt 


gaa 


163 


Tyr 


He 


Val 


Ala 


Val 
20 


Glu 




gaa 


gtt 


gtt 


eta 


caa 


gtt 


211 


Glu 


Val 


Val 


Leu 
35 


Gin 


Val 




cat 


caa 


ccc 


atg 


caa 


aag 


259 


His 


Gin 


Pro 


Met 


Gin 


Lys 





<210> 678 
<211> 348 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 11. .331 



<400> 678 

aaaaaacaac atg cea cag tta aaa ctt act tea gtt tgg aca act act 



49 
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Met Pro Gin Leu Lys Leu Thr Ser Val Trp Thr Thr Thr 
15 10 



cac 


age 


tac 


tac 


aca 


grg 


acc 


cga 


acg 


agk 


tea 


cct 


ggd 


tak 


ama 


cct 


97 


His 


Ser 
15 


Tyr 


Tyr 


Thr 


Xaa 


Thr 
20 


Arg 


Thr 


Xaa 


Ser 


Pro 
25 


Gly 


Xaa 


Xaa 


Pro 




gga 


cca 


cca 


cca 


atg 


gat 


atw 


caa 


atg 


gca 


aac 


aat 


ttt 


act 


ccg 


ccc 


145 


Gly 


Pro 


Pro 


Pro 


Met 


Asp 


He 


Gin 


Met 


Ala 


Asn 


Asn 


Phe 


Thr 


Pro 


Pro 




30 










35 










40 










45 




tct 


gca 


act 


cct 


cag 


gga 


aat 


gac 


tgt 


gac 


etc 


tat 


gca 


cat 


cac 


age 


193 


Ser 


Ala 


Thr 


Pro 


Gin 


Gly Asn 


Asp 


Cys 


Asp 


Leu 


Tyr 


Ala 


His 


His 


Ser 












50 










55 










60 






acg 


gcc 


agt 


ata 


gta 


atg 


cct 


ctg 


cat 


tac 


age 


etc 


gkc 


ttc 


ate 


att 


241 


Thr 


Ala 


Ser 


He 
65 


Val 


Met 


Pro 


Leu 


His 
70 


Tyr 


Ser 


Leu 


Xaa 


Phe 
75 


He 


He 




ggg 


etc 


gtg 


gga 


aac 


tta 


eta 


gcc 


ttg 


gtc 


gtc 


att 


gtt 


caa 


aac 


agg 


289 


Gly Leu 


Val 


Gly 


Asn 


Leu 


Leu 


Ala 


Leu 


Val 


Val 


He 


Val 


Gin 


Asn 


Arg 








80 










85 










90 










aaa 


aaa 


ate 


ate 


tea 


cct 


tgr 


gcc 


cam 


ccc 


cam 


ccc 


att 


gkc 






331 


Lys 


Lys 


He 


He 


Ser 


Pro 


Xaa 


Ala 


Xaa 


Pro 


Xaa 


Pro 


He 


Xaa 









95 100 105 

taatctgtag aagctaa 348 



<210> 679 
<211> 503 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 194. .409 



<400> 679 

catctcccac caggccccac ctgcaacact ggggatcaaa tttcaacacg agatttggaa 60 

ggggcaaata tccaaactgt ateageaetc tttgaaagat tacagagtaa atatctttct 120 

tcgtcagaaa tggaatgate eccgeetcgc gtacagtgaa tatcctgacg actctttaga 180 

cetcgacccc tec atg ttg gac tec att tgg aaa cct gat ttg ttc ttt 229 

Met Leu Asp Ser He Trp Lys Pro Asp Leu Phe Phe 

15 10 



gcc 


aat 


gaa 


aag 


ggt 


gcc 


aac 


ttt 


cat 


gaa 


gtc act 


aca gac aac 


aaa 


277 


Ala 


Asn 


Glu 


Lys Gly 


Ala 


Asn 


Phe 


His 


Glu 


Val Thr 


Thr Asp Asn 


Lys 








15 










20 








25 






ttg 


eta 


aga 


att 


ttc 


aaa 


aat 


gga 


aat 


gtt 


ctt tat 


tea ata aga 


tta 


325 


Leu 


Leu 


Arg 


He 


Phe 


Lys 


Asn 


Gly Asn 


Val 


Leu Tyr 


Ser He Arg 


Leu 






30 










35 








40 








aca 


tta 


aca 


ctt 


tee 


tgt 


cca 


atg 


gat 


etc 


aag aat 


ttt ccc atg 


gat 


373 


Thr 


Leu 


Thr 


Leu 


Ser 


Cys 


Pro 


Met 


Asp 


Leu 


Lys Asn 


Phe Pro Met 


Asp 




45 










50 










55 




60 




gta 


caa 


aca 


tgt 


ata 


atg 


caa 


ctg 


gaa 


age 


ttg gtt 


taaaatgaca 




419 


Val 


Gin 


Thr 


Cys 


He 
65 


Met 


Gin 


Leu 


Glu 


Ser 

70 


Leu Val 









agataaaaga ttgtaaatcg gtggaaccac tggaaagtag gcaaaaagag tttaacctac 479 
tggaaagcaa gagctgtcag aatc 503 



<210> 680 
<211> 561 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 256. .426 



<400> 680 
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ttgaagattt ttttggtata ttctagatgt taatctcatc ctgtcttctg tttatcaact 60 
ttgtcaatgg taactttgag aaatttttag ttttgatata gccccagtcc attaattttt 120 
gccttaggtt gcactttaag agtataagat atttttcctt accccaagat tacaaagata 180 
ttctgtattc tgtcagcttt ccagttttac ctttcacatt gagggcttta ttctatttga 240 
agttcatctt catat atg gga aaa caa agg aat etc aca tta ttt ttt tec 291 
Met Gly Lys Gin Arg Asn Leu Thr Leu Phe Phe Ser 
15 10 
ttt oca ttc tgt gag aca gtt ttc eta aat cat eta tta aac agt gta 339 
Phe Pro Phe Cys Glu Thr Val Phe Leu Asn His Leu Leu Asn Ser Val 

15 20 25 

ttc ttt cet tea tat agt ttt gtg gag eca ett tta tea tae etc aga 387 
Phe Phe Pro Ser Tyr Ser Phe Val Glu Pro Leu Leu Ser Tyr Leu Arg 

30 35 40 

ttc eca eag aca cat att agt tta ett ttt caa ett tat taagagttat 436 
Phe Pro Gin Thr His He Ser Leu Leu Phe Gin Leu Tyr 
45 50 55 

taaggctgag cgtggtggct caatgcetgt aateccagca etttgggaga etgaggtggg 4 96 
aggattgttt gagtccagga gttcaagacc agcctgggca acatggagag accctgtetc 556 
tacaa 561 



<210> 681 
<211> 520 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 21.. 326 



<400> 681 

etttggette cccaccggea atg gag egg ttg acg ttg ect etc ggc ggc gcg 53 

Met Glu Arg Leu Thr Leu Pro Leu Gly Gly Ala 
15 10 



gcg 


gcg 


gtg 


gae 


gag 


tac 


ctg 


gag 


tae 


egg 


aga 


att 


gtt 


ggt 


gag 


gat 


101 


Ala 


Ala 


Val 


Asp 


Glu 


Tyr 


Leu 


Glu 


Tyr 


Arg 


Arg 


He 


Val 


Gly 


Glu 


Asp 










15 










20 










25 








gat 


gga 


ggg 


aaa 


ett 


ttt 


act 


cet 


gaa 


gaa 


tat 


gaa 


gaa 


tac 


aaa 


aga 


149 


Asp Gly 


Gly 


Lys 


Leu 


Phe 


Thr 


Pro 


Glu 


Glu 


Tyr 


Glu 


Glu 


Tyr 


Lys 


Arg 








30 










35 










40 










aaa 


gtt 


tta 


cet 


ctg 


cgc 


tta 


caa 


aac 


aga 


tta 


ttt 


gtg 


age 


tgg 


egg 


197 


Lys 


Val 


Leu 


Pro 


Leu 


Arg 


Leu 


Gin 


Asn 


Arg 


Leu 


Phe 


Val 


Ser 


Trp 


Arg 






45 










50 










55 












tea 


eca 


aca 


ggg 


atg 


gat 


tgt 


aaa 


ett 


gtg 


ggc 


cea 


gag 


aca 


ctg 


tgt 


245 


Ser 


Pro 


Thr 


Gly 


Met 


Asp 


Cys 


Lys 


Leu 


Val 


Gly 


Pro 


Glu 


Thr 


Leu 


Cys 




60 










65 










70 










75 




ttt 


tgt 


aca 


cat 


agg 


tat 


aaa 


caa 


cat 


aaa 


act 


gae 


ttg 


gaa 


gcg 


att 


293 


Phe 


Cys 


Thr 


His 


Arg 


Tyr 


Lys 


Gin 


His 


Lys 


Thr 


Asp 


Leu 


Glu 


Ala 


He 












80 










85 










90 






cet 


eag 


eag 


tgc 


CCC 


att 


gat 


CCC 


cet 


gee 


aag 


tgactggctg ccagtgcagg 


346 


Pro 


Gin 


Gin 


Cys 


Pro 


He 


Asp 


Pro 


Pro 


Ala 


Lys 















95 100 



gettacettt atgteeectt gaatggtage eagcecatte getgeaggtg caaacacttt 406 
gctgatcage acagtgetgc gcctggcttt acatgcaata catgttccaa gtgtteagga 4 66 
ttccataget gcttcacttg tgcttgtggt cagcctgcat attggycatg acac 520 



<210> 682 
<211> 513 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 85.. 396 
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<400> 682 

actttctcgg ggtgtctgcg taactgccca gacttgcctt ggtttggtca gatgacctcc 60 
tctgggactg gctagccagc gttc atg tct aca tta ttg ate teg tet tec 111 

Met Ser Thr Leu Leu lie Ser Ser Ser 

1 5 



gat 


gee 


tta 


ttg 


gag 


gaa 


ctg 


gaa 


cge 


tee 


ace 


ctt 


cag 


gac 


agt 


gat 


159 


Asp Ala 


Leu 


Leu 


Glu 


Glu 


Leu 


Glu 


Arg 


Ser 


Thr 


Leu 


Gin 


Asp 


Ser 


Asp 




10 










15 










20 










25 




gaa 


tat 


tec 


aac 


cca 


get 


cet 


ctt 


ccc 


ctg 


gat 


cag 


eat 


tec 


aga 


aag 


207 


Glu 


Tyr 


Ser 


Asn 


Pro 


Ala 


Pro 


Leu 


Pro 


Leu 


Asp 


Gin 


His 


Ser 


Arg 


Lys 












30 










35 










40 






gag 


act 


aac 


ctt 


gat 


gag 


act 


teg 


gag 


ate 


ctt 


tct 


att 


cag 


gat 


aac 


255 


Glu 


Thr 


Asn 


Leu 


Asp 


Glu 


Thr 


Ser 


Glu 


He 


Leu 


Ser 


He 


Gin 


Asp 


Asn 










45 










50 










55 








aca 


agt 


ccc 


ttg 


ceg 


gcg 


cas 


teg 


tgt 


ata 


eta 


cca 


ata 


tee 


agg 


age 


303 


Thr 


Ser 


Pro 


Leu 


Pro 


Ala 


Xaa 


Ser 


Cys 


He 


Leu 


Pro 


He 


Ser 


Arg 


Ser 








60 










65 










70 










tea 


atg 


tct 


aca 


gtg 


aag 


ccc 


aag 


age 


caa 


agg 


aat 


eac 


eac 


cge 


ctt 


351 


Ser 


Met 


Ser 


Thr 


Val 


Lys 


Pro 


Lys 


Ser 


Gin 


Arg 


Asn 


His 


His 


Arg 


Leu 






75 










80 










85 












eta 


aaa 


cgt 


cag 


cag 


ctg 


etc 


agt 


tgg 


atg 


age 


tea 


ttg 


etc 


ace 




396 


Leu 


Lys 


Arg 


Gin 


Gin 


Leu 


Leu 


Ser 


Trp 


Met 


Ser 


Ser 


Leu 


Leu 


Thr 






90 






95 










100 















tgactgagat geaggceaag gttgcagtga gagcagatgc tggcaagaag caettaecag 456 
acaagcagga teaeaaggcc tccetggaet eaatgcttgg gggtctcgag eaggaat 513 

<210> 683 
<211> 521 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 317.. 505 

<400> 683 

aagggaatct cegggcgggg tagtgcagge gcegggtttc ccgcggtccg agetggcgeg 60 
ggcggaggag aatcgetett aaagggccag cgeacacgcg ttcttttgtt ecggggccgc 120 
aggeggggca ggcecgactt tcgccgtctt ettgtctact ctccagaacg gccatgattt 180 
cccaattett cattctgtee teeaaggggg acegctcate tacaaagact teegegggga 240 
cagtggcgge cgggatgtgg ccgagctctt ctaccggaag ctgaegggac tgccaggaga 300 
cgagtccccg gttgtc atg gac tat ggc tat gta cag ace aca tec acg gag 352 
Met Asp Tyr Gly Tyr Val Gin Thr Thr Ser Thr Glu 
15 10 
atg ctg agg aat tte ate cag acg gaa get gtg gtc age aag ccc ttc 400 
Met Leu Arg Asn Phe He Gin Thr Glu Ala Val Val Ser Lys Pro Phe 

15 20 25 

age etc ttt gac etc age age gtt ggc ttg ktt ggg get gag aca caa 448 
Ser Leu Phe Asp Leu Ser Ser Val Gly Leu Xaa Gly Ala Glu Thr Gin 

30 35 40 

cag age aaa gtg gee ccc cag cag tgc age cag yeg sec cgt cet gkc 496 
Gin Ser Lys Val Ala Pro Gin Gin Cys Ser Gin Xaa Xaa Arg Pro Xaa 
45 50 55 60 

cag teg etc tgaccagagc caaaag 521 
Gin Ser Leu 

<210> 684 
<211> 462 
<212> DNA 

<213> Homo sapiens 



<220> 



wo 02/083898 PCT/IBO 1/009 14 

375 

<221> CDS 
<222> 2. .196 



<400> 684 

a atg etc age gcg ctg ccc ggc tgg gga ccc gcg cac ctg cag egc ccg 4 9 
Met Leu Ser Ala Leu Pro Gly Trp Gly Pro Ala His Leu Gin Arg Pro 
15 10 15 

ctg etc ggc cct gca tec tge ctg ggc ate ctg cgc ccg gee atg aeg 97 

Leu Leu Gly Pro Ala Ser Cys Leu Gly He Leu Arg Pro Ala Met Thr 
20 25 30 

gcg eac tea ttc gcc etc ccg gte ate ate ttc acc acg ttc tgg ggc 145 

Ala His Ser Phe Ala Leu Pro Val He He Phe Thr Thr Phe Trp Gly 
35 40 45 

etc gte ggc ate gee ggg ccc tgg ttc gtg ccg aag gae eca acc gcg 193 

Leu Val Gly He Ala Gly Pro Trp Phe Val Pro Lys Asp Pro Thr Ala 
50 55 60 

gag tgatcateae catgctggtc gccaccgecg tctgctgtta cctcttetgg 24 6 

Glu 

65 

cteategeca tcetggcgca stgaacccec tgttcgggcc ecagctgaag aatgagacca 306 
tctggtacgt gcgettcctg tgggagtgac ecgecgecce egaeeeaggt geeeagctct 366 
eggaatgact gtggctccac tgtccctgae aaccccttcg tccggaccct cccccacaca 426 
actatgtctg gteaeeagct ccetcctgct ggeace 4 62 



<210> 685 
<2H> 513 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 128.. 376 



<400> 685 

atctggatat ttctcaeatt tgccttggtt tgatgctcct ggagttccca aaagaataga 60 
gatcacttta tcaagcttgc agaacaggtt gcatgattgt tgtgctgctg cttgctagta 120 



aaagttt 


atg 


ctt 


gta 


gaa 


aga 


aaa 


gtt 


aat 


ggg 


ctg 


gtg 


ccg 


egg 


etc 


169 






Met 


Leu 


Val 


Glu 


Arg 


Lys 


Val 


Asn 


Gly Leu 


Val 


Pro 


Arg 


Leu 








1 








5 










10 












atg 


cct 


gta 


ate 


gca 


gca 


cat 


tgg 


gag 


gcc 


cag 


gtg 


ggc 


gca 


tea 


ctt 


217 


Met 


Pro 


Val 


He 


Ala 


Ala 


His 


Trp 


Glu 


Ala 


Gin 


Val 


Gly 


Ala 


Ser 


Leu 




15 










20 










25 










30 




gag 


ggt 


tec 


age 


ate 


ttg 


gac 


aga 


get 


gtt 


att 


gaw 


cac 


aat 


ttg 


ttg 


265 


Glu 


Gly 


Ser 


Ser 


He 


Leu 


Asp 


Arg 


Ala 


Val 


He 


Xaa 


His 


Asn 


Leu 


Leu 












35 










40 










45 






tet 


gca 


age 


aaa 


tta 


tat 


aat 


aat 


att 


acc 


ttc 


gaa 


gaa 


ctt 


gga 


get 


313 


Ser 


Ala 


Ser 


Lys 


Leu 


Tyr 


Asn 


Asn 


He 


Thr 


Phe 


Glu 


Glu 


Leu 


Gly 


Ala 










50 










55 










60 








ctt 


tta 


gag 


ate 


cct 


gca 


get 


aag 


gcg 


gaa 


aag 


ata 


gca 


tet 


caa 


atg 


361 


Leu 


Leu 


Glu 


He 


Pro 


Ala 


Ala 


Lys 


Ala 


Glu 


Lys 


He 


Ala 


Ser 


Gin 


Met 





65 70 75 

ata acc gaa gae gta tgaatggatt tattgaccag attgatggaa tagttcattt 416 
He Thr Glu Asp Val 



80 

tgaaacacga gaagccctgc caaegtggga taageagatc caatcacttt gtttccaagt 476 
gaataaeett tttggagaaa attagtcaaa cagcacc 513 



<210> 686 
<211> 498 
<212> DNA 
<213> Homo sapiens 



<220> 



wo 02/083898 PCT/IBO 1/009 14 

376 

<221> CDS 
<222> 118. .399 



<400> 686 

aacaattcat gaagttgaag aaaagacact gtcagaaatg aacacagaag cggasaacag 60 
cttctccatc acgccagaaa tggcaatgct gaagaagtaa gacaactatt agagacc 117 



atg 


gcg 


agt 


aat 


gaa 


gtg 


att 


get 


gac 


att 


aat tge 


aaa 


gga 


aga 


agt 


165 


Met 


Ala 


Ser 


Asn 


Glu 


Val 


lie 


Ala 


Asp 


He 


Asn Cys 


Lys 


Gly 


Arg 


Ser 




1 








5 










10 








15 






aag 


tct 


aac 


ttg 


ggc 


tgg 


aca 


ccc 


eta 


cat 


ctg gca 


tgc 


tat 


ttt 


gga 


213 


Lys 


Ser 


Asn 


Leu 
20 


Gly 


Trp 


Thr 


Pro 


Leu 
25 


His 


Leu Ala 


Cys 


Tyr 
30 


Phe 


Gly 




cac 


aga 


caa 


gtg 


gtc 


cag 


gat 


Gtg 


ttg 


aag 


get ggt 


gca 


gaa 


gtg 


aat 


261 


His 


Arg 


Gin 
35 


Val 


Val 


Gin 


Asp 


Leu 
40 


Leu 


Lys 


Ala Gly 


Ala 
45 


Glu 


Val 


Asn 




gtg 


ttg 


aat 


gac 


atg 


gga 


gac 


acg 


eeg 


ett 


eat cga 


get 


gee 


ttt 


aca 


309 


Val 


Leu 
50 


Asn 


Asp 


Met 


Gly 


Asp 
55 


Thr 


Pro 


Leu 


His Arg 
60 


Ala 


Ala 


Phe 


Thr 




gga 


cga 


aag 


gtg 


aaa 


ate 


att 


eta 


tgt 


tea 


atg ttt 


gta 


agt 


gag 


gta 


357 


Gly Arg 


Lys 


Val 


Lys 


lie 


lie 


Leu 


Cys 


Ser 


Met Phe 


Val 


Ser 


Glu 


Val 




65 










70 










75 








80 




ttt 


gga 


gga 


gta 


gtt 


acc 


att 


gtt 


ttc 


tct 


gtt ata 


acc 


ate 






399 


Phe 


Gly 


Gly 


Val 


Val 
85 


Thr 


He 


Val 


Phe 


Ser 
90 


Val He 


Thr 


He 









tgaccageaa cegagaaagc eaeacaaaaa aatgtataea ceagcacttt gggteaaagg 459 
ceacaggatc ttttgagtet gacagtgagg teeagtact 498 



<210> 687 
<211> 550 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 82. .306 



<400> 687 

agtctcgcca gctgccggtc tttcgggggc tccgtaactt tctatccgtc cgcgtcageg 60 

ettgecaeee teatcteeaa t atg eet ggt eeg acc eee agt ggc act aae 111 

Met Pro Gly Pro Thr Pro Ser Gly Thr Asn 
15 10 

gtg gga tee tea ggg ege tct eee age aaa gca gtg gee gee egg gcg 159 

Val Gly Ser Ser Gly Arg Ser Pro Ser Lys Ala Val Ala Ala Arg Ala 
15 20 25 

207 



255 



ggg 


gat 


cca 


ctg 


tec 


ggc 


aga 


gga 


aaa 


atg 


cca 


get 


gtg 


gga 


caa 


gga 


Gly Asp 


Pro 


Leu 


Ser 


Gly 


Arg 


Gly 


Lys 


Met 


Pro 


Ala 


Val 


Gly 


Gin 


Gly 








30 










35 










40 






gtg 


cag 


gee 


gca 


caa 


eet 


egg 


cag 


gca 


eeg 


ggg 


gga 


tgt 


ggc 


gat 


tct 


Val 


Gin 


Ala 


Ala 


Gin 


Pro 


Arg 


Gin 


Ala 


Pro 


Gly Gly 


Cys 


Gly 


Asp 


Ser 






45 










50 










55 








aca 


cag 


aag 


att 


cac 


ctg 


ggc 


tea 


aag 


ttg 


gee 


etg 


ttc 


cag 


tat 


tgg 


Thr 


Gin 
60 


Lys 


He 


His 


Leu 


Gly 
65 


Ser 


Lys 


Leu 


Ala 


Leu 
70 


Phe 


Gin 


Tyr 


Trp 



303 



tta tgagtettet gtteategct tetgtattta tgttgeacat ttggggcaag 356 

Leu 

75 

tacactcgtt cgtagattca gttacatcca tctgtcatct gaagaaggag gaaaaaaccc 416 

aacatttctt ggaccaaaag tatagtgact atetgttcat gagagaaatt ttctgtaagc 476 

ttgctgtttt acaggggatt tatcaataat tgattttgag gaatcagttt ttttetatgg 536 

etaataaact tttt 550 



<210> 688 
<211> 501 



wo 02/083898 PCT/IBO 1/009 14 

377 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 8. .487 



<400> 688 



agctcct 


atg 


agg 


eec 


etg 






Met 
1 


Ara 


Pro 


Leu 


teg 


cct 


CCQ 


cca 


ecg 


etg 


Ser 


Pro 


Pro 


Pro 


Pro 


Leu 


15 










20 


gtc 


ggc 


tec 


tct 


ggg 


cgt 


Val 


Gly 


Ser 


Ser 


Gly Arg 










35 




gac 


egg 


eta 


CQC 


etg 


eeg 


Asp Arg 


Leu 


Arg 


Leu 


Pro 








50 






ttt 


tac 


tat 




ate 


etg 


Phe 


Tyr 


Tyr 


Gly 


He 


Leu 






65 








gaa 


gga 


gcc 


aag 


cag 


gag 


Glu Gly 


Ala 


Lys 


Gin 


Glu 




80 










att 


caa 


tgk 


gtg 


ate 


aat 


He 


Gin 


Xaa 


Val 


He 


Asn 


95 










100 


acc 


egg 


age 


tgg 


etc 


tat 


Thr Arg 


Ser 


Trp 


Leu Tyr 










115 




atg 


gtc 


tee 


age 


aat 


tea 


Met 


Val 


Ser 


Ser 


Asn 


Ser 








130 






gtc 


ctt 


ggt 


aaa 


tec 


tgc 


Val 


Leu 


Gly 


Lys 


Ser 


Cys 



145 



ace tct tgaagaagaa gtac 
Thr Ser 
160 



ceo 


CCQ 


ate 


ggc 


gat 


ate 


Pro 


Pro 


Val 


Gly Asp 


Val 


5 










10 


eta 


ceo 


atq 


eeg gtt 


qtg 


Leu 


Pro 


Val 


Pro 


Val 


Val 










25 




etc 


ata 


acc 


tct 


age 


aac 

M w 


Leu 


Met 


Ala 


Ser 


Ser 


Ser 








40 






etc 


tac 


ttc 


etg 


ggt 


ate 


Leu 


Cys 


Phe 


Leu Gly 


Val 






55 








cag 


gaa 


aag 


ata 


aca 


aga 


Gin 


Glu 


Lys 


He 


Thr 


Arg 




70 










aeg 


ttc 


acc 


ttt 


gcc 


tta 


Thr 


Phe 


Thr 


Phe 


Ala 


Leu 


85 










90 


get 


gtg 


ttt 


gee 


aag 


ate 


Ala 


Val 


Phe 


Ala 


Lys 


He 










105 




get 


gee 


tgt 


tct 


ate 


tec 


Ala 


Ala 


Cys 


Ser 


He 


Ser 








120 






gea 


eta 


cag 


ttt 


gtc 


aac 


Ala 


Leu 


Gin 


Phe 


Val 


Asn 






135 








aag 


cca 


ate 


cca 


gtc 


atg 


Lys 


Pro 


He 


Pro 


Val 


Met 



150 



egg etg gag etg 49 
Arg Leu Glu Leu 

age ggg tct cca 97 
Ser Gly Ser Pro 
30 

tec etg gtg eec 145 
Ser Leu Val Pro 
45 

ttt gtc tgc tat 193 
Phe Val Cys Tyr 

60 

gga aag tat ggg 241 

Gly Lys Tyr Gly 

75 

act ttg gtc ttc 289 
Thr Leu Val Phe 

tgr tgg gat cgt 337 
Xaa Trp Asp Arg 
110 

tat etg ggt gee 385 
Tyr Leu Gly Ala 
125 

tac cca act cag 4 33 

Tyr Pro Thr Gin 
140 

etc ctt ggg gtg 481 

Leu Leu Gly Val 

155 

501 



<210> 689 
<211> 494 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 282, .461 



<400> 689 

attttettaa 

gtgeatgeec 

aggacccgeg 

cggeggagga 

gegagacgcc 



tttttcagaa 
tectcatget 
gcagcaggca 
gaaggtggtg 
tgccttgggt 



etttaaacag 
geecctctgt 
ctggagetgg 
gtctgteacg 
gctggatgcc 



atatgtaatt 
teeagggece 
tggtgcagga 
tggtgggcca 
ggctgtacga 



ttc cat tta cag etg gag ttc eac atg aag gag 

Phe His Leu Gin Leu Glu Phe His Met Lys Glu 

10 15 

cag att age tgg tct ttc ate agt gtg eeg gaa 



taaaagatgc ageeectcet 
acctttcctg teetttgagg 
ggtetccagc gtgctcaggt 
ggctattcag ttcttggtca 
c atg egg etc tec cct 
Met Arg Leu Ser Pro 
1 5 
aag aga gag gac etc 
Lys Arg Glu Asp Leu 
20 

atg gcc gtt aat ate 



60 
120 
180 
240 
296 



344 



392 
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378 

Gin lie Ser Trp Ser Phe He Ser Val Pro Glu Met Ala Val Asn He 

25 30 35 

cag ccc aaa cac tgg ggg agg acc agg tgg ctg aga caa gtg cga tgt • 440 
Gin Pro Lys His Trp Gly Arg Thr Arg Trp Leu Arg Gin Val Arg Cys 

40 45 50 

ctg acg ttc tea agg aca tct tgaagcattt ggctggttct gcctctccat cag 494 
Leu Thr Phe Ser Arg Thr Ser 
55 60 

<210> 690 
<211> 582 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 120.. 530 

<400> 690 

aaagtcgtgc tgcaggcgtc gcgccaatct tcgctctgag gtgctgtctc accggtgaga 60 

cctggaagcg ggcgagtctc gtgctgtgtc ggacctgcag cccctggcct tccgccacc 119 

atg gag tac etc ate ggt ate caa gge ccc gac tat gtt ctt gtc gee 167 

Met Glu Tyr Leu lie Gly He Gin Gly Pro Asp Tyr Val Leu Val Ala 

1 5 10 15 

tec gac egg gtg gee gee age aat att gtc cag atg aag gae gat cat 215 

Ser Asp Arg Val Ala Ala Ser Asn He Val Gin Met Lys Asp Asp His 

20 25 30 

gac aag atg ttt aag atg agt gaa aag ata tta etc ctg tgt gtt gga 263 

Asp Lys Met Phe Lys Met Ser Glu Lys He Leu Leu Leu Cys Val Gly 

35 40 45 

gag get gga gac act gta cag ttt gca gaa tat att cag aaa aac gtg 311 

Glu Ala Gly Asp Thr Val Gin Phe Ala Glu Tyr He Gin Lys Asn Val 

50 55 60 

caa ctt tat aag atg cga aat gga tat gaa ttg tct ccc acg gca gca 359 

Gin Leu Tyr Lys Met Arg Asn Gly Tyr Glu Leu Ser Pro Thr Ala Ala 
65 70 75 80 

get aac ttc aca cgc cga aac ctg get gac tgt ctt egg agt egg acc 407 

Ala Asn Phe Thr Arg Arg Asn Leu Ala Asp Cys Leu Arg Ser Arg Thr 

85 90 95 

eca tat eat gtg aac etc etc ctg get gge tat gat gag eat gaa ggg 4 55 

Pro Tyr His Val Asn Leu Leu Leu Ala Gly Tyr Asp Glu His Glu Gly 

100 105 110 

cca gcg ctg tat tac atg gac tac ctg gca gcc ttg gcc aag gcc ctt 503 

Pro Ala Leu Tyr Tyr Met Asp Tyr Leu Ala Ala Leu Ala Lys Ala Leu 

115 120 125 

ttg cag ccc acg get atg gtg cet tec tgaetcteag tatcetcgac 550 
Leu Gin Pro Thr Ala Met Val Pro Ser 

130 135 
egatactaca caeegaetat ctcacgtgar ag 582 

<210> 691 
<211> 506 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 
<222> 4. .168 

<400> 691 

aag atg gcg gac egt gge gge gtg ggt gaa gee gca get gtt gga gcg 4 8 

Met Ala Asp Arg Gly Gly Val Gly Glu Ala Ala Ala Val Gly Ala 
15 10 15 



wo 02/083898 PCT/IBO 1/009 14 

379 

tct cct gca tct gtc cct ggc eta aac ccg acg eta ggc tgg agg gag 96 
Ser Pro Ala Ser Val Pro Gly Leu Asn Pro Thr Leu Gly Trp Arg Glu 

20 25 30 

cga ctg cgt ccg ggc tgg egg gga ctg ggg cct cgt tgt ggt teg tgg 144 
Arg Leu Arg Pro Gly Trp Arg Gly Leu Gly Pro Arg Cys Gly Ser Trp 

35 40 45 

egg ggc tgg ggc tge ttt acg ccc tgaggatecc tttgaggctg tgtgagaatt 198 
Arg Gly Trp Gly Cys Phe Thr Pro 

50 55 
tggcagcggt gactgtattt ttaaattcat tgacacccaa attctatgtg gcacttacag 258 
ggacctctte attgatatea ggactaatat ttatatttga atggtggtac ttccataage 318 
atggeaeate ttttattgag eaagtatetg taagceattt geaaccactg atgggaggaa 378 
cagagagcag catttcagaa ccaggttctc cttcgaggaa cagagaaaat gaaaecagca 438 
gacagaattt gteaggtgac tacttttcta atgtgttttc agagctgtgt atttaagatt 498 
gagtttgg 506 



<210> 692 
<211> 499 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 64. .459 



<400> 692 



gtectetggg ccacgctcca cttgceegct tccacccgga 


aagceeeeca ggetgagtge 


60 


ggc 


atg 


ate 


tec 


ate 


ace 


gaa 


tgg 


cag 


aag 


att 


ggt 


gtg 


ggg 


ate 


ace 


108 




Met 


He 


Ser 


He 


Thr 


Glu 


Trp 


Gin 


Lys 


He 


Gly 


Val 


Gly 


He 


Thr 






1 








5 










10 










15 




ggt 


ttc 


ggc 


ate 


ttc 


ttc 


ate 


etc 


ttt 


gga 


aea 


etc 


ctg 


tac 


ttt 


gat 


156 


Gly 


Phe 


Gly 


He 


Phe 


Phe 


He 


Leu 


Phe 


Gly 


Thr 


Leu 


Leu 


Tyr 


Phe 


Asp 












.20 










25 










30 






tee 


gtg 


etc 


ctg 


gee 


ttt 


gga 


aac 


ctg 


ctg 


ttc 


ctg 


acg 


ggc 


ctg 


tee 


204 


Ser 


Val 


Leu 


Leu 


Ala 


Phe 


Gly 


Asn 


Leu 


Leu 


Phe 


Leu 


Thr 


Gly 


Leu 


Ser 










35 










40 










45 








etc 


ate 


att 


ggc 


ctg 


agg 


aag 


ace 


ttt 


tgg 


ttc 


ttc 


ttc 


caa 


egg 


cae 


252 


Leu 


lie 


He 


Gly 


Leu 


Arg 


Lys 


Thr 


Phe 


Trp 


Phe 


Phe 


Phe 


Gin 


Arg 


His 








50 










55 










60 










aaa 


etc 


aag 


gga 


ace 


age 


ttc 


etc 


ctg 


ggg 


ggt 


gtg 


gtt 


ate 


gtg 


etc 


300 


Lys 


Leu 


Lys 


Gly 


Thr 


Ser 


Phe 


Leu 


Leu 


Gly 


Gly 


Val 


Val 


He 


Val 


Leu 






65 










70 










75 












eta 


cgc 


tgg 


ccc 


etc 


etc 


ggc 


atg 


ttc 


ctg 


gaa 


ace 


tac 


gga 


ttc 


ttc 


348 


Leu 


Arg 


Trp 


Pro 


Leu 


Leu 


Gly 


Met 


Phe 


Leu 


Glu 


Thr 


Tyr 


Gly 


Phe 


Phe 




80 










85 










90 










95 




age 


etc 


ttt 


aag 


ggc 


ttt 


ttc 


cct 


gtc 


gee 


ttc 


ggc 


ttc 


ctg 


ggc 


aat 


396 


Ser 


Leu 


Phe 


Lys 


Gly 


Phe 


Phe 


Pro 


Val 


Ala 


Phe 


Gly 


Phe 


Leu 


Gly 


Asn 












100 










105 










110 






gtc 


tge 


aac 


ate 


ccc 


ttc 


ctg 


ggt 


geg 


ctg 


ttc 


egg 


aga 


ett 


caa 


ggc 


444 


Val 


Cys 


Asn 


He 


Pro 


Phe 


Leu 


Gly 


Ala 


Leu 


Phe 


Arg 


Arg 


Leu 


Gin 


Gly 





115 120 125 



act age teg atg gtc tgaaaacaga gatgagetcc ttgaacttgg atcattggtt 499 
Thr Ser Ser Met Val 
130 



<210> 693 
<211> 756 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 188. .616 



wo 02/083898 



380 



PCT/IBOl/00914 



<400> 693 

attttattac ccggagtcct ggttcctggg ccagaccggt gtgtcgtttt tggcccaagc 60 
tagagaatgt taagggcttc tgcggtgggt tggtgctaga ggcgccgcga acaggtgctg 120 
cgggggcggc gcgggaggcg gtgcccttgc ttccggatcc ggtctcagct ctgggaggga 180 

229 



277 



acgggag 


atg 


ttg 


eag 


gcg 


ccg 


aga 


ggg 


egg 


gcc 


agg 


gcc 


gca 


etc 


egg 






Met 


Leu 


Gin 


Ala 


Pro 


Arg 


Gly 


Arg 


Ala 


Arg 


Ala 


Ala 


Leu 


Arg 






1 








5 










10 










aga 


etc 


gcg 


gtt 


get 


acg 


cgc 


ace 


atg 


get 


gga 


gcg 


ccc 


acg 


gtc 


teg 


Arg 


Leu 


Ala 


Val 


Ala 


Thr 


Arg 


Thr 


Met 


Ala 


Gly 


Ala 


Pro 


Thr 


Val 


Ser 


15 










20 










25 










30 


ctt 


cct 


gaa 


etc 


cgt 


tea 


etc 


eta 


gcc 


tec 


gga 


egg 


gee 


egg 


etc 


ttc 


Leu 


Pro 


Glu 


Leu 


Arg 


Ser 


Leu 


Leu 


Ala 


Ser 


Gly 


Arg 


Ala 


Arg 


Leu 


Phe 










35 










40 










45 




gac 


gtg 


cgc 


tct 


cgc 


gag 


gag 


gcg 


gca 


get 


ggg 


ace 


ate 


cca 


ggg 


gcg 


Asp 


Val 


Arg 


Ser 


Arg 


Glu 


Glu 


Ala 


Ala 


Ala 


Gly 


Thr 


He 


Pro 


Gly 


Ala 








50 










55 










60 






etc 


aac 


ate 


ccg 


gtg 


tec 


gag 


ttg 


gag 


agt 


get 


etg 


cag 


atg 


gag 


cca 


Leu 


Asn 


He 


Pro 


Val 


Ser 


Glu 


Leu 


Glu 


Ser 


Ala 


Leu 


Gin 


Met 


Glu 


Pro 






65 










70 










75 








get 


gcc 


ttc 


cag 


get 


tta 


tat 


tct 


get 


gag 


aag 


cca 


aag 


etg 


gaa 


gat 


Ala 


Ala 


Phe 


Gin 


Ala 


Leu 


Tyr 


Ser 


Ala 


Glu 


Lys 


Pro 


Lys 


Leu 


Glu 


Asp 




80 










85 










90 










gag 


cat 


etc 


gtt 


ttc 


ttc 


tgt 


cag 


atg 


ggc 


aag 


egg 


ggc 


etc 


cag 


gca 


Glu 


His 


Leu 


Val 


Phe 


Phe 


Cys 


Gin 


Met 


Gly 


Lys 


Arg 


Gly 


Leu 


Gin 


Ala 


95 










100 










105 










110 


ego 


age 


tgg 


ccc 


gga 


gtc 


ttg 


gat 


aca 


etg 


ggg 


etc 


gca 


act 


acg 


etg 


Arg 


Ser 


Trp 


Pro 


Gly 


Val 


Leu 


Asp 


Thr 


Leu 


Gly 


Leu 


Ala 


Thr 


Thr 


Leu 










115 










120 










125 




gag 


cct 


ata 


gag 


aat 


ggt 


tgg 


aga 


aag 


aga 


gtt 


agg 


cag 


gag 


gca 


get 


Glu 


Pro 


He 


Glu 


Asn 


Gly 


Trp 


Arg 


Lys 


Arg 


Val 


Arg 


Gin 


Glu 


Ala 


Ala 








130 










135 










140 







325 



373 



421 



469 



517 



565 



613 



tac tgattgccac ccectggecc cttaatggcc accttaaeta agggtgtgaa 666 

Tyr 

cgggctgact tggtgaattg ggcaactcct tatagtgttg tgeaeacaaa ageateaaat 726 

aaagaacatt taatcaaaaa aaaaaaaaaw 756 



<210> 694 
<211> 563 
<212> DNA 
<213> Homo sapiens 

<220> 
<221> CDS 
<222> 296., 463 



<400> 694 

aatgtttaat gcccgcacct egtcccegge caccccgtga ggeageeaga eactetccaa 60 

aaagcagaga cagcaggaag aggggagtgg aggcagecca ttcacctggg gaaatgactg 120 

ggttgtcgat ggaeggtggc ggcaeccaag ggggacgtgg accegttcta ctatgactat 180 

gagacegtte gcaatggggg cctgatctte gctggactgg ectteatcgt ggggctcctc 240 

ateetcctea gcagaagatt ccgctgtggg ggcaataaga agcgcaggea aatea atg 298 



Met 



aag 


atg 


age cgt aac 


age 


age 


etc 


ggc ggt 


gcc 


ace 


cac 


tgc 


act 


ggg 


Lys 


Met 


Ser Arg Asn 


Ser 


Ser 


Leu 


Gly Gly Ala 


Thr 


His 


Cys 


Thr 


Gly 






5 








10 








15 






gee 


age 


tgg gaa gcc 


aag 


cat 


ggc 


cct gcc 


tct 


ggc 


gee 


tec 


cct 


tct 


Ala 


Ser 


Trp Glu Ala Lys 


His Gly Pro Ala 


Ser 


Gly Ala 


Ser 


Pro 


Ser 






20 






25 








30 








tee 


etg 


ggc ttt aga 


cct 


ttg 


tec 


ccg tea 


etg 


cca 


gcg 


ctt 


ggg 


etg 


Ser 


Leu 
35 


Gly Phe Arg 


Pro 


Leu 
40 


Ser 


Pro Ser 


Leu 


Pro 
45 


Ala 


Leu 


Gly 


Leu 



346 



394 



442 
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381 

aag gaa get cca gac tea atg tgacccccag gtggcatcgc aactcctgcc 493 
Lys Glu Ala Pro Asp Ser Met 
50 55 

tcgtgccacc tcatgcttat aataaagccg gcgtcagaga ccgctgcttc ccycaaaaaa 553 
aaaaaaaaaa 563 



<210> 695 
<211> 502 
<212> DNA 
<213> Homo sapiens 



<400> 695 

agaggcccta 

ctgccagcag 

acaaagtcct 

agccaccacc 

aggtggtgaa 

cagacctgct 

ccaggctaca 

tcactctgca 

gcagcccacc 



cacccaccga 
cttctccaag 
ggaagttttc 
gcccatcacc 
gacccacgag 
cacctacttc 
gatgcactgg 
ggacagaggg 
tatcaccaac 



ggcatggggc 
gcacgggagg 
cccaaaggcc 
tattccctct 
ccggcctcct 
tgcygggcgt 
gagctgtggt 
gcaggcccca 
ag 



tccctgggct 
aagaaattac 
gctgggtgct 
gtggaaccaa 
tcaacctcaa 
cctccacctc 
ccaagccagt 
gggtggagat 



gttctgcttg 
ccctgtggtc 
cataacctgc 
gaacatcaag 
cgtcacactc 
aggtgcccat 
gtctgagctg 
gatctgccag 



gccgtgctgg 
tccattgcct 
tgtgcacccc 
gtggccaaga 
aagtccagtc 
gtggacagtg 
cgggccaact 
gcgtcctcgg 



<210> 696 
<211> 589 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 
480 
502 



<400> 696 

gtccctcctc 

cctgaaggca 

agccctggca 

tgcctctgat 

aaatgtcccc 

tttaggaggt 

tgccccgggt 

tggagatgga 

tagggaactt 

atcatgatga 



ctagttgcag 
ggcgagccag 
ggtctttaag 
ccctgaagga 
arggaaaaca 
ggtgaatacc 
ttkggagagg 
gatgatatgg 
cagttgaggt 
gttttgcctt 



tctggtagtt 
ctgcccagga 
tgcgtttgtg 
agaaaaaaaa 
aakttgtgga 
aggagcctgg 
atgtgcccaa 
aacggttcat 
actctgcgca 
atgccttgaa 



gtcgctggcc 
aaggtggaaa 
cagccgattt 
aaaaaaamcg 
raaggcccca 
aggaaatgtt 
taggcttgtc 
ggaggagatg 
ttcttatagg 
tcttgaggtt 



gtgtgacggc 
gtggtagaag 
caaggctaag 
ggaaaaaaac 
gtgcaraatg 
aaaggggttt 
gataacattg 
agagagctaa 
ggaccctcct 
aataatcat 



tcgctgttgc 
ctgacccctg 
agagaaagac 
tcaacwkgga 
aagcccccgc 
gggctcmacy 
atatgataga 
ggmggaaaat 
caccatgatc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
589 



<210> 697 
<211> 420 
<212> DNA 
<213> Homo sapiens 



<400> 697 

gattcatagg agcatataat ataatctttg taaagtagag ggacaattca gtattttgta 60 

caatatgtag aataacatgt tctcttagta ggcataggat atactacatt aagaaatctg 120 

ggctaagcgt ggtggttcat gcctgtaatc ccagcacttt gggaggccga ggcgggcgga 180 

tcacttgggg tcaggagttc gggaccggcc tggccgacat ggtgaagccc cgtctttact 240 

agaaatacaa aagttggccg ggcgtggtgg tgggcgcctg tggtctcagc tactcggggg 300 

gctgaggcgg gagagtcact tgaacccggg aggcggaggt tgcagtgagc cgagatcacg 360 

ccactgcact ccagcctgga tgacagagtg agactctgtc tcaaaaaaaa aaaaaaaaaa 420 



<210> 698 
<211> 496 
<212> DNA 
<213> Homo sapiens 



<400> 698 

gacataatca gagctatgct ggaggagaag agggcagcca tttgctggct ggcttgcagt 60 

gagccaggag gtggcaggac gagttaggag gctggttcag tagctcgggc aagagcaggg 120 

ccccccagga tctgaaggcc tcccaggccc ccaggcccag cgggtcccag aggagagcga 180 

ggaccccaag gtaactccgg tgagaagggc gaccagggat ttcaaggcca gccaggcttt 240 



wo 02/083898 
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382 

ccgggcccac cgggtccccc tggattccca ggcaaagttg gatcacctgg cccacctggc 300 

cctcaagcag agaagggcag cgaagggatt cgaggcccat caggcctgcc tggctcccct 360 

gggccaccgg gacctcctgg gattcagggc cccgccggtc tggatggttt ggatgggaag 420 

gatggcaagc ctggcttgag gggggacctg gtcctgctgg cccccctgga ctcatgggac 480 

caccgggctt tagggg 496 

<210> 699 
<211> 729 
<212> DNA 
<213> Homo sapiens 



<400> 699 

atctctgagc ctccgcggcc ttctctagca aaatggtgga gccggtaccc ggctgtaagg 60 

caagctggga tgcgcggagt agcgggtgga aaactggast ggagttctca ctcggccgct 120 

gggtgacttc ggtgcactag ggatagtaac agtaccacct cgtagggttg tagaaagacm 180 

aagtgagtta atat.ttgtaa agtgtttaga acagcgtcta ctgcatggaa agtagtattt 240 

atttgttaga gagactccag gaggtcttgc ccagggattg acttctccst tccstgtctc 300 

cascccccas attcmcgttt gtgagccaag ccttgctttt gcaccggccg ggctctcttg 360 

cctggaaatg catacccatc tcaaacctgg gctttaaaat cacctcctct gggacgcctt 420 

cctctacttc tcacagttca ggcctgttct gccccacaca gcagaaaatg cccacgatgc 480 

aataccagag tctgcatggg accatttcaa caacccttag aagctgactc agctgtctcc 540 

tggccctgac ttttctaaga tgagaggagc tgataggaac aatttgggaa ccctcaaagg 600 

ctgagagtat cccacccact tcacagatga agcaaactga ggcccagaga agggaaatta 660 

cttgcccaag atcacccagc aagtaagaaa cagagctgga gatgagctca ggccagcacg 720 

gkaccacgc 729 

<210> 700 
<211> 462 
<212> DNA 
<213> Hpmo sapiens 



<400> 700 

caggaaaatc 

gaagccctgg 

ccccagtcct 

gagaagtgca 

caccccggct 

tgggtgggtg 

accctcagct 

tcatgcctat 



gcgagggacg 
aggctgagga 
ccctgcccgc 
gtaccagcat 
gctacacctg 
acgagctgta 
ctcgggcctg 
ataagttggc 



ggcggccccc 
gagaggtggc 
ctccagggcc 
cgcgaaccag 
tgccgactgt 
ctgtgagaag 
agccgccatg 
atggcaggga 



cgacagtcca 
acgccagcct 
ctggccaccc 
gctgtgcgca 
gggctgaacc 
catgcccgcc 
ccctcagcct 
caatggtggg 



<210> 701 
<211> 417 
<212> DNA 
<213> Homo sapiens 



gctcctttcg 
tcttgcccag 
ctcccaagct 
tccaggaggg 
tgaagatgcg 
agcgctactc 
gcctcactgc 
ca 



gctcttgcag 
ctcactgagc 
ccacacttgt 
ccggtaccgc 
cgggcacttc 
cgcacctgcc 
tgggccaggg 



60 
120 
180 
240 
300 
360 
420 
462 



<400> 701 

aatgctcagc gcgctgcccg gctggggacc cgcgcacctg cagcgcccgc tgctcggccc 60 

tgcatcctgc ctgggcatcc tgcgcccggc catgacggcg cactcattcg ccctcccggt 120 

catcatcttc accacgttct ggggcctcgt cggcatcgcg ggccctggtt cgtgccgaag 180 

gacccaaccg cggagtgatc atcaccatgc tggtcgccac cgccgtctgc tgttacctct 240 

tctggctcat cgccatcctg gcgcastgaa ccccctgttc gggccccagc tgaagaatga 300 

gaccatctgg trcccagctc tcggaatgac tgtggctcca ctgtccctga caaccccttc 360 

gtccggaccc tcccccacac aactatgtct ggtcaccagc tccctcctgc tggcacc 417 

<210> 702 
<211> 470 
<212> DNA 
<213> Homo sapiens 



<400> 702 

gcgcgcccgc cgcgtgccca gcccgggtag ctgctgcagc ctacgctgcc tcggcactgc 
ctgccgcgtg cggagccgga gcccgagcct gagtggcgcc gggcccgacg tggggctcct 



60 
120 
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gggccgcggc ggcgggcggg cgatgctcca gaggcctgac cagccatgga ggccgaggca 180 

ggcggcctgg aggagctgac ggacgaggag atggcggcgc taggcgaaga gctagtgcgg 24 0 

cgcctgcggc gggaggaggc ggcgcgcctg gcggcactgg tcagcgcggc cgcctcatgc 300 

aggaggtgaa tcggcagctg cagggccacc tgggcgagat ccgcgagctc aagcagctca 360 

accggcgtct rcaggcagag aaccgtgagc tgcgcgacct ctgctgcttc ctggactcgg 420 

agcgccagcg cgggcggcgc gccgcacgcc agtggcagct cttcgggacc 470 

<210> 703 
<211> 452 
<212> DNA 

<213> Homo sapiens 



<400> 703 

aatgcgcaag aaggtgcctg cgggactgga gcagagsggc tgcgagggtc ttcccagcgc 60 

aggcgggttt tccagtgtta cttgcggctg ggcgtggggg actagctgcc tttctggcac 120 

aggcaggaag ccgcaaaaag tttctgagcc cccgaacctg tagcggacgt ggaaaaagaa 180 

cgcccctcct caagtgtctg gctgaaagat gccacccagg gaakggaact cgggctagct 240 

aaggaggcca ttcttgatgt tgcttctaga tctcatgtca tcaccgagcc ctcagctgct 300 

ggtggcagct gctcagcaga cccttggcat gggaaagaga cggagtccac cccaagccat 360 

ctgccttcac ttagctggag aggtgctggc tgtggccggg gactgaagcc agctgtgctc 420 

tatgattgca actgtgcagg ggcatcagag ct 452 

<210> 704 
<211> 304 
<212> DNA 
<213> Homo sapiens 



<400> 704 

atttcgggag cgggattgtt tcgcgcagga agcaggctcc attttagcgc cgcccgccgt 60 

cgccatctgt ttcccttccc tcccccatac ctagcccgag tctgagccct aacgagaagg 120 

ctgggcctag gccgctggat gctggagtga aaggaaggga gaaagggaaa aagcgggaag 180 

agtcgagaag ggagtgttaa gaggccaagt gcgacgcgcg tatccgggca gacggactga 240 

cggacgggcc cgtgcttctg ccgcggctgc gcccggggcc tcttaaccac tgatctttga 300 

aata 304 



<210> 705 
<211> 269 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc__feature 

<222> 84,216 

<223> n=a, g, c or t 



<400> 705 

agtcgtggac tcgtgcagct ggggcgtccg cagccgctcg tcacccgcgt gatgctgttt 60 

ctggcgttgg gcagcccgtg gggntcgaac tgcctctctg cggaaggagg actgcattgt 120 

gtgcggccgc cgcgctccga ggtccccggg cctctgtctc ccgggcgtcc tccagsagcg 180 

ggysttcggg gmgggtagcc ggctggagta cgggmncttc gggagccgcg cgccttctcc 240 

ggcgtccggg tcgagcgcag atccctgtt 269 



<210> 706 
<211> 476 
<212> DNA 
<213> Homo sapiens 



<400> 706 

atttgttcag cctccataat tcgaatacca gggcaggccg agccagccgt gcgccgcgct 60 

ccagggccca gggcgccgca cacgcatcca cccacccacc cagcctcgca gcgccatggg 120 

caagaacaag cagccacgcg gccagcagag gcaggggggc ccgccggccg cggacgccgc 180 

tgggcccgat gacatggagc cgaagaaggg cacgggggcc cccaaggagt gcggggaggg 240 

atcattgtct gcttagtggc ctatcttggc ttgtttatgc tttgtgtctc atatcaggtt 300 
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gacgaacgga catgtattca attttctatg aaactgttat actttctgct gagtgccctg 360 
ggcctgacgg tctcgtgtgc tggccgtggc ctttgccgcc caccactatt cgcagctcac 420 
acagtttacc tgtgagacca cactcgactc ttgccagtgc aaactgccct cctcgg 476 

<210> 707 
<211> 266 
<212> DNA 

<213> Homo sapiens 



<400> 707 














ctctgcgtgc 


gccggaggct 


gccgtggcgg 


gtgggccgcc 


tgacttctcc 


tcccggccag 


60 


ttctcgagcg 


cctcaccggg 


cctcgccctg 


cagcctcgct 


ctcgctggcg 


ctgcgcggcc 


120 


taggggactg 


ggctgctggc 


ctccgggtgc 


ggggtggggg 


caggctccga 


cctggggcgt 


180 


cctggcagcg 


cgagccgcgg 


gatgggggcc 


cgggccgcgg 


aggaggcrcc 


gctggygtgt 


240 


cccttggtgg 


agagggcgct 


gccggy 








266 



<210> 708 
<211> 384 
<212> DNA 

<213> Homo sapiens 



<400> 708 

aaaaaacaac atgccacagt taaaacttac ttcagtttgg acaactactc acagctacta 60 

cacagagacc cgaacgagtc actgatatac acctggacca ccaccaatgg atatacaaat 120 

ggcaaacaat tttactccgc cctctgcaac tcctcaggga aatgactgtg acctctatgc 180 

acatcacagc acggccagta tagtaatgcc tctgcattac agcctcgtct tcatcattgg 240 

gctcgtggga aacttactag ccttggtcgt cattgttcaa aacaggaaaa aaatcatctc 300 

accttgagcc cacccccacc ccattgtcta atctgtagaa gctaataaat aatcatccct 360 

ccttgcctag caaaaaaaaa aaaa 384 

<210> 709 
<211> 497 
<212> DNA 

<213> Homo sapiens 



<400> 709 

ctcttttgct 

tcgctctttc 

gtttcttgat 

gaaggaaaaa 

agagatttta 

cctgctgctt 

cacaattatc 

caaggcggat 

tcgacagtgc 



ytaacagaca 
actgctagta 
ctgcttctaa 
caaaataacc 
cttagatgat 
aatcttgccc 
acagatacgg 
gatagcccat 
gaagaat 



agccascgay 
agatcagatt 
aagaagaagt 
tcaactccgt 
ttacacaatg 
ctgamcctct 
agttgccacc 
gtaaagcaat 



tttaggctgg 
gcgtttcttt 
agagaagata 
tttgaaaaaa 
aagaaagtac 
taatgctgat 
actgaaactt 
catgaaaaga 



ataatagtca 
cagttactct 
aatcctgtct 
acattccaag 
atgcactttg 
tctgaggaag 
atgcattcat 
tttttcttca 



aattcttacc 
tcaatcgcca 
tcaatacctg 
aactttcatc 
ggcttctgtc 
atgaagaaca 
tttgtgcatt 
atattttcac 



60 
120 
180 
240 
300 
360 
420 
480 
497 



<210> 710 
<211> 471 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc feature 

<222> 423 - 

<223> n=a, g, c or t 



<400> 710 

attcccaact gccagtgatc tctgaagccg actctgaggc tccctctttg ctctaacaga 60 

cagcagcgac tttaggctgg ataatagtca aattcttacc tcgctctttc actgctagta 120 

agatcagatt gcgtttcttt cagttactct tcaatcgcca gtttcttgat ctgcttctaa 180 

aagaagaagt agagaagata aatcctgtct tcaatacctg gaagaaaaac aaaataacct 24 0 

caactccgtt ttgaaaaaaa cattccaaga actttcatca gakattttac ttagatgatt 300 

tacacaatga agaaagtaca tgcactttgg gcttctgtat gcctgctgct taatcttgcc 360 



wo 02/083898 

cctgcccctc ttaatgctga 
gantgccacc actgaaactt 

<210> 711 
<211> 405 
<212> DNA 
<213> Homo sapiens 

<400> 711 

aatgatcctg cagcagccct tgcagcgagg cccccaggga ggggccagcg cctcccgcgg 60 

gccgccttgg gggtgacttg gggcctggac gccagctccc ctctccgagg agctgtgccc 120 

atgagcacca agcggcgcct ggaggaggag caggagcctc tgcgcaagcr rtttctgtct 180 

gaggagaaca tggccaccca cttctctcaa ctcagcctgc acaatgacca cccctactgc 240 

agccccccca tgaccttctc cccagccctg cccccactca ggagcccttg ctctgagctg 300 

cttctctggc gctatcctgg cagcctcatc cctgaggccc tccgtctgct gakgctgggg 360 

gacaccccca tgttccccct accctgcaac cccagctggg gacat 405 

<210> 712 
<211> 160 

<212> DNA 

<213> Homo sapiens 



PCT/IBOl/00914 

385 

ttctgaggaa gatgaagaac acacaattat cacagatacg 420 
atgcattcat tttgtgcatt caagtcggat g 471 



<220> 

<221> misc_feature 
<222> 105 

<223> n=a, g, c or t 



<400> 712 

aagaaaccca atgatcctgc agcagccctt gcagcgaggc ccccagggag gggccagcgc 60 

ctcccgcggg ccgccttggg ggtgacttgg ggccwggacg ssagntcccc yctccwagga 120 

gctgtgccca tgagcaccaa gcggcgvctg gaggaggaga 160 

<210> 713 
<211> 427 
<212> DNA 
<213> Homo sapiens 



<400> 713 

caatcttcgc tctgaggtgc tgtctcaccg gtgagacctg gaagcgggcg agtctcgtgc 60 

tgtgtcggac ctgcagcccc tggccttccg ccaccatgga gtacctcatc ggtatccaag 120 

gccccgacta tgtkctkkkk hvggcytccg accgggtggg ccgccagcaa tattgtccag 180 

atgaaggacg atcatgacaa gatgtttaag atgagtgaaa agatattact cctgtgtgtt 240 

ggakaggctg gagacactgt acagtttgca gaatatattc agaaaaacgt gcaactttat 300 

aagatgcgaa atggatatga attgtctccc acggcagsag ctaacttcac acgccgaaac 360 

ctggctgact gkcttcggag tcggacccca tatcatgtga acctcctcct ggctggctat 420 

gatgagc 427 

<210> 714 
<211> 433 
<212> DNA 
<213> Homo sapiens 



<400> 714 

agactgcgtg cagaaggtga ctgtctcagt ggagctgggt catctcagca ggccttggct 60 

ccttgaactt ttggccgcca tgtgcttccc gaagtcctct ctgatgacat gaagaagctg 120 

aaggcccgaa tgcaccaggc catagaaaga ttttatgata aaatgcaaaa tgcagaatca 180 

ggacgtggac aggtgatgtc gagcctggca gagctggagg acgacttcaa agagggctac 240 

ctggagacag tggcggctta ttatgaggag cagcacccag agctcactcc tctacttgaa 300 

aaagaaagag atggattacg gtgccgaggc aacagatccc ctgtcccgga tgttgaggat 360 

cccgcaaccg aggagcctgg ggagagcttt tgtgacaagg tcatgagatg gttccaggcc 420 

atgctgcagc ggc 433 



<210> 715 
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386 



<211> 326 
<212> DNA 
<213> Homo sapiens 

<220> 

<22l> misc_feature 
<222> 126 

<223> n*=a, c or t 



<400> 715 

atggcggtct 

ctcgcccaac 

aaactnccgc 

tttgactgga 

cgcagatcca 

ctgacctgtg 



tggcacctct 
cttactacct 
ygctctgcca 
gagaaktgga 
tgccccatga 
acatctctgt 



aattgctctc 
tctgtcggcc 
cggtctgccc 
gatcctgatg 
agagggacaa 
gtttca 



gtgtattcgg 
ctgctctctg 
acccaacgcg 
tttctcagtg 
agaatgggat 



tgccgcgact 
ctgccttcct 
aagacggtaa 
ccattgtgat 
ataggagtgc 



ttcacgatgk 
actcgtkagg 
cccgtgtgac 
gatgaagaac 
acagcccagc 



60 
120 
180 
240 
300 
326 



<210> 716 
<211> 507 
<212> DNA 
<213> Homo sapiens 



<400> 716 

agactgaggc 

ccacggctgc 

tgcgcaccag 

agaaagaagt 

tcttgggacc 

aagggaaaat 

aagtcccaga 

agatcatcag 

tcaaggggat 



tgaggctggg 
ggctcgtgcc 
cccgcctgta 
tttcccattt 
cgtggaaaaa 
cccagatgaa 
agaatatggt 
catggatggg 
catcttggct 



<210> 717 
<211> 561 
<212> DNA 

<213> Homo sapiens 



gaacatcggg 
tgccggggtc 
cgagctttcg 
ccagaagtta 
ttcttcactg 
actttggaga 
ggcctgggct 
tccatcactg 
ggcactg 



cagcatgagc 
tggtggtctc 
ccaaagagct 
gccaagatga 
aagaggtgga 
aattgaagag 
tctccaacac 
tgaccctggc 



gctgcggctc 
taccgcgaac 
tttcctaggc 

acttaatgaa 
ctcccgaaaa 
cctagggctt 
catgtactca 
agcgcaccag 



ttcctgcgca 
cggcggctac 
aaaatcaaga 
atcaatcagt 
attgaccagg 
tttgggctgc 
cgactagggg 
gctattggcc 



60 
120 
180 
240 
300 
360 
420 
480 
507 



<400> 717 

cttcagaatg ctggatgaaa aaattgaaaa gggtcgggat tactgttcgg aagaagagga 60 

tatcacatag caccaatttt accactcaaa ccaggagcta ctactgtgta aataggttac 120 

accccagttg aaatctttgc aaaggtcggt tctattcagc gaacagcact atagcaaaag 180 

aagatcgttc catattgtac gccccattaa attacagtgt ttcttaatga acttgcaaag 240 

gaatattgct aaaaacaaac aaaaaaaact gttatctaac tttctttgtt gctgctagtt 300 

aaaacttgtt gcaacttttc acttctcttg tgtccaggta tgcagcaaaa ttctgcaatt 360 

tcaccttaaa gatactgttg gttttacaga tgctctccaa cctattttct ataagatgag 420 

gtagtggtga actcagataa caaacttctc ttctaaactg gttctgcttc taagacaagc 480 

atctcctgcc ctctctcctt cctccccatc tctcgcacgc agtctagaga tggactgagc 540 

cttgcttctc actggcagtg t 561 

<210> 718 
<211> 530 
<212> DNA 

<213> Homo sapiens 



<400> 718 

aacatctggg gacagcggga aaacatgagt gactccaagg aaccaagggt gcagcagctg 60 

ggcctcctgg aagaagatcc aacaaccagt ggcatcagac tttttccaag agactttcaa 120 

ttccagcaga tacatggcca caagagctct acagggtgtc ttggccatgg cgccctggtg 180 

ctgcaactcc tctccttcat gctcttggct ggggtccttg tggccatcct tgtccaagtg 24 0 

tccaaggtcc ccagctccct aagtcaggaa caatccgagc aagacgcaat ctaccagaac 300 

ctgacccagc ttaaagctgc agtgggtgag ctctcagaga aatccaagct gcaggagatc 360 

taccaggagc tgacccagct gaaggctgca gtgggtgagt tgccagagaa atccaagctg 420 
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caggagatct accaggagct gacccggctg aaggctgcag tgggtgagtt gccagagaaa 480 
tccaagctgc aggagatcta ccaggagctg acccggctga aggctgcagt 530 

<210> 719 
<211> 512 

<212> DNA 

<213> Homo sapiens 



<400> 719 

aataatgctc 

ccaggggcag 

tagcagcctg 

tctgtcaacc 

ataccaacca 

agagccaaag 

ttctgcctag 

ttactcatgt 

gaaaaaacgg 



ccatcaaatg 
gatagcctga 
gctggaagat 
cggctgcatt 
gacttcttct 
ccagtctggc 
ctgtgctcac 
cctcaaagac 
agaaatatta 



<210> 720 
<211> 507 
<212> DNA 
<213> Homo sapiens 



tcatcaactt 
agaaacgtct 
tctgagtgct 
aaatcctgcc 
ttcttatgat 
tggaactctg 
tgctgtgctg 
gactcatgay 
attggaaagt 



ctcccaagca 
acaggcaaaa 
ctgtctgccc 
tcattgcagt 
tatcattcac 
tctctgatgc 
cagtggaaac 
gctggatatg 
tt 



gagaaacccg 
gtcaaagtta 
tggtgggttt 
gtaagttgga 
cttacaccat 
tggtttctac 
agactgtctg 
aagaactatt 



aacccaccaa 
ttggggtgca 
cattctcctg 

cgaaaaggat 
ggactgccat 
tgtgttggag 
acttccctgg 
gacttcttgg 



60 
120 
180 
240 
300 
360 
420 
480 
512 



<400> 720 

acctcgccag ggccggggcc gggacgggac ggaaaactgg cggcttggtt tgttcagcgc 60 

gtcgcgtcat ggctctcgtt tcaggcctcc acggtgtgca cctggagtca ggagcgcgct 120 

tgtgcgtaca gttcggttca gttactaaaa gatgtaactg aactgcagat ccttggtgaa 180 

atatctttca acaaatctct atatgaggga ctgaatgcag agaaccacag aactaagatc 240 

actgtcgtct tcctgaaaga tgagaagtac cattctttgc ctatcatcat taaaggcagc 300 

gttggtggac ttctggtgtt gatcgtgatt ctggtcatcc tgttcaagtg tggctttttt 360 

aaaagaaaat atcaacaact gaacttggag agcatcagga aggcccagct gaaatcagag 420 

aatctgctcg aagaagagaa ttaggacctg ctatccactg ggagaggcta tcagcagtcc 480 

tgggacttgg agacccagca tcctttg 507 

<210> 721 
<211> 408 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 261 

<223> n=a, g, c or t 



<400> 721 

acctttttta ctttcacagc aatagtgcag aatccagaat ggatgtcctc tttgtagcca 60 

tctttgctgt gccacttatc ctgggacaag aatatgagga tgaagaaaga ctgggagagg 120 

atgaatatta tcaggtggtc tattattata cagtcacccc cagttatgat gactttagtg 180 

cagatttcac cattgattac tccatatttg agtcagagga caggctgaac aggttggata 240 

aggacataac agaagcaata nagactacca ttagtcttga aacagcacgt gcagaccatc 300 

cgaagcctgt aactgtgaaa ccagtaacaa cggaacctgt ggaagaaggc tgctatgact 360 

ctttggatgg gagtctggca agaggaaatt ggaagataaa ataaataa 408 

<210> 722 
<211> 371 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc feature 

<222> 281 ^ 

<223> n=a, g, c or t 
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<400> 722 

agagaaagat gttccaggca gagggaagag gcctctttcc tccaggcctg gagctttcat 60 

gacgctggtc attatcttgc tgcatgtatt ctggggcatt gtattttttg atggctgtga 120 

gaagaaaaag tggggcatcc tccttatcgt tctcctgacc cacctgctgg tgtcagccca 180 

gaccttcata agttcttatt atggaataaa cctggcgtca satttataat cctggtgctc 240 

atgggcactg ggcattctta gctgcgggag gcactgccga nagcctgaaa ctctgcctgc 300 

tctgccaaga caagaacttt cttctttacr accagcgctc cagataacct cagggaacca 360 

gcacttccca a 371 

<210> 723 
<211> 404 
<212> DNA 
<213> Homo sapiens 

<400> 723 

aagctaggag ccgcgcragt cgtagtgtcg ctgtttgcgg gtctccgcgc gggaccgggg 60 

cgcagggggt cgctgaggcg agggtgtcat gtcagacaac gaggacaatt ttgatggcga 120 

cgactttgat gatgtggakg aggatraagg gctagatgac ttggagaatg ccgaagagtt 180 

ttagtagtga cggggttcac cgtgttagcc aggatggtct cgatctcctg ccttgtgatc 240 

cgcccgcctc ggcctcccaa agtgctggga ttacaggaag gccaggagaa tgtcgagatc 300 

ctcccctctg gggagccgac cgcagccaac cagaagcgaa tcaccacacc atacatgacc 360 

aagtacgagc gagcccgcgt gctgggcacc cgagcgctcc agat 404 

<210> 724 
<211> 486 
<212> DNA 

<213> Homo sapiens 
<400> 724 

aaagtggaag gcaacgacac acggcccgtg gcgccttcag ctagggagaa agagaggacg 60 
ggggctggcg ggcgggatta gctgccggag gctgagtttc cgcagcctga gtctccatct 120 
gtcgctagac tggagtgcag tggcgcgatc ttggctcact tcaacctcca catcctgggt 180 
tcaagcgact ctcctgcctc agcctcccaa gtagctggga ctacaggtgc gcaccaccat 240 
gttcaactaa tttttgtaat tttagtggag acgaggtttc accatgttgg ccaagatggt 300 
ctcgatctct tgaccttgtg atccacccac cttggcctcc caaaggtact gagattahag 360 
atgtgagcca ctgcgsctag ccagaaaaat atttattgat cctaaagtgg gagtcagcgc 420 
cactgttgmt gctagtgttg aatccagaam catagctgga catgggatca gaggatgaac 480 
aagaga 466 

<210> 725 
<211> 374 
<212> DNA 

<213> Homo sapiens 
<400> 725 

aaaaaaaaag tcctagctag ggtcccaagt gacagccagc atcaactata acatgtaaag 60 
gagccagtct tcagatgatt ccagcctcag ccttcaggcc actcagctgg tgccaaatag 120 
agtagggatg agctgtcccc acagagacct gcccagtgca cattgtgaga aagccccaga 180 
atgttgacag tcgctctcct agcccttctc tgtgcctcag cctctggcaa tgccattcag 240 
gccaggtctt cctcctatag tggagagtaa ggaagtggtg gtggaaagcg attctctcat 300 
tctggcaacc agttggacgg scccatcacc gccctscggt ccgagtcaac acatactaca 360 
tcgtaggtct tear 374 

<210> 726 
<211> 523 
<212> DNA 

<213> Homo sapiens 
<400> 726 

aacaagatgg cggcgctgcg ggacggctag cggccctgcg tggaggcgag gaatccgcat 60 
ctatggagat gtccctgcat cccatgactc ggagctgatg gccttcatga cgaggaagtt 120 
gtgggacctg gagcagcagg tgaaggccca gactgatgag atactgtcca aggatcagaa 180 
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gatagcggcc ctagaggacc tggtgcagac cctccggcca cacccggccg aggcaaccct 240 

gcagcggcag gaggaactgg agacgatgtg tgtgcagctg cagcggcagg tcagggagat 300 

ggagcggttc ctcagtgact atggcctgca gtgggtgggc gagcccatgg accaggagga 360 

ctcagagagc aagacagtct cagagcatgg cgagagggac tggatgacag ccaagaagtt 420 

ctggaagcca ggggactcat tggcgccccc tgaggtggac tttgacaggc tgctggccag 480 

cctgcaggat cttagtgagc tggtggtaga gggtgacacc aaa 523 

<210> 727 
<211> 492 
<212> DNA 
<213> Homo sapiens 

<400> 727 

attttgcgaa cggcgagcag cggcggcggc gcggagagac gcagcggagg ttttcctggt 60 

ttcggacccc agcggccgga tggtgaaatc ctccctgcag cggatcctca atagccactg 120 

cttcgccaga gagaaggaag gggataaacc cagcgccayc catccacgcc agccgcacca 180 

tgccgctcct aagcctgcac agccgcggcg gcagcagcag tgagagttcc agggtctccc 240 

tccactgctg tagtaacccg ggtccggggc ctcggtggtg ctcctgatgc ccctcaccca 300 

cccctgaaga tcccaggtgg gcgaggagga ggagtagggy bgcctcgggg ctgggcatcc 360 

ggcccctggg gccacccctt gtcagccggg tgggtaggaa ccgtagactc gctcatctcg 420 

cctgggtttg tccgcatgtt gtaatcgtgc aaataaacgc tcactccgaa tttagcggtg 480 

tatttcttga ag 4 92 

<210> 728 
<211> 467 
<212> DNA 
<213> Homo sapiens 

<400> 728 

acagggccac tgctgctcac agaagcagtg aggatgatgc caggatgatg tctgcctcgc 60 

gcctggctgg gactctgatg cccagccatg gccttcctct cctgcgtgag accagaaatc 120 

tgggagccct gcgtggaggt gtgaaatcca gacaattgaa gatggggcat atcagagcct 180 

aagccacctc tctaccttaa tattgacagg aaaccccatc cagagtttag ccctgggagc 240 

cttttctgga ctatcaagtt tacagaagct ggtggctgtg gagacaaatc tagcatctct 300 

agagaacttc cccattggac atctcaaaac tttgaaagaa cttaatgtgg ctcacaatct 360 

tatccaatct ttcaaattac ctgagtattt ttctaatctg accaatctag agcacttgga 420 

cctttccagc aacaagattc aaagtattta ttgcacagac ttgcggg 467 

<210> 729 
<211> 528 
<212> DNA 
<213> Homo sapiens 

<400> 729 

atctttggga gctgcatttt tgataaataa gaaaaacctc ctcattttaa tgaagtagtt 60 

tcaatggagc aaaaacatgg aattaaagaa ccacagttca actccatcaa tgaagtaaca 120 

tcatttgcca actaacaaac tgagtctaca tctctttatt tgcaatgtaa aaccccatcc 180 

tgttagggga aaaaaaaata gtactaatgt tggatttaga agtccggagt tcaattttag 240 

gctttggaga gaagtaatcc acttttactt gagatttcca atctcatatc accaccatca 300 

tcattatcac cattatcatc accaccacca ccaccatgag caaccaacat catcatcatc 360 

aagaacaact tgcttttgct agccaaagca atgggaacac agcactctat tagggggcaa 420 

ggtattaatt tgctttttat ttaattataa caaaagcgaa atacatgttt ttgtaatttc 480 

ctatcaataa tttagaagaa ttaaatataa atataaaata aaaaccaa 528 

<210> 730 
<211> 379 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 221,233 

<223> n=a, g, c or t 
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<400> 730 

gtcggcaggt ctgaggcggc gctgtgtgtg tgaagcgtac ctagggcggg aggcgacatg 60 

gagacagggg cggccgagct gtatgaccag gcccttttgg gcatcctgca gcacgtgggc 120 

aacgtccagg atttcctgcg cgttctcttt ggcttcctct accgcaagac agacttctat 180 

cgcttgctgc gccacccatc ggaccgcatg ggcttycccg nccbkrggcc ggngaagccy 240 

tgggksctgc aggtattcaa aacctttgac cacatgrccc cgtcagratg atgagaagag 300 

aaggcaggaa cttgaagaga aaatcagaag aaaggaagag gaagaggcca agactgtgtc 360 

agctgctgca gctgagaar 379 

<210> 731 
<211> 103 
<212> DNA 

<213> Homo sapiens 



<400> 731 

aaagcaaccg cgtggtacga gtaacaactc caccggctgg cgtccgckgc gcttggcsac 60 
aagtcacttt tccagatcta ggagcaccat ggacacctcg tea 103 

<210> 732 
<211> 842 
<212> DNA 

<213> Homo sapiens 



<400> 732 

actgtttacc 

aactgtacat 

ttttaggaaa 

tgcctccatt 

agtttttatt 

atcagcacaa 

ctatggttca 

aatcttgacg 

aggctgggta 

attttcccac 

tgacacaagc 

agccaattct 

acttctccca 

tctgatgcca 

ta 



aaacaccttg 
tcacaactta 
attatagtga 
atataggtcc 
atttcttctt 
agcctgatag 
agtcactaaa 
gtttctgccc 
ctttcatact 
atacattttt 
agctaataac 
ctgggtgtcc 
ctcttggcat 
aagggttaaa 



gtcataataa 
tgtttctttg 
aaatttgaca 
ttccaggaac 
ttccttttgt 
tactttacta 
gattcatttt 
gtcctaatgg 
ttgttttctt 
acattgtacc 
catttctggg 
agtgagtggt 
gtaagaaata 
gcttcttgga 



tgtcattagt 
agattaatag 
gttgattggc 
tcaaacactg 
tttcataata 
aaatgactgc 
tgttgagtcc 
cagagctctc 
gttttgcttt 
ttaggactca 
tttctgccta 
ggcttttttt 
agcatttaca 
tttcatttca 



ttctccattt 
atattggggg 
ataatttctt 
taagtgaaat 
taatgcagtt 
attctttgga 
ttatgagaaa 
tgacttgggt 
aaaactacga 
gtcatctcca 
accccctaat 
ctttccacat 
taattggaaa 
ttgatataca 



ttattttctg 
aaaaacgcct 
gtttgaatgc 
atgggagtat 
tgttcaggaa 
ttccttcagt 
cagcagtatg 
gtatgctgcc 
ctcagcatac 
cttaaattga 
tgtctgttaa 
tggcacattc 
aatctggatt 
gccactattt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
842 



<210> 733 
<211> 351 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc feature 

<222> 333,345 

<223> n»a, g, c or t 



<400> 733 

agcaacgggg 

atccagggat 

tggacgtgca 

cgccgggacg 

gtctcgctgc 

gtcgccctgt 



tttcctgaac 
tcttctaccc 
gcggcctcct 
tgctgcatat 
tctccaagca 
ccaaagagtc 



<210> 734 
<211> 200 
<212> DNA 

<213> Homo sapiens 



cttcgggcca 
atggaaaaac 
cctcctgagc 
gcccggtttc 
ccgagggacc 
cagggatgtg 



cccgaatcca 
gtgcggagga 
tgcagcatct 
ctggcgggtc 
aagaagacgc 
ganggaggca 



ccttctcaga 
agcacatgtg 
gcttgttctg 
cgtgtgacat 
cgtccacggg 
ccgangggga 



cggaaataga 
ggcgctggtc 
gtgggccaag 
gtccaagtcc 
cagcgtgcca 



60 
120 
180 
240 
300 
351 
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<220> 

<221> raisc_feature 
<222> 43 

<223> n=a, g, c or t 



<400> 734 

tttgcttgac tgcttcctcc taaccctcta sccactagca ckntacttcc taaagctgtt 60 

gtgtcattaa ctctgttgga tcaactctct gggaaaagat tctgttaatg taagtgcact 120 

tactccctgg atgttgtcac tagtctagtg gcttttgcta aataaacctt tcttatttct 180 

agaaaaaaaa aaaaaaaaaa 200 

<210> 735 
<211> 160 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 118 

<223> n=a, g, c or t 



<400> 735 

aaaaggactt gaattctgat cgggctgcag acgctctgca ccccgcccaa gcttctatgg 60 

tactttggtg tcctctctag cactggcaag tcgccttcct cttcgcccgg cgccagdntc 120 

ccctctctat gatcgccgag ttcccggcgc cgtctcgttk 160 



<210> 736 
<211> 369 
<212> DNA 

<213> Homo sapiens 



<400> 736 

gcactggatg cacgcagcca catcagaaag gcggttaaga ggggcggggg aagcgctctg 60 

atccacgatg tcctgcgagc tgatcaatca gcttcacgct ggaaataacc atataaaaac 120 

ctgaactcaa agaagagaac tggaataatc ctccctaatc ccacttccca tggtgagccc 180 

cagctgggac gctgacacat ggccccaact ttttctcctg cctgatgtct ggacccacag 240 

gggacagatg gtggtccctg gtcactgaag tacccacaga acccatgcaa gaggcacaaa 300 

accatctgct tagtggtcat gtggcagcag acagtatctc agaagtcctc gcagcatatg 360 

acttcactg 369 



<210> 737 
<211> 382 
<212> DNA 
<213> Homo sapiens 

<400> 737 

aaacacacag atcgtcccca ggctgctgtt cgagaatgag tattcctata cgacccagat 60 

agactacaac cccaaggacc gcctgctcta tgcctgggac aatggccacc aggtcactta 120 

ccatgtcatc tttgcctact gacacccttg tccccacaag cagaagcaca gaggggtcac 180 

tagcaccttg tgtgtatgtg tgtgcgcgca cgtgtgtgta ggtgggaggc aggcaacacc 240 

aggagcagaa atgaaagagg caagaaataa gtgctatgtg gcgagaaaaa aagttttaat 300 

gtattggaga agttttaaaa aacccagaaa aacgcttttt ttttttaata aagaagaaat 360 

ttaaaatcaa aaaaaaaaaa ar 382 

<210> 738 
<211> 406 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc feature 
<222> 197 
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<400> 738 

agttctagag gctgggaagt ccgggattga ggggccagca tctaccaagg gccttcttgt 60 

tacttcatcc catgatggaa ggtggaaggg taaagagagg agaagaagca gtaccgggaa 120 

tcttacatca gtgacaacct ggacctcgac atggaccagc tggaaaaacg gtcgcgggcc 180 

agcgggagca gtgcggngca gcatgaaaca caagcgcctg gtsccgkcca tycaacsgcm 240 

akccacagca gttcccacac ctcgggcatt gaggcagaca ccaagccccg ggacacgggg 300 

ccggaagaca gctactccag cagtgcsats caccgcaagc tgaaaacctg cagctcaatg 360 

accagtcatg gcagctccca cacctcaggg gtggagartg gcggca 406 

<210> 739 
<211> 502 
<212> DNA 
<213> Homo sapiens 



<400> 739 

cactttgaaa gaaaactgcc attttgtatt taatagcctc caggttcttt cagtaatgtt 60 

atttgctctg tgtgtttttg tgtgtttgtt gatgtgcgtt tgtgcatatg cgtgagtttc 120 

attgccgggg ttggggcaca attgtggact ggggccatga ggcttccctg gtccccactg 180 

aacccacctt agttccacat ttggctgcat cttgaattat gccgactcca gacttctcct 240 

ccttttttgc ccttggctct tgacactcta aacccctgga ccatctgaat ggagcagcca 300 

agttcagtcc cacatttctg tactgttcct ctttcacagc tggaatatgt cacatgatga 360 

agttgtatag aaacagaacc atggatggat ggccaggatt gccgtggtcc ctagctagat 420 

ccccttccta tcaatcacct gatagcaaca gggacagctg ccaataccct gctctttact 480 

caatggtacc agggagggag ca 502 



<210> 740 
<211> 367 
<212> DNA 
<213> Homo sapiens 



<400> 740 

cagcaagccc 

atcgccatcc 

tacacctgct 

gtgagaagat 

gttccagagg 

ctgggacacg 

ttttttt 



aggcacttcg 
tgtctatcct 
tcaacagctg 
acaccacgca 
ggcccacgtg 
ccctgaagct 



ggctgtcccc 
cctggccgtc 
caggagcact 
cctcgcgttc 
tccctccacc 
tctggaccct 



ggcccccaga 
ctcctcacgg 
tccctatcag 
agcactcctg 
tcctccctgg 
gagagagatt 



gcccgggcac 
tcctcctggc 
gcccagagga 
ccgagggggc 
cccaggctgc 
gtttcttttc 



cgtggtcatc 
tgtgctcata 
ggcagggagt 
ttcctgaggg 
agagcctgag 
tttactattt 



60 
120 
180 
240 
300 
360 
367 



<210> 741 
<211> 427 
<212> DNA 
<213> Homo sapiens 



<400> 741 

aatgttcttg 

ccattcttgg 

gcctttggga 

ccttcttgcg 

caagggcggg 

caaggccagc 

taagaaaggc 

ggctttt 



tgaatcttag 
tggattttat 
ttttctaccc 
caasgcagtt 
gccacatcca 
cgccccctgt 
ttttctctcc 



<210> 742 
<211> 408 
<212> DNA 
<213> Homo sapiens 



acaaccccca 
gagatgtcaa 
acccagcaac 
gtaccatgaa 
ctgcagctga 
cttccctctg 
agtcattctg 



caacatcatt 
cactcctcat 
caccagaggt 
tgcggtcgtc 
gctcagagcc 
gagctgtttg 
cttgctgatc 



ccctgcttta 
taggattccc 
gcagaaaccc 
ctgctgcaga 
ttctgtgggg 
aggaatgcag 
aagaacactt 



tttgcttagt 
tccttctgaa 
ctgcagacat 
ccacgcccac 
tctgcagtgg 
ttcttttgca 
aacagcttca 



60 
120 
180 
240 
300 
360 
420 
427 



<400> 742 

tcatctatgt ttatcacatt ttaataccac agcacttata atgatgtcac tacatataga 



60 
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agctcaaagt taagggattt gctgaagact gtaaagttaa tggaagaatt gagacaaaaa 120 

tccagtgtag ctggccactt atcccagggc tttttctact tcatcacaag gaatgttttg 180 

aaagtgtctg ctttttttat ccttaaaatt cacctgtcag ggaggcatta aaaatttgga 240 

aatgtatgcc agcaaaatgt gagctctgta ttttttggca ttcttatgtt tgggtttaat 300 

aagattaaga aaatgatact gggaattttc tttttcctga aactttgaat caccctagta 360 

agtcaaagta ctaaaaaatg tactagatca ttaagactta tgtgctct 408 

<210> 743 
<211> 449 
<212> DNA 
<213> Homo sapiens 

<400> 743 

aacttgttat ctattgcagg aaagtgcctg caatatgcca ttgtgacaag gcaccatcaa 60 

tcatcatgtc cctttaccag gagggcccgt gtcataagcc actatataaa gggaatgatg 120 

ggtggactgc tttggacaaa tggacctgcg gtaggagaga gggacaacag taggagcagg 180 

cagatcttgc tgtttcaacc aaaacctcat gctgaccaga gttgaggaac agaagaagat 240 

ggtgaaggcc tgcagtcagc agaatatgtc aatgttttgg ttgaagaagc tgcttgaatc 300 

tgggcttttc tgtgccatgt gttctcccag ggccagcaca aagawgggct tttggtgcag 360 

gcccaagacc accataatca tcattgatta ttcctctcca cgccagktct ctaaataaac 420 

tttctcttct ttctctgaaa aaaaaaaar 449 

<210> 744 
<211> 207 
<212> DNA 
<213> Homo sapiens 

<400> 744 

atttccggtt ccggtgtcag ttcgaggcgc cgccgccgcc gccgcagccg ccggagccgc 60 

aatgcctaaa ggaggaagaa agggaggcca caaaggccgg gcgaggcagt atacaagccc 120 

tgaggagatc gacgcgcast gcaggctgag aagcagaagg ccagggaaga agaggagcaa 180 

aaaaaaaaaa aaaaagctta tggatac 207 

<210> 745 
<211> 208 
<212> DNA 
<213> Homo sapiens 

<400> 745 

agaacatcct aatcrwaaaa gttcgaggcg ccgccgccgc cgccgcagcc gccggagccg 60 

caatgcctaa aggaggaaga aagggaggcc acaaaggccg ggcgaggcag tatacaagcc 120 

ctgaggagat cgacgcgcas tgcaggctga gaagcagaag gccagggaag aagaggagca 180 

aaaaaaaaaa aaaaaagctt atggatac 208 

<210> 746 

<211> 442 

<212> DNA 

<213> Homo sapiens 

<400> 746 

cgtcatttag gcttttggga tttgttggta tggaaaaata catggattct catggtactc 60 

ggtggtttga gtagtctggc gttctctagg acactgtggg gaggtaggac gcagttgtct 120 

gcccctgtgc cccaaacatt cagctcagga tggctgctga gctkggkttt gggaaaactc 180 

acaaccctgt ggatgtaaca ggagtcagca gttttgatca ctggttcatg gaaacatgtg 240 

agaatatata tgaacatctt ttgagtcttg ctggtattgt cattcattct ttcaatcact 300 

aattcattca tttatacatt cagccccttg tttaaatatg tgcttcctgc cggtcagcag 360 

gaagtcttaa gaatggagcc tggggtccaa atgagagccc tcagtggctg ttcaggggsc 4 20 

aatccgttat gtacagtggg to 442 

<210> 747 

<211> 487 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 
<222> 221 

<223> n=a, c or t 
<400> 747 

tctgtggaaa ggcgaggtct ggaaagcagc ccctcccctg ggagagagcs cagggcaggc 60 
atgctgtcct gtttccccca ctcsscaaca gggctggcst ccggctctgt cataccctat 120 
ctactcctgc cttcttccag ctggcattcc ctggcagcga tggcggggaa actggcagac 180 
atgagcaggt gccatctgta tccaccacat actcctgctg ntgtggtgtg gcgaagacaa 240 
ggctgcccag ccccaacacc acgaggcacc ctcgggcatg ctgttgcttc tgggaccagt 300 
gaccagcctg stgsscctgc tggaccttag gaggccaagg caggtgaatt gcttgagtcc 360 
aggagtttga gaccagcctg agcaacatgg tgaaactccg tctctacaaa aaacaaaaaa 420 
aatccagcgt ggaggcatgc acctgtggtt tcagctactc aagaagtgag gcaggaggat 4 80 
tgcttka 487 

<210> 748 
<211> 601 
<212> DNA 
<213> Homo sapiens 

<400> 748 

aaaaagcgcc attacgcaga gagaaagtta cgaggttcgt ggccgcggtt tccccaggca 60 

gctggcgctg gaggcttcgg cgtcacgtgc tggtctggat ttttctcgat gcactgggga 120 

aagcggtgga ctcttatcgt gggagggctc ttgatctgtg atttatagat aggcacagct 180 

actcccgttc gggaacccaa cggcagacag gtcctagtgc ccatcagata cccgcggccg 240 

ggactcggag ctgtggggtg tggggaggcg gaggcaccaa ctaagagcga cctagcatcg 300 

caaagccgcc ctcggggcct catggcggga cgctcctggg aaaggcttta gcgcggtgtc 360 

tctctctctg gccttggctc tgtgactatc aggtcctcgc gctgtcgcgg catccaggcg 420 

ttcagaaact cgttttcatc ttcttggttt catcttaata ccaacgtcat gtctggttct 480 

aatggttcca aagaaaattc tcacaataag gctcggacgt ctccttaccc aggttcaaaa 540 

gttgaacgaa gccaggttcc taatgagaaa gtgggctggc ttgttgagtg gcagactata 600 

a 601 

<210> 749 
<211> 210 
<212> DNA 
<213> Homo sapiens 

<400> 749 

aaaaattaaa cctcaccgmc ggattcaatt ctctcgttct caggccgtct tcatcgccgg 60 

agsgtttwct tacgaaaaga atgggcttct ggacattgtt aaaaggtctg ctgctctttg 120 

caaatgcact ggcaatcctc aacgaagacc gtttccttgt tccaagagga tggacactcg 180 

gagaactcca ccaaaccggt agaagaaast 210 

<210> 750 
<211> 380 
<212> DNA 
<213> Homo sapiens 

<400> 750 

tgacataaaa ggttaaagaa caggcaacac aatgagcact taagttttta acatgtgggg 60 

aatagggcat tttaaaggct ggaaccagtt cagaggaaac aagggtttgg gtagaggtag 120 

aaaggtttaa ttaaccctgg agatctgcaa atggagggag gggtgaagga ggaatcttaa 180 

gacggaggga gaagagccaa ggacaaggtc aggttgaatk agtwagrggk gtgtacacah 240 

atcctcttgt gtgctaggta gtgggcgtgc atctggatct gagaggcagt aaagcagtga 300 

tgaagacagt gcagcagaac attgaagtcc tgattgcagc agtgactctg caaccttgga 360 

taaattattt caatctttat 380 

<210> 751 
<211> 245 
<212> DNA 
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<400> 751 

actcttctga ggatccggca agatggcaga agtagagsag aagaagaagc ggaccttccg 60 

caagttcacc taccgcggcg tggacctcga ccagctggca rsatggtggg sgtctacaac 120 

ggcaagacct tcaaccaggt ggagatcaag cccgagatga tcggccasta cctgggcgag 180 

ttctscatca cctacaagys cgtaaagcat ggscggsccg gcatcggggc cacccactcc 240 

tcccg 245 



<210> 752 
<211> 369 
<212> DNA 
<213> Homo sapiens 



<400> 752 

aagttagtac actgaaattc aaagtcatgc tcataactgt taatgaaagc agattcaaag 60 

caacaccacc accactgaag tatttttagt tatataagat tggaactacc aagcatgtgg 120 

ctcctggtca gtgtaattct aatctcacgg atatcctctg ttgggggata aggactgtgt 180 

ttctttcctt ttgtggaaaa tkgtcattct gaatcttcag gvcaaacaca tctggaaggt 240 

gatactgtac aaattatttg caacacagga tacagacttc aaaacaatga gaacaacatt 300 

tcatgtgtag aacggggctg gtccactcct cccaaatgca rgtccactat ttctgcagaa 360 

aaatgttgg 369 

<210> 753 
<211> 491 
<212> DNA 
<213> Homo sapiens 



<400> 753 

accaggtgcc 

cccatctctg 

ctgcagctat 

ggtgcmrtry 

gcttgggagc 

tcaaagagac 

tcgcactcca 

cttaawsgaa 

tgccatggca 



cagtcttcca 
ctcaccatgt 
gcctttatgt 
ggagagcact 

aaatggctct 
agtgagaaga 
ctaaagaaaa 
ctygargtgg 
a 



gttgcgaggg 
ggacgctgaa 
tctcttctct 
ttcggattcg 
ggcttytktt 
ataaggagca 
atcaaaatgc 
agcttwtgaa 



caagcaaacc 
atcgtccctg 
gagacagaaa 
gcagaaycta 
tgytgttgtg 
gagtcctcct 
ttctctttac 
atttgtgtcc 



cgtcatgagc 
gtcctgcttc 
actagcgaac 
ccagagcaca 
ccgtttgtga 
ggccttcgag 
aaagactgtg 
raagtgcrra 



aactcccttc 
tgtgcctcac 
cccaggggaa 
cccaaggctg 
tactgmagtg 
gcttcccatt 
tattcaatac 
atcttaaasg 



60 
120 
180 
240 
300 
360 
420 
480 
491 



<210> 754 
<211> 336 
<212> DNA 
<213> Homo sapiens 



<400> 754 

ataaatgctt tccaatcagt gctcccattc ttcattctat tcaccactta gttactagaa 60 

tgtttttctc aactatgaat ctgacttcac tcccattctt gaaaacctcc tgtagttcct 120 

gttgtgtacg ggataaagac tgtgagcccc tggagttcag ggaccattac ttaatcatat 180 

tccagaacct atcacaatga atgacacgaa gaatatattt gataaaaatc tgatgaatga 240 

accactctaa aatgcaactc taataaagaa aattactgcc tawgtcrgdd skgkmtcams 300 

gktayaaacc gtdaaaatta tgtatagtct gataaa 336 



<210> 755 
<211> 497 
<212> DNA 
<213> Homo sapiens 



<400> 755 

ataaatgctt tccaatcaat gctcccattc ttcattctat tcaccactta gttactagaa 60 

tgtttttctc aactatgaat ctgacttcac tcccwttctt gaaaacctcc tgtagttcct 120 

gttgtgtacg ggataaagac tgtgagcccc tggagttcag ggaccattwc ttaatcatat 180 

tccagaacct atcacaatga atgacacgaa gaatatattt gataaaaatc tgatgaatga 240 

accactctaa aatgcaactc taataaagaa aattactgcc tattcagttg gaactgtacc 300 
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aygttacaaa cgtaaaatta tgtatagtct gataaagaaa tactrtaata gccttggaaa 360 

cagctcacat acaatttttt ttataaatgg attcattacc atataactgg accagcttac 420 

tgaagagtag ccctacagca gtcacttcgc agtttgcaat tcatttgagg ttcaaagcta 480 

gctcaaggga aaacaac 4 97 

<210> 756 
<211> 400 
<212> DNA 

<213> Homo sapiens 

<400> 756 

atcattttcc ctgcaggaca cccggcctcg gtcctgcttt agtattcatc ccagcttcat 60 

tactgcagcc acctcgcaac ccaccctggg cagctgctaa aagggggaaa aattaaagca 120 

attcatactc ctggtgactt actgctgcag aataattggt ttctgtttgc aaggaacatt 180 

atcacagcaa agcccagaaa acagaaaggg atgatcatgg tgattttact gctgcttgtg 240 

gctatcgtgg ttgttgcagt ctggccgacc aactgatggc agtaaagaga ccaccagcag 300 

tgacacctgc caatgacaga tgcaagccca acaccctttt ggtacgcaaa acctgctctc 360 

aataaattcc cccaaagctc caaaaaaaaa aaaaaaaagg 400 

<210> 757 
<211> 524 
<212> DNA 

<213> Homo sapiens 



<400> 757 

acttcctcgt tgcccccgcc gcgggccgag atggattccg ggtgctggtt gttcggcggc 60 

gagttcgagg actcggtgtt cgaggagagg ccggagcggc ggtcaggacc gccgcgtcct 120 

actgcgccaa gctctgcgag ccgcagtggt tttatgaaga aacagaaagc agtgatgatg 180 

ttgaagtgct gactctcaag aaattcaaag gagacctggc ctacagacga caagagtatc 240 

agaaagcact gcaggagtat tccagtatct ctgaaaaatt gtcatcaacc aattttgcca 300 

tgaaaaggga tgtccaggaa ggtcaggctc ggtgtctggc tcacctgggt aggcatatgg 360 

aggcgctgga gattgctgca aacttggaaa ataaagcarc carcacagac catttaacca 420 

csgtactcta cctccagctt gctatttgtt caagtttgca gaacttggag aaaacaattt 480 

tctgcctgca gaaactgatt tctttgcatc cttttaatcc ttgg 524 

<210> 758 
<211> 543 
<212> DNA 

<213> Homo sapiens 



<400> 758 

aagaagggag gtggtcgccc tccgtcgtgg tctggcgtgt attccgagcs ttggtgtctg 60 

gcggtttccg agcgttggtg tctggcggtt tccgaccgtt ggtgtctggc ggtttccgac 120 

cgttggtgtc tggcacgcgc caccctctct tgctttggtt gcgccatgcc gatgtaccag 180 

acaagaagac aagaaaatga tttgaggaca gcttcaatcg cggtgtgaag aagaaagcag 240 

caaaacgacc actgaaaaca acgccggtgg caaaatatcc aaagaaaggg tcccaagcgg 300 

tacatcgtca tagccggaaa cagtcagagc caccagccaa tgatrattttc aatgctgcga 360 

aastgccaaa agtgacatgc agggatgtcc ttcctgagat ccgtgctatc tgcattgagg 420 

aaattgggtg ttggatgcaa agctacagca cgtctttcct caccgacagc tatttaaaat 4 80 

atattggttg gactctgcat gataagcacc gagaagtccg cgtgaagtgc gtgaaggctc 540 

tga 543 



<210> 759 
<211> 850 
<212> DNA 

<213> Homo sapiens 
<400> 759 

aagtttcatg ggttcctgga cgctcagcca ggggtagtct ctgaagggcc gcctggcaat 60 
tgggagcaga agccagtttc ccgccatctg ctctccgtgg aggcagtgct ctcgcggctg 120 
ctgcccgttg gggtccagct gaggaaggag cgcggatccc aggctcgttc tttgcctggg 180 
ctgcccctgc ggcccctggg gcgggtgctg ctgcggcgcc agctccaagg gcgkatccag 24 0 
gcgggagggg cctcctcgga gaagcggggc gcggtcccaa ctacgcagag gctggcacgc 300 
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cgaccctcca cacctcacca cgcccccatc tccgtccgtg tacacacact cacacaagga 360 

cgccaacccc acctagatgc aaagcaggat tcaaaagaac atctttgcgt tttctaccgg 420 

ctccccatca tcgtactagg gaggaagaag cgggtgagaa acaaaacttc tttccattgt 4 80 

cctgcccgtt tctgcggact tgttctgagg ccgaggcacc tctaagatac tgatggctct 540 

gcagaggacc cattcattgc ttctgctttt gctgctgacc ctgctggggc tggggctggt 600 

ccagcctcct atggccagga tggcatgtac cagcgattcc tgcggcaaca cgtgcaccct 660 

gaggagacag gtggcagtga tcgctactgc aacttgatga tgcaaagacg gaagatgact 720 

ttgtatcact gcaagcgctt caacaccttc atccatgaag atatctggaa cattcgtagt 780 

atctgcagca ccaccaatat ccaatgcaag aacggcaaga tgaactgcca tgagggtgta 840 

gtgaggtcac 850 



<210> 760 
<211> 532 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc feature 
<222> 524 

<223> n»a, g, c or t 

<400> 760 

ataaaacaaa atagtatgca acttacagca atgtctctcc cacttataat aaaacaccta 60 
actctcatgt agcattttat ccatattgag agctctgtga gatagaaatt attatattac 120 
aagtcacaga tatgtaaact gagatgcaga gacatcaata acttgctcaa ggttacaagg ISO 
tggacttagt agttttaagc agaaccatga aaacctctgt gacaactccc tccagctcca 240 
agagtgccgg gaggtggggg gcggcgcatc cgcggcctcg agcttgctac ctcagcccat 300 
ccccaccacc cctgacatcg agaacgctga gctcaccccc atcttgccct tcctgttcct 360 
tggcaatgag caggatgctc aggacctgga caccatgcag cggctgaaca tcggctacgt 420 
catcaacgtc accactcatc ttcccctcta ccactatgag aaaggcctgt tcaactacaa 480 
gcggctgcca gcactgacag caacaagcag aacctgcggc agtntttgaa ga 532 

<210> 761 
<211> 275 
<212> DNA 
<213> Homo sapiens 

<400> 761 

tgtttgtttt gcttctacta agactgtttt gtttcaaaaa ggaaacaagt tttgtgtttg 60 

ctgtctacgc tggagtcctg aactgtgggt agaaaacacg acctggcttt gtagaaagga 120 

cacagggctg ttttatgaac taagcggtga ggctcaggtg gcggctctcg cagacccctg 180 

atgctgttgt tcttkgaggg cttaaggcct gatgaacgta ggcacgtgat gcataatagt 240 

cttcaatggt acacttaact agtctcttct gtgta 275 

<210> 762 
<211> 546 
<212> DNA 
<213> Homo sapiens 

<400> 762 

aggtatctag gtatcctccc atctcaggcc agactggaga ttgagcgcct gttgtatacg 60 

ggccctgggc taaccctccc cccatcacac aactcaacag caacgcccaa gccacaactt 120 

ctctggacct caggttccct acctgcaaaa cagtttgcac ctcctccctc cacaaaactg 180 

caaccggggc cggcgcggtg gctcacgcct gtcatcttaa caccttggga ggcggaggtg 240 

ggcggatcat cagaggtcag gagttcgaga ccagcctggc caacaaggtg aaaccccatc 300 

tgtactaaaa atacagaaat cagccgggca tggtggctcg cgcctgtagt cccagctact 360 

tgggaggctg aggtaggaga atcgcttgaa tccgggaggt ggaagtcgca gtgagccgag 420 

atcgcgccac tgcactccag cctgggtgac acagtgagac tccacctcaa aaacaaaaca 480 

aaacaaaaaa ctgcccgcag agagctgccc cgaccacaag gcctgaatcc agccttccag 540 

ccccag 54 6 



<210> 763 
<211> 545 
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<400> 763 

ctggtgtctc 

ccacttctga 

agaaatccaa 

gggcaaaatc 

atctgtttta 

ttttaaagtt 

ttttagtaca 

tgtctaaata 

gtctgctttc 

tcttg 



tcaaaagcct 
tgattctgac 
gggggagagt 
tgtacgggca 
aaatgtgagg 
acctgaagct 
agacatcaaa 
gtgaccatct 
ttttgtcttt 



gatcctgcca 
tctaattttg 
gatgacttcc 
aagaaaccta 
cgattatttt 
cttaacttcc 
gtgaagtaaa 
catgggcatt 
aaacctgatt 



398 



aaaccaagaa 
agaaaattgt 
atatggactt 
taaagtacct 
aagtaattat 
tcccctctga 
gcccaagtgt 
gttttcttct 
tttaagttct 



tcgccgcaaa 
ttcgaaagca 
tgactcagct 
ggaagagtca 
cttaccaagc 
atttagtttg 
tctttagctt 
ctgctttgtc 
tctgaactgt 



aggaagccat 
gtcacaagca 
gtggctcctc 
gatgaagatg 
ccaagactgg 
gggaaggtgt 
tttataatac 
tgtgttttga 
agaatagcta 



<210> 764 
<211> 500 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 
480 
540 
545 



<400> 764 

gacctgggct 

cgtccgagga 

gcccctactt 

tggaaacctt 

ggatgccgca 

atccctacca 

tgggatgtgg 

caagggttga 

gagctgatga 



gcagttaggt 
gaggatcaag 
gggccccact 
ctcaagagcc 
gaaagaccac 
ttgggttcaa 
gtggccagga 
tatttgtggt 
gaatgctggc 



tgtcgcctgc 
tcttccatac 
gtccccagaa 
tgattgggaa 
catcctatac 
tgtggagaca 
caagattcga 
cgacagcaat 



aggaggtgtg 
cagggaccag 
agcagctgct 
gaaggagatg 
aagctgaaac 
gtggagtata 
cccctctgga 
gatcgggagc 



ctgttgaggg 
gccaaaacca 
gtgatcatgg 
cgcatcctga 
tgggggagat 
agaacatcag 
gacactactt 
gagtaaatga 



agaggaaaac 
gactgccact 
gcaatatctt 
tggtgggcct 
cgtcaccacc 
ctttacagtg 
ccagaacacc 
ggcccgggaa 



<210> 765 
<211> 527 
<212> DNA 
<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 
480 
500 



<400> 765 

gctttgacaa 

gtctttaaca 

aataacctca 

ctcaagcttc 

ttctctctat 

ttgtctgccg 

ctccttcttg 

attcaggttc 

ggttttgtct 



aaccagggaa 
gtgcattcaa 
aagaagagca 
tgaccaaagc 
cgaccatcga 
ccacaaaccc 
gaccggtttt 
ttgtggctct 
ccaatttgca 



agagacagag 
tggcaaataa 
cagcgctcca 

cgagaaagca 
gcgtcttgga 
ggaaacgcct 
tgcatatgtg 
actctctgtt 
gaaatctgat 



agagataatg 
gaagccatca 
caggtgaaac 

ggacttttgt 
ttgctgacca 
ggaacgttat 
gttcctgaag 
cttggtggct 
tagcccgatc 



gcggctaaac 
ccttctgctg 
tgctgacaag 
ctttagcaga 
aagcagagga 
tcactttgag 
attacacttg 
ctgctgcttt 
gtttttt 



taatctgttc 
caacaagaac 
agtcgagcag 
gaaatcaggt 
gttcggcgtt 
cctcggttta 
ggaagtagtg 
tgctgcttct 



60 
120 
180 
240 
300 
360 
420 
480 
527 



<210> 766 
<211> 921 
<212> DNA 

<213> Homo sapiens 



<400> 766 

agtagggagg aaaaccggag gagagcgcag gaggaaacag taccggctgg aggccggtct 60 

tgcaggagcg ggggactgct gggggcgggg cttggtggtg accgctggcg gggcggggcc 120 

tggggctcag aggggtgggc tttggagatc agagggtcga cgctgcttcg ttgcctggac 180 

tctggtttcc gccctggagc aagccggggc ctggtcggca gctgggccgc catggagtcc 240 

acgctgggcg cgggcatcgt gatagccgag gcgctacaga accagctagc ctggctggag 300 

aacgtgtggc tctggatcac ctttctgggc gatcccaaga tcctctttct gttctacttc 360 

cccgcggcct actacgcctc ccgccgtgtg ggcatcgcgg tgctctggat cagcctcatc 420 

accgagtggc tcaacctcat cttcaagtgg tttctttttg gagacaggcc cttttggtgg 480 

gtccatgagt ctggttacta cagccaggct ccagcccagg ttcaccagtt cccctcttct 540 

tgtgagactg gtccaggcag cccttctgga cactgcatga tcacaggagc agccctctgg 600 
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399 



cccataatga 


cagccctgtc 


ttcgcaggtg 

3 3 3 3 


gccactcggg 


cccgcagccg 


ctgggtaagg 


660 


gtgatgccta 


gcctggctta 


ttgcaccttc 


cttttggcgg 


ttggcttgtc 


gcgaatcttc 


720 


atcttagcac 


atttccctca 


ccaggtgctg 


gctggcctaa 


taactggcgc 


tgtcctgggc 


780 


tggctgatga 


ctccccgagt 


gcctatggag 


cgggagctaa 


gcttctatgg 


gttgactgca 


840 


ctggccctca 


tgctaggcac 


cagcctcatc 


tattggaccc 


tctttacact 


gggcctggat 




ctttcttggt 


ccatcagcct 


a 








921 


<210> 767 














<211> 570 














<212> DNA 














<213> Homo 


sapiens 












<400> 767 














aagggaatct 


ccgggcgggg 


tagtgcaggc 


gccgggtttc 


ccgcggtccg 


agctggcgcg 


60 


ggcggaggag 


aatcgctctt 


aaagggccag 


cgcacacgcg 


ttcttttgtt 


ccggggccgc 


120 


aggcggggca 


ggcccgactt 


tcgccgtctt 


cttgtctact 


ctccagaacg 


gccatgattt 


180 


cccaattctt 


cattctgtcc 


tccaaggggg 


accgctcatc 


tacaaagact 


tccgcgggga 




cagtggcggc 


cgggatgtgg 


ccgagctctt 


ctaccggaag 


ctgacgggac 


tgccaggaga 


300 


cgagtccccg 


gttgtcatga 


ggcagaggtt 
^^ ^^^ 


gcagtgagcc 


gagatccgga 


caccgcactc 


360 


cagcctggga 


gacagagcga 


gactccctct 


cgggcatcac 


catggccgtc 


atttcattca 


420 


catcagacac 


agcggcctct 


atttggtggt 


cacaacttca 


gaaaacgttt 


ctcccttcag 


480 


cctcctagaa 


ctgctctcca 


ggttggccac 


ccttctgggc 


gattactgtg 


gctccctggg 


540 


cgagqggaca 


tctcccgmaa 


tgtggctctg 

3 3 3 3 








570 


<210> 768 














<211> 490 














<212> DNA 














<213> Homo 


sapiens 












<400> 768 














cactcgctgc 


ctcggcagcg 


ctgctcttct 


aagatggctg 


ccgctaccgg 


tgcggtggca 


60 


gcctcggccg 


cctcgggtca 


ggcggaaggt 


aaaaagatca 


ccgatctgcg 


ggtcatcgat 


120 


ctgaagtccg 


agctgaatcg 


gcggaactta 


gacatcaccg 


gagtcaagac 


cgtgctcatc 


loU 


tcccgactca 


agcaggctat 


tgaagaggaa 

3 ^ 33 


ggaggcgatc 

3 3 3 3 3 


cagataatat 


tgaattaact 


240 


gtttcaactg 


atactccaaa 


caagaaacca 


actaaaggca 


aaggtaaaaa 


acatgaagca 


300 


gatgagttqa 


gtggagatgc 

3 3 3 3 3 


ttctgtggaa 


gatgatgctt 


ttatcaagga 


ctgtgaattg 


360 


gagaatcaag 


aggcacatga 


gcaagatgga 


aatgatgaac 


taaaggactc 


tgaagaattt 


420 


ggtgaaaatg 


aagaagaaaa 


tgtgcattcc 


aaggagttac 


tctctgcaga 


agaaaacaag 




agagctcatg 












490 


<210> 769 














<211> 491 














<212> DNA 














<213> Homo 


sapiens 












<400> 769 














cactcgctgc 


ctcggcagcg 


gctgctcttc 


taagatggct 


gccgctaccg 


gtgcggtggc 


60 


agcctcggcc 


gcctcgggtc 


aggcggaagg 


taaaaagatc 


accgatctgc 


gggtcatcga 


120 


tctgaagtcc 


gagctgaatc 


ggcggaactt 


agacatcacc 


ggagtcaaga 


ccgtgctcat 


1 fin 


ctcccgactc 


aagcaggcta 


ttgaagagga 


aggaggcgat 


ccagataata 


ttgaattaac 


240 


tgtttcaact 


gatactccaa 


acaagaaacc 


aactaaaggc 


aaaggtaaaa 


aacatgaagc 


300 


agatgagttg 


agtggagatg 

3 3 3 ^ 3 


cttctgtgga 


agatgatgct 


tttatcaagg 


actgtgaatt 


360 


ggagaatcaa 


gaggcacatg 


agcaagatgg 


aaatgatgaa 


ctaaaggact 


ctgaagaatt 


420 


tggtgaaaat 


gaagaagaaa 


atgtgcattc 


caaggagtta 


ctctctgcag 


aagaaaacaa 


480 


gagagctcat 


g 










491 


<210> 770 














<211> 403 














<212> DNA 














<213> Homo 


sapiens 












<400> 770 














cgcccacgcg 


gaactcgaac 


tcctgcaggg 


cctcgtagtc 


cagcgacctr 


agggcgaaca 


60 
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ggtggccgtt 
atgctgaaaa 
gccatggtca 
tttccagaaa 
wmaaaatcta 
ttagaaatct 



gtctgtgttg 
gagctgcaac 
gttccggagc 
gacctgcccc 
gttttctgat 
cttcatcacc 



<210> 771 
<211> 374 
<212> DNA 

<213> Homo sapiens 



<400> 771 

attcttttcc 

gccacctgag 

actctggact 

tgtggtctct 

aacaaaagat 

aggtgctgga 

ggtgakagct 



cccttctcac 
gacccagagc 
gctgaccttt 
gatctaatcc 
aatgagaaaa 
gcaggggtcg 
ccag 



<210> 772 
<211> 380 
<212> DNA 
<213> Homo sapiens 



<400> 772 

agaaacagtc 

agtgttatcc 

ggtttttctg 

caggaagtac 

agaggtgttg 

caggagtcgg 

aatcaccatt 



tctccccagc 
tgctgatccc 
aagctgctga 
tcaaagatca 
atgagggcaa 
gatccacatc 
gaaaaaaaar 



atggagacca 
ggtctcagga 
gttatcgttg 
atcagaggct 
tattttaatt 
ataaaaaatg 



cggccaggat 
tcaaagccac 
gaactattat 
ttagagaatg 
gtgaggttgg 
tgatctgaat 



tcacttggct 
aatttcctat 
catcgggtca 
tggctcaaaa 
taaaagcaaa 
tgcagactac 



400 

gggaggcgag 
gaattttcgc 
atatctacca 
ttaatatcca 
tctccccttc 
agtagaataa 



gggcagaatc 
ggatggggct 
cctgtattat 
cctgatatga 
gttcattaag 



gcggtcttga 
tgtgaatgaa 
gacagtgcat 
cacaatttww 
tgcaaatgtc 



gcagacataa 
atgttgagaa 
gttatttcca 
tgaccatgaa 
aagttggttt 
gaccagaggg 



gaagatacta 
agttctggca 
gccctcaaca 
gtctgcagag 
ggtggcttta 
acaaaaacaa 



tgatggcagg 
gggcagagat 
gggaaatgca 
gacacttttc 
actgagccag 
aaggactgat 



agctggagca 
aactgtatcc 
aaccaaagga 
ctacctggta 
gagctataac 
ggaattggat 



aagagaacta 
ccagaaaatc 
ccatcacctt 
cagagacaag 
tcacggtgag 
acaacacaga 



gatgttatct 
ctgaatgata 
tggccatgtt 
agattgacaa 
ctacttctca 
ttaaaagcac 



120 
180 
240 
300 
360 
403 



60 
120 
180 
240 
300 
360 
374 



60 
120 
180 
240 
300 
360 
380 



<210> 773 
<211> 472 
<212> DNA 

<213> Homo sapiens 



<400> 773 

agatggctca 

aactgggccc 

ctgtgctgtt 

tcagtagaag 

gttttgcgac 

agcaaacttg 

ggggcccatg 

ggagtagcaa 



ggtgcttcgg 
tagagactgg 
tgatgcagtt 
ctatatcaac 
agagcatcag 
tgaagacctt 
ttttccctca 
caggactctg 



<210> 774 
<211> 521 
<212> DNA 
<213> Homo sapiens 



<400> 774 

attccaggcg 

tttccctgct 

gtgctaggta 

agcctcctcg 

cgagtgacct 

tgagaatttt 

cagtgtaata 



cctgcaacta 
tcttatcgtc 
gcagcctccc 
acctcccttc 
tgagcccagg 
aatgcatgga 
gcaggaccaa 



ctgccacggc 
ggcaggcggt 
gatcactgat 
aatggttgtt 
ccacaagtat 
attatataac 
gcagtcggat 
gactgtcttc 



agaggttctc 
tctgactcag 
attccactct 
ttcctttcct 
ttgatgcgag 
tttatcagac 
gggggaggac 
acactaggtg 



tgctctcttt 
aagatctttt 
tagatattac 
gccaactctt 
gtccatgtct 
aagaaaagcc 
tgctttatgg 
gtgtgggcag 



aacgtggccg 
tgctttaacg 
ggccctcggg 
ctggtgacgg 
gcgacggtca 
aaagctgtcc 
aatattctgt 



ggtctgcaag 
ccttaaatag 
aaaggcgggg 
acgaacagtt 
gcttgttaat 
atgttccaac 
caatcagctg 



tctctgctgg 
ggaagaattg 
atatgaaata 
ccacaaagaa 
tccatacacc 
acctcctcca 
acttgcatca 
ca 



ctaggtgcca 
cccgctgaag 
tggggaggcg 
cccgtagaat 
tcctggctgc 
tgctgtacat 
accatatact 



gcggggaaag 
gctgcagcag 
agagcagctt 
ttcgcttcac 
aggaactttg 
ccaaaagtct 
taatgactcc 



60 
120 
180 
240 
300 
360 
420 
472 



60 
120 
180 
240 
300 
360 
420 
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401 



taaaatctcg 


tggacttcta 


agaaagcgcc 


atggcctgtg 


ctgctgttat 


gattcctggg 




ttgttgcggt 


gctctgttgg 


agccatccgt 


attgaggctg 


y 




521 


<210> 775 














<211> 447 














<212> DNA 














<213> Homo 


sapiens 












<400> 775 














acttagctga 


tagctagcat 


tagctgctgg 


ccaggtgcat 


aagccatctt 


agatgtccag 


60 


cccaggttca 


tagaatggga 


tgcccactga 


tgacaatgcc 


aggccctcca 


gggaaacgag 




gtaagagagg 


ccgaagaqqa 


gaatctggtc 


ctcctggaca 


gcctggtcct 


cagggccctc 


180 


ctggtccaaa 


aggcgataag 


ggagaacaag 


gtgatcaggg 


acctaggatg 


stgtttccta 


240 


aaatcaatca 


tgggtttctc 


tctgctgatc 


agcagctcat 


taaacgccgc 


ctgattaagg 


300 


gtgaccaagg 


acaggcaggg 


cctccaggac 


cccctggccc 


tccaggccca 


agagggccac 


360 


ctggggacac 


agggaaagat 


ggcccccgtg 


gaatgccagg 


agtacccggt 


gaaccaggaa 


420 


agccaggaga 


acaaggcttg 


atgggtc . 








447 


<210> 776 














<211> 491 














<212> DNA 














<213> Homo 


sapiens 













<400> 776 

attctacagg 

tgggctccac 

atggatggag 

tcccggcgcc 

gaagaagata 

gatggatgcc 

ctgcccagca 

caggctccgt 

ggtcccawtt 



aagcaatttt 
tgaggaagag 
aacgcctgct 
aacgcggccg 
aagagcggag 
gcgtcccctt 
acctgccagc 
cgcggagtcg 

g 



ttggttgtga 
cccttcctcc 
ccaagccgga 
ccgccaaaat 
actaggccag 
cgcagcgacg 
tacaggagcc 
ctgagtccgt 



agggagaaat 
cagcagtgtt 

cgacgacatt 
ccaggcgagt 
aactgagcat 
agacttccct 
ccctgcgtcc 
gcccttttag 



gggctcctca 
gtataaagta 
ctagacatcc 
tttcggggcc 
tttcaaagtt 
gccgtgtttg 
cagagactcc 
ttagttctgc 



tggaggttcc 
gagaagataa 
cgctggacga 
acatggcgcg 
cccgaggaga 
tgaccccctc 
ctcacccagg 
agtctagtat 



60 
120 
180 
240 
300 
360 
420 
480 
491 



<210> 777 
<211> 389 
<212> DNA 

<213> Homo sapiens 



<400> 777 

tgtgtaatgt tgaatctgga aattgatcag cattaaaggg cacatgaagc agtgtctgca 60 

ggcgttcagt gctgcggasc tgttaaaggt cactcagatg tgcaggtgtt aatcttctct 120 

aaaagcctgg tgatacagct ctggctttct gagcacacta cggatctgga aaatactgga 180 

aaatgtgata cttagaatac tttggctgct aaggaaactt cctctccatt gcagaatagc 240 

tgagccaagt gagtgagttt gcagaaagca ggtggtgagc tcctgcctgc tggaggttgc 300 

catggagggc cattcctgcc cggcaacagc accgtcctgc agggagccac ttggcagaag 360 

ggtgcagggc tgctggtgtc agagcaaga 389 



<210> 778 
<211> 440 
<212> DNA 
<213> Homo sapiens 

<400> 778 

tgtagtgttt gcagtagcgt ttctcacctt ccctgatgaa gggtcttttc agttgcaaac 60 

caacttgcca cctcagcggc agaagttgag tgttgttcgt gtcaccacag agtctcgttc 120 

tgtcacccag cctggtgtgc agtggttcar gccattctcc tgcctcagcc tcccaagtag 180 

ctgggattac aggcgcccac caccaagccc agctaatttt tgtatgttcg gtagagacgg 240 

ggtttcacca tgttggccag gctggtctca aactcctgac ctcaagcgac ccacccgcct 300 

cggcctccca gagttttgcg attacaggcg tgacaccacg cccagcggaa tttcttattc 360 

tttttgatct atcaaaacaa cctcattgtt ctcaatgact gaccttataa taagtaaatg 420 

ttatagaaca taaaaaaaaa 440 



wo 02/083898 

<210> 779 
<211> 418 
<212> DNA 
<213> Homo sapiens 



402 
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<400> 779 

atagattaca 

gctccttctc 

cgttaccttg 

cacggcttcc 

gactctcatg 

ttcaacccga 

ggctgctgaa 



acggtgatgg 
ggcgtgaacc 
gaggggatgc 
taaacacctc 
agtgctgagg 
acatctcgtg 
gttcccgttg 



cgggcaagcg 
tggtggtgat 
tacagcactg 
ttatcaccgg 
ccaacctggc 
gcagcctatt 
ggcccatgtc 



gtccggctgg 
gccgcccacc 
ggaactgggc 
caagaggttt 
tggactcttc 
cctgtgcaca 
cccgttatga 



agccgggcgg 
cgggcccgga 
caggccctgc 
atgtgcgasr 
cctcccaacg 
ctgtsccatc 
gcagctgcag 



ctctcctcca 
gtctgcgctt 
ggcasgctat 
gacttgaccg 
gatgcagcgc 
actgaggaca 
aacgagac 



60 
120 
180 
240 
300 
360 
418 



<210> 780 
<211> 362 
<212> DNA 
<213> Homo sapiens 



<400> 780 

aaacttgccc 

ccgcctttga 

ccggcggccc 

ctccttgcct 

tcttctccta 

tcgatatgga 

ca 



ttctcaaaca 
tctcgttggt 
gacggagcag 
ttgtccagac 
tgtgcttggg 
ggctttcact 



tggccgccac 
ggggctgggg 
aagagagagc 
acacctcccg 
gtcctggagg 
gagatgatgg 



ggcgcctctg 
gatgagagct 
atggagctgg 
gagccgacct 
actgggcccc 
aggcctatgt 



gaagggaacc 
gcaccgcgcg 
agaggatcgt 
cagtggcttg 
tcggcattca 
gcctggcttc 



gctctgggcc 
ggacaagtcg 
cagtgcagcc 
gatgaggtca 
gaggagaact 
gccacatccc 



60 
120 
180 
240 
300 
360 
362 



<210> 781 
<211> 379 
<212> DNA 
<213> Homo sapiens 



<400> 781 

agattatttc 

aaccactcyt 

caggtctggc 

gagcaggttg 

tcggctgcct 

cactgccaga 

agctgaacta 



tacttggttg 
tttcaaagat 
cacggccatg 
gggagtctgg 
ttgcgcccat 
gtcctggggt 
tacagtgcc 



ccttccagat 
ctctgaggga 
agcatgctga 
ctggttccag 
ctgtgtgggc 
ggctgagctg 



gtttcacact 
gacattgcac 
gccatcatgc 
aagcaagcct 
atcgtcttcc 
agccagcgct 



tggacagcaa 
ctggccactg 
ccaccgtgga 
tcctcatctt 
tgggtttcac 
gtggctggag 



actgatttca 
cagcccagag 
tgacattctg 
atgcctgctg 
acctgaccac 
ccctgcgagg 



60 
120 
180 
240 
300 
360 
379 



<210> 782 
<211> 453 
<212> DNA 
<213> Homo sapiens 



<400> 782 

ttctcatctt 

tgtcttccta 

cttgacctct 

ttaccctgtc 

ttatatcact 

caacatctcg 

gtgtgataca 

agatcaagct 



ttgtaatgtg 
actcactagt 
taatgttaga 
tttgtgatct 
agctctgaga 
aatgcctgtt 
cagagatgaa 
tgaccacata 



tggtcctcac 
tgctccttct 
gtcccagagt 
agtataatgg 
tctctcacga 
cttcaagagt 
actggttttc 
ttatcccctc 



ttggcctctg 
cactctcctt 
gtagtcatag 
ccaccaccta 
actccagatg 
ttgagtcatg 
acaggtgata 
cac 



ggacacctcc 
tgctggttct 
tccttctctg 
taaacttata 
tgcacatctg 
aagtgtgaag 
gatggaaggc 



aagtcttagt 
tccttttctc 
tgatctattt 
gctcacgaat 
actgcttact 
cctagagatg 
tgttttcaaa 



60 
120 
180 
240 
300 
360 
420 
453 



<210> 783 
<211> 553 
<212> DNA 
<213> Homo sapiens 



<400> 783 
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403 

aactacttcc gggggagcgg cgcggcggcg sggagttggt tctaaagagt ggtgagtcag 60 

aagagacgtc aggcagcaag cgacttgggc catggctctg acctagactt ctcacctccg 120 

cagtgcccga gcccactttc ctggagaacc tgctacggta cggactcttc ctgggagcca 180 

tcttccagct catctgtgtg ctggccatca tcgtacccat tcccaagtcc cacgaggcgg 24 0 

aggctgaacc gtctgagccc agaagtgctg aggtgacgag gaagcccaag gctgctgttc 300 

cttctgtgaa caagaggccc aagaaagaga ctaagaagaa gcggtagaag aggaggcctg 360 

aggagctggg cgggcaggga gagggtcttg gggacagccc tcctgggaat ctacattgtg 420 

ttcccccgca ttccaggctc agggtctgag gaggctgtga cgccctatga ccgcagagat 480 

ctagacagtc gtaacagtcc ccaggctcba gctgggcaat ccaccacttc ctcttccttc 540 

tgcttctgtg acg 553 

<210> 784 
<211> 472 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 445 

<223> n=a, g, c or t 



<400> 784 

ttccctttcg aattccaggg tatatctggg aggccggaga cgtgtctggt tattacacag 60 

atgcacagct ggacgtggga tccacacagc tcagaacagt tggatcttgc tcagtctctg 120 

tcagaggaag atcccttgga caagaggacs ctgccttggt gtgagagtga gggaagarga 180 

agctggaacg agggttaagg aaaaccttcc agtctggaca gtgactggag agctccaagg 240 

aaagcccctc ggtaacccag ccgctggcac catgaaccca gagagcagta tctttattga 300 

ggattacctt aagtatttcc aggaccaagt gagcagagag aatctgctac aactgctgac 360 

tgatgatgaa gcctggaatg gattcgtggc tgctgctgaa ctgcccaggg atgaggcaga 420 

tgagctccgt aaagctctga acaangcttg caagtcacat ggtcatgaag ga 472 



<210> 785 
<211> 157 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -29.. -1 



<220> 

<221> UNSURE 

<222> 120 

<223> Xaa His, Gin 



<400> 785 




























Met 


Ala 


Thr 


Met 


Val 


Pro 


Ser 


Val 


Leu 


Trp 


Pro Arg 


Ala 


Cys 


Trp 


Thr 










-25 










-20 










-15 




Leu 


Leu 


Val 


Cys 
-10 


Cys 


Leu 


Leu 


Thr 


Pro 

-5 


Gly 


Val 


Gin 


Gly 


Gin 

1 


Glu 


Phe 


Leu 


Leu 
5 


Arg 


Val 


Glu 


Pro 


Gin 
10 


Asn 


Pro 


Val 


Leu 


Ser 
15 


Ala 


Gly 


Gly 


Ser 


Leu 


Phe 


Val 


Asn 


Cys 


Ser 


Thr 


Asp 


Cys 


Pro 


Ser 


Ser 


Glu 


Lys 


lie 


Ala 


20 










25 










30 










35 


Leu 


Glu 


Thr 


Ser 


Leu 


Ser 


Lys 


Glu 


Leu 


Val 


Ala 


Ser 


Gly 


Met 


Gly Trp 










40 








45 










50 




Ala 


Ala 


Phe 


Asn 
55 


Leu 


Ser 


Asn 


Val 


Thr 
60 


Gly 


Asn 


Ser 


Arg 


lie 
65 


Leu 


Cys 


Ser 


Val 


Tyr 


Cys 


Asn 


Gly 


Ser 


Gin 


lie 


Thr 


Gly Ser 


Ser 


Asn 


lie 


Thr 






70 










75 










80 








Val 


Tyr 
85 


Arg 


Leu 


Pro 


Glu 


Arg 
90 


Val 


Glu 


Leu 


Ala 


Pro 
95 


Leu 


Pro 


Pro 


Trp 


Gin 


Pro 


Val 


Gly 


Gin 


Asn 


Phe 


Thr 


Pro 


Ala 


Leu 


Pro 


Ser 


Gly 


Gly Trp 



wo 02/083898 

100 105 
Val Ala Pro Asp Xaa Pro 
120 

<210> 786 
<211> 151 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -16.. -1 

<400> 786 

Met Val Met Leu Leu Leu 
-15 

Ala Ala Glu Gly Gin Ala 

1 5 

Val Gin Glu Asn Phe Asp 

20 

He Glu Lys He Pro Thr 
35 

Asn Tyr Ser Leu Met Glu 
50 

Leu Arg Ala Asp Gly Thr 
65 70 
Val Asn Leu Thr Glu Pro 
85 

Met Pro Ser Ala Pro Tyr 

100 

Ala Leu Val Tyr Ser Cys 
115 

Phe Ala Trp He Leu Ala 
130 

<210> 787 
<211> 143 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -17. .-1 

<400> 787 



Met 


Ala 


Leu 
-15 


Glu 


Val 


Leu 


Ala 


Glu 
1 


Asn 


Leu 


His 


Val 
5 


Val 


Ser 


Val 


He 


Pro 
20 


Val 


Asp 


Glu 


Leu 


His 
35 


Leu 


Gly 


Tyr 


Val 


Tyr 
50 


Glu 


Phe 


He 


Arg 


Val 


Val 


Ser 


Glu 


Glu 


65 










Thr 


Pro 


Arg 


Asn 


He 


Asp 


80 










85 


Ser 


Thr 


Ser 


Arg 


Lys 
100 


Ser 


Glu 


He 


Lys 


Leu 
115 


Asp 


Pro 









404 












110 




His 


Gly 


Gly 


Ala 


Ala 


Ser 








125 






Leu 


Leu 


Ser 


Ala 


Leu 


Ala 


-10 










-5 


Phe 


His 


Leu 


Gly 


Lys 


Cys 








10 






Val 


Asn 


Lys 


Tyr 


Leu 


Gly 






25 








Thr 


Phe 


Glu 


Asn 


Gly 


Arg 




40 










Asn 


Gly 


Lys 


He 


Lys 


Val 


55 










60 


Val 


Asn 


Gin 


He 


Glu 


Gly 










75 




Ala 


Lys 


Leu 


Glu 


Val 


Lys 








90 






Trp 


He 


Leu 


Ala 


Thr 


Asp 






105 








Thr 


Cys 


He 


He 


Gin 


Leu 




120 










Arg 
























Met 


Leu 


Leu 


Ala 


Val 


Leu 




-10 










Lys 


He 


Ser 


Cys 


Ser 


Leu 










10 




Ala 


Glu 


Ser 


Arg 


Asn 


Leu 








25 






Met 


Gly 


Cys 


Pro 


Ala 


Asn 






40 








Tyr 


Leu 


Val 


Arg 


Asp 


Cys 




55 










Thr 


Leu 


Leu 


Phe 


Gin 


Thr 


70 










75 


His 


Asp 


Pro 


Gin 


Glu 


He 










90 




Val 


Trp 


Leu 


Thr 


Pro 


Val 








105 






Ser 


Pro 


Phe 


He 


Ala 


Asp 






120 
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115 

Leu 



Gly 


Leu 


Phe Gly 


Pro 


Asn 


Pro 


Pro 






15 




Arg 


Trp 


Tyr 


Glu 




30 






Cys 


He 


Gin 


Ala 


45 








Leu 


Asn 


Gin 


Glu 


Glu 


Ala 


Thr 


Pro 








80 


Phe 


Ser 


Trp 


Phe 






95 




Tyr 


Glu 


Asn Tyr 




110 






Phe 


His 


Val 


Asp 


125 








He 


Trp 


Thr 


Gly 


-5 








Asp 


Trp 


Leu 


Met 








15 


Tyr 


He 


Phe 


Ala 






30 




Arg 


He 


His 


Thr 




45 






Gly 


He 


Arg 


Thr 


60 








Glu 


Leu 


Tyr 


Phe 


His 


Leu 


Glu 


Cys 








95 


Ser 


Thr 


Glu 


Asn 






110 




Phe 


Gin 


Thr 






125 
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405 
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<210> 788 
<211> 148 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -17. .-1 

<400> 788 



Met Ala 


Leu 


Glu 


Val 


Leu 


Met Leu Leu Ala 


Val Leu 


He Trp Thr Gly 




-15 








-10 




-5 




Ala Glu 


Asn 


Leu 


His 


Val 


Lys He Ser Cys 


Ser Leu 


Asp Trp Leu Met 


1 








5 




10 




15 


Val Ser 


Val 


He 


Pro 


Val 


Ala Glu Ser Arg 


Asn Leu 


Tyr He Phe 


Ala 








20 




25 




30 




Asp Glu 


Leu 


His 


Leu 


Gly 


Met Gly Cys Pro 


Ala Asn Arg He His 


Thr 






35 






40 




45 




Tyr Val 


Tyr 


Glu 


Phe 


He 


Tyr Leu Val Arg 


Asp Cys Gly He Arg Thr 




50 








55 




60 




Arg Val 


Val 


Ser 


Glu 


Glu 


Thr Leu Leu Phe 


Gin Thr 


Glu Leu Tvr 


Phe 


65 










70 


75 






Thr Pro 


Arg 


Asn 


He Asp 


His Asp Pro Gin 


Glu He 


His Leu Glu 


Cys 


80 








85 




90 




95 


Ser Thr 


Ser 


Arg 


Lys 


Ser 


Val Trp Leu Thr 


Pro Val 


Ser Thr Glu 


Asn 








100 




105 




110 




Glu He 


Lys 


Leu 


Asp 


Pro 


Ser Pro Phe He 


Ala Asp 


Phe Gin Thr 


Thr 






115 






120 




125 




Ala Glu 


Glu 


Leu 
















130 
















<210> 789 
















<2H> 75 
















<212> PRT 
















<213> Homo sapiens 












<220> 


















<221> SIGNAL 














<222> -24.,- 


-1 














<220> 


















<221> UNSURE 














<222> 51 
















<223> Xaa = 


GlUr Lys 












<400> 789 
















Met Leu 


Pro 


Pro 


Leu 


Pro 


Ser Arg Leu Gly 


Leu Leu 


Leu Leu Leu 


Leu 








-20 




-15 




-10 




Leu Cys 


Pro 


Ala 


His 


Val 


Gly Gly Leu Trp 


Trp Ala 


Val Gly Ser 


Pro 






-5 






1 




5 




Leu Val 


Met 


Asp 


Pro 


Thr 


Ser He Cys Arg 


Lys Ala 


Arg Arg Leu 


Ala 


10 










15 


20 






Gly Arg Gin 


Ala 


Glu 


Leu 


Cys Gin Ala Glu 


Pro Glu 


Val Val Ala 


Glu 


25 








30 




35 




40 


Leu Ala Arg 


Gly Ala Arg 


Leu Gly Val Arg 


Xaa 







45 50 



<210> 790 
<211> 64 
<212> PRT 

<213> Homo sapiens 



wo 02/083898 



PCT/IBOl/00914 



406 

<220> 

<221> SIGNAL 
<222> -22.. -1 

<400> 790 

Met Arg Gly Met Lys Leu Leu Gly Ala Leu Leu Ala Leu Ala Ala Leu 

-20 -15 -10 

Leu Gin Gly Ala Val Ser Leu Lys lie Ala Ala Phe Asn lie Gin Thr 

-5 15 10 

Phe Gly Glu Thr Lys Met Ser Asn Ala Thr Leu Val Ser . Tyr He Val 

15 20 25 

Gin He Leu Ser Arg Tyr Asp He Ala Leu Val Gin Glu Val Arg Asp 
30 35 40 

<210> 791 
<211> 117 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -33. --1 

<220> 

<221> UNSURE 
<222> 82 

<223> Xaa = Glu, Gly 
<400> 791 



Met Ser 


Ala 


Gin 


Glu 


Ser Cys Leu 


Ser Leu He Lys 


Tyr 


Phe 


Leu 


Phe 






-30 






-25 




-20 






Val Phe 


Asn 


Leu 


Phe 


Phe Phe Val 


Leu Gly Ser Leu 


He 


Phe 


Cys 


Phe 




-15 






-10 




-5 








Gly He 


Trp 


He 


Leu 


He Asp Lys 


Thr Ser Phe Val 


Ser 


Phe 


Val 


Gly 


1 








5 


10 








15 


Leu Ala 


Phe 


Val 


Pro 


Leu Gin He 


Trp Ser Lys Val 


Leu 


Ala 


He 


Ser 








20 




25 






30 




Gly He 


Phe 


Thr 


Met Gly He Ala 


Leu Leu Gly Cys Val Gly Ala 


Ser 






35 






40 




45 






Arg Ser 


Ser 


Ala 


Ala 


Ser Trp Ala 


Cys He Leu Gly Cys 


Cys 


Cys 


Ser 




50 






55 




60 








Cys Leu 


Pro 


His 


Arg 


Ser Pro Trp Glu Ser Ser Ser 


Pro 


Leu 


Ser Gly 


65 








70 


75 










Pro Ala 


Xaa 


Ala 


Lys 















80 

<210> 792 
<211> 82 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -32. .-1 

<220> 

<221> UNSURE 
<222> 35 

<223> Xaa = His, Gin 



<400> 792 

Met Ala Lys Met Phe Asp Leu Arg Thr Lys He Met He Gly He Gly 
-30 -25 -20 
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Ser Ser Leu Leu Val Ala Ala Met Val Leu Leu Ser Val Val Phe Cys 

-15 -10 -5 

Leu Tyr Phe Lys Val Ala Lys Ala Leu Lys Ala Ala Lys Asp Pro Asp 
15 10 15 

Ala Val Ala Val Lys Asn His Asn Pro Asp Lys Val Cys Trp Ala Thr 

20 25 30 

Asn Ser Xaa Ala Lys Ala Thr Thr Met Glu Ser Cys Pro Ser Leu Gin 
35 40 45 

Cys Cys 
50 

<210> 793 
<211> 60 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -19.. -1 

<400> 793 



Met Lys lie Leu 


Val 


Ala Leu 


Ala 


Val Phe Phe Leu Val 


Ser Thr 


Gin 


-15 






-10 


-5 




Leu Phe Ala Glu 


Glu 


He Gly Ala Asn Asp Asp Leu Asn Tyr Trp Ser 


1 






5 


10 






Asp Trp Tyr Asp 


Ser Asp Gin 


He 


Lys Glu Glu Leu Pro 


Glu Pro 


Phe 


15 




20 




25 






Glu His Leu Leu Gin Arg lie 


Ala 


Arg Arg Pro Lys 






30 




35 




40 






<210> 794 














<211> 79 














<212> PRT 














<213> Homo sapiens 












<220> 














<221> SIGNAL 














<222> -28. .-1 














<400> 794 














Met lie Tyr Thr 


Met 


Lys Lys 


Val 


His Ala Leu Trp Ala Ser Val Cys 


-25 








-20 


-15 




Leu Leu Leu Asn 


Leu 


Ala Pro 


Ala 


Pro Leu Asn Ala Asp 


Ser Glu 


Glu 


-10 






-5 


1 






Asp Glu Glu His 


Thr 


He He 


Thr Asp Thr Glu Leu Pro 


Pro Leu 


Lys 


5 




10 




15 




20 


Leu Met His Ser 


Phe 


Cys Ala 


Phe 


Lys Ala Asp Asp Gly 


Pro Cys 


Lys 




25 






30 


35 




Ala He Met Lys 


Arg 


Phe Ser 


Ser 


He Phe Ser Leu Asp 


Ser Ala 




40 








45 


50 





<210> 795 
<211> 76 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -24. .-1 



<400> 795 

Met Gly Ser Ala Cys He Lys Val Thr Lys Tyr Phe Leu Phe Leu Phe 
-20 -15 -10 



wo 02/083898 



Asn Leu lie Phe Phe lie Leu Gly Ala 

-5 1 
Trp lie Leu Ala Asp Lys Ser Ser Phe 

10 15 
Ser Ser Ser Leu Arg Met Gly Ala Tyr 
25 30 
Val Thr Met Leu Met Gly Phe Leu Gly 
45 

<210> 796 
<211> 151 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -18.. -1 

<220> 

<221> UNSURE 
<222> 117 

<223> Xaa = Ser, Thr 
<400> 796 



Met 


Lys 


Ala 


Leu 


He Val Leu Gly Leu 








-15 


-10 


Gin 


Gly 


Lys 


Val 


Phe Glu Arg Cys Glu 






1 




5 


Leu 


Gly 


Met 


Asp 


Gly Tyr Arg Gly He 


15 








20 


Leu 


Ala 


Lys 


Trp 


Glu Ser Gly Tyr Asn 










35 


Ala 


Gly 


Asp 


Arg 


Ser Thr Asp Tyr Gly 








50 


55 


Tyr 


Trp 


Cys 


Asn 


Asp Gly Lys Thr Pro 






65 




70 


Leu 


Ser 


Cys 


Ser 


Ala Leu Leu Gin Asp 




80 






85 


Cys 


Ala 


Lys 


Arg 


Val Val Arg Asp Pro 


95 








100 


Ala 


Gly 


He 


Thr 


Gly Val Xaa Asn Cys 










115 


He 


Lys 


Glu 


He 


Thr Gin- Thr 








130 





<210> 797 

<211> 82 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -29. .-1 

<400> 797 

Met Asp Leu Gin Gly Arg Gly Val Pro 
-25 

Leu Leu Met Leu Phe His Thr Met Ala 

-10 -5 
Val Glu Asn Leu Ser Gly Leu Ser Thr 

5 10 
Val Val Arg Glu Asn Gly Thr Thr Cys 
20 25 



PCT/TBOl/00914 

408 



Val 


He 


Leu 


Gly Phe Gly Val 








5 






He 


Ser 


Val 


Leu 


Gin 


Thr Ser 






20 








Val 


Phe 


He 


Gly Val 


Gly Ala 




35 








40 


Cys 


He Gly 








50 












Val 


Leu 


Leu 


Ser 


Val 


Thr Val 










-5 




Leu 


Ala 


Arg 


Thr 


Leu 


Lys Arg 






10 








Ser 


Leu 


Ala 


Asn 


Trp 


Met Cys 




25 








30 


Thr 


Arg Ala 


Thr 


Asn 


Tyr Asn 


40 










45 


He 


Phe 


Gin 


He 


Asn 


Ser Arg 










60 




Gly 


Ala 


Val 


Asn 


Ala 


Cys His 








75 






Asn 


He 


Ala 


Asp Ala 


Val Ala 






90 








Pro 


Thr 


Ser 


Ala 


Ser 


Gin Ser 




105 








110 


Ala 


Arg 


Pro 


His 


Ser 


Val Leu 


120 










125 


Ser 


He 


Asp 


Arg 


Leu 


Arg Val 


-20 










-15 


Gin 


He 


Met 


Ala 


Glu 


Gin Glu 










1 




Asn 


Pro 


Glu 


Lys 


Asp 


He Phe 






15 








Leu 


Met 


Ala 


Glu 


Phe 


Ala Ala 




30 








35 
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Lys Phe lie Val Pro Tyr Asp Val Trp Ala Ser Asn Tyr Val Asp Leu 
40 45 50 

He Thr 

<210> 798 
<211> 60 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -24. .-1 

<400> 798 

Met Glu Glu Gly Gly Asn Leu Gly Gly Leu He Lys Met 

-20 -15 
Leu Val Leu Ser Gly Ala Trp Gly Met Gin Met Trp Val 

-5 15 
Ser Gly Phe Leu Leu Phe Arg Ser Leu Pro Arg His Thr 

10 15 20 

Val Gin Ser Lys Leu Phe Pro Phe Tyr Phe His He 
25 30 35 

<210> 799 
<211> 142 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -26. .-1 

<220> 

<221> UNSURE 
<222> 47 

<223> Xaa « Gly, Arg 

<400> 799 



Met Lys 


Thr 


Glu 


Gin 


Leu Asn 


Arg 


Phe 


Ala 


Gly 


Phe 


Gly 


He 


Gly Leu 


-25 








-20 










-15 








Ala Ser 


Leu 


Phe 


Thr 


Glu Asn 


Val 


Leu 


Ala 


His 


Pro 


Cys 


He 


Val Leu 


-10 








-5 








1 








5 


Arg Arg 


Gin 


Cys 


Gin 


Val Asn 


Tyr 


His 


Ala 


Gin 


His 


Tyr 


His 


Leu Thr 






10 








15 










20 




Pro Phe 


Thr 


Val 


He 


Asn He 


Met 


Tyr 


Ser 


Phe 


Asn 


Lys 


Thr Gin Gly 




25 








30 










35 






Pro Arg 


Ala 


Leu 


Trp Lys Gly Met 


Xaa 


Ser 


Thr 


Phe 


He 


Val 


Gin Gly 


40 








45 










50 








Val Thr 


Leu 


Gly 


Ala 


Glu Gly 


He 


He 


Ser 


Glu 


Phe 


Thr 


Pro 


Leu Pro 


55 








60 








65 








70 


Arg Glu 


Val 


Leu 


His 


Lys Trp 


Ser 


Pro 


Lys 


Gin 


He 


Gly 


Glu 


His Leu 








75 








80 










85 


Leu Leu 


Lys 


Ser 


Leu 


Thr Tyr 


Val 


Val 


Ala 


Met 


Leu 


Phe 


Tyr 


Ser Ala 






90 








95 










100 




Ser Leu 


He 


Glu 


Thr 


Val Gin 


Ser 


Glu 


He 


He Arg Asp 


Asn 






105 








110 










115 







<210> 800 
<211> 126 
<212> PRT 

<213> Homo sapiens 



Val His Leu 

-10 
Thr Phe Val 

Phe Gly Leu 



<220> 
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<221> SIGNAL 


























<222> -44 . .-1 


























<400> 800 


























Met His Glu Glu 


Glu 


He 


Tyr 


Thr 


Ser 


Leu 


Gin 


Trp Asp Ser 


Pro 


Ala 




-40 










-35 










-30 




Pro Asp Thr Tyr 


Gin 


Lys 


Cys 


Leu 


Ser 


Ser 


Asn 


Lys 


Cys 


Ser 


Gly Ala 


-25 










-20 










-15 






Cys Cys Leu Val 


Met 


Val 


He 


Ser 


Cys 


Val 


Phe 


Cys 


Met 


Gly 


Leu 


Leu 


-10 








-5 










1 








Thr Ala Ser He 


Phe 


Leu 


Gly Val 


Lys 


Leu 


Leu 


Gin 


Val 


Ser 


Thr 


He 


5 




10 










15 










20 


Ala Met Gin Gin 


Gin 


Glu 


Lys 


Leu 


He 


Gin 


Gin 


Glu Arg Ala 


Leu 


Leu 




25 










30 










35 




Asn Phe Thr Glu Trp Lys 


Arg Ser Cys Ala Leu 


Gin 


Met 


Lys 


Tyr Cys 


40 










45 










50 






Gin Ala Phe Met 


Gin 


Asn 


Ser 


Leu 


Ser 


Ser 


Ala 


His 


Asn 


Ser 


Ser 


Pro 


55 








60 










65 








Cys Pro Asn Asn Trp 


He 


Gin Asn Arg 


Glu 


Ser 


Cys 


Tyr 


Tyr 






70 






75 










80 










<210> 801 


























<211> 52 


























<212> PRT 


























<213> Homo sapiens 
























<220> 


























<221> SIGNAL 


























<222> -20. .-1 


























<400> 801 


























Met Gly Arg Ala 


Arg Glu 


Val Gly Trp Met Ala 


Ala 


Gly 


Leu 


Met 


He 


-20 




-15 










-10 










-5 


Gly Ala Gly Ala 


Cys 


Tyr 


Cys 


Val 


Tyr 


Lys 


Leu 


Thr 


He 


Gly Arg Asp 




1 








5 










10 






Asp Ser Glu Lys 


Leu 


Glu 


Glu 


Glu 


Gly Glu Glu 


Glu 


Trp Asp Asp Asp 


15 








20 










25 








Gin Glu Leu Asp 


























30 


























<210> 802 


























<211> 154 


























<212> PRT 


























<213> Homo sapiens 
























<220> 


























<221> SIGNAL 


























<222> -15.. -1 


























<400> 802 


























Met Asn He Leu 


Met 


Leu 


Thr 


Phe 


He 


He 


Cys 


Gly 


Leu 


Leu 


Thr 


Arg 


-15 




-10 










-5 










1 


Val Thr Lys Gly 


Ser 


Phe 


Glu 


Pro 


Gin 


Lys 


Cys 


Trp 


Lys 


Asn 


Asn 


Val 


5 










10 










15 






Gly His Cys Arg 


Arg Arg 


Cys 


Leu 


Asp 


Thr 


Glu 


Arg 


Tyr 


He 


Leu 


Leu 


20 








25 










30 








Cys Arg Asn Lys 


Leu 


Ser 


Cys 


Cys 


He 


Ser 


He 


He 


Ser 


His 


Glu 


Tyr 


35 






40 










45 










Thr Arg Arg Pro 


Ala 


Phe 


Pro 


Val 


He 


His 


Leu 


Glu Asp 


He 


Thr 


Leu 


50 




55 










60 










65 


Asp Tyr Ser Asp 


Val Asp 


Ser 


Phe 


Thr 


Gly 


Ser 


Pro 


Val 


Ser 


Met 


Leu 




70 










75 










80 




Asn Asp Leu lie 


Thr 


Phe 


Asp 


Thr 


Thr 


Lys 


Phe 


Gly Glu Thr Met 


Thr 
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85 90 95 

Pro Glu Thr Asn Thr Pro Glu Thr Thr Met Pro Pro Ser Glu Ala Thr 

100 105 110 

Thr Pro Glu Thr Thr Met Pro Pro Ser Glu Thr Ala Thr Ser Glu Thr 

115 120 125 

Met Pro Pro Pro Ser Gin Thr Ala Leu Thr 
130 135 



<210> 803 

<211> 165 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 

<222> -39, .-1 



<400> 803 








Met 


Met 


Thr 


He 


Thr Phe Leu 


Pro 










-35 




Phe 


Pro 


Asp 


Val 


Pro Leu Gly 


He 








-20 






Ala 


He 


Gly 


Val 


Val Gin Ala 


Leu 






-5 






1 


Pro 


His 


Leu 


Leu 


Ser Pro Gin 


He 


10 








15 




Tyr 


Arg 


Arg 


His 


Val Leu Gly 


He 










30 




Phe 


Ala 


Ala 


Ala 


He Phe Ser 


Leu 








45 






Leu 


Met 


Val 


Thr 


Val He Leu 


Leu 






60 






65 


Trp 


Cys 


Arg 


Asp 


Arg Leu Leu 


Gly 




75 






80 




Val 


Glu 


Val 


Phe 


Ser Phe Asp 


Leu 


90 








95 




Val 


Glu 


Ala 


Phe 


Ser Asp Gly 


Val 










110 




He 


Leu 


Asp 


He 


Trp 





125 



Tyr 


Thr 


Phe 


Ser 


Leu 


Met 


Val 


Thr 




-30 










-25 




Phe 


Leu 


Phe 


Cys 


Val 


Cys 


Val 


He 


-15 










-10 






He 


Val 


Gly Tyr Ala 


Phe 


His 


Phe 


Gin Arg 


Ser 


Ala 


His 


Arg 


Ala 


Leu 






20 










25 


Val 


Leu 


Gin Gly Pro 


Ala 


Leu Cys 




35 










40 




Phe 


Phe 


Val 


Pro 


Leu 


Ser 


Tyr 


Leu 


50 










55 






Pro 


Tyr 


Val 


Ser 


Lys 


Val 


Thr Gly 










70 








His 


Arg 


Glu 


Pro 


Ser 


Ala 


His 


Pro 








85 










His 


Glu 


Pro 


Leu 


Ser 


Lys 


Glu 


Arg 






100 










105 


Tyr 


Ala 


He 


Val 


Ala 


Thr 


Leu 


Leu 




115 










120 





<210> 804 
<211> 145 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -22. .-1 



<220> 

<221> UNSURE 
<222> 84 

<223> Xaa « Ala, Pro, Ser, Thr 
<220> 

<221> UNSURE 
<222> -1 

<223> Xaa « Ala, Ser 



<220> 

<221> UNSURE 
<222> 21 



wo 02/083898 
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<223> Xaa = Cys, Phe, lie, Asn, Ser, Thr, Tyr 
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<220> 

<221> UNSURE 
<222> 46 

<223> Xaa = Cys, Phe, Ser, Tyr 



<400> 804 



Met 


Gly 


Ara 
-20 


Tro 


Ser 


Leu 


Thr 


Ala 
-15 


Leu 


Pro 
-5 


Met 


Val 


Thr 


Xaa 


Gly 
1 


Pro 


Thr 


Ser 


Trp 


Asp 


Thr 
15 


Val 


Leu 


Thr 


Gin 


Leu 


He 


Ser 
30 


Thr 


Thr 


Leu 


Gly 


Pro 


Thr 


Thr 


Xaa 


Cys 


Ser Gly 


Thr 






45 










50 


Asp 


Thr 

60 


Met 


Gin 


Arg 


Glu 


Glu 

65 


Leu 


Glu 


Pro 


Ala 


Arg 


Ala 


Glu Asp 


Leu 


75 










80 






Thr 


Ser 


Cys 


Cys 


Pro 
95 


Thr 


Leu 


Ala 


Gly 


Thr 


He 


Leu 
110 


Met 


Thr 


He 


Met 



Glu 



<210> 805 
<211> 109 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -30. .-1 

<220> 

<221> UNSURE 
<222> 64 

<223> Xaa = He, Met 

<220> 

<221> UNSURE 
<222> 58 

<223> Xaa « Leu, Val 
<220> 

<221> UNSURE 
<222> 42 

<223> Xaa = Lys,Arg 
<400> 805 

Met Ala Gly Lys Arg Ser Gly Trp 
-30 -25 
Leu Leu Gly Val Asn Leu Val Val 
-10 

Leu Arg Phe Val Thr Leu Glu Gly 

5 10 
Gin Ala Leu Arg Gin Arg Tyr His 

20 25 
Arg Gin Glu Val Tyr Val Arg Xaa 
35 40 



Ser 


Pro Val Thr 


Leu 


Thr 


Ser 


Leu 






-10 








Ala 


Ala Gly Lys 


Ser 


Ser 


Ser 


Ser 




5 








10 


Ala 


He Xaa Cys 


Ala 


Ser 


Thr 


Val 




20 






25 




Ala 


Ser Ala Ser 


Gly Ala 


Arg Met 


35 






40 






Thr 


Val Phe Leu 


Thr 


Ala 


Leu 


Gin 






55 








Val 


Lys Asn Ala 


Thr 


Pro 


Pro 


Ala 




70 










Ser 


Xaa Ala Pro 


His 


Val 


Thr 


Pro 




85 








90 


Pro 


Ala Ala Pro 


Pro 


Ala 


Ser 


Leu 




100 






105 




Phe 


Ala Ser Val 


Gly 


Phe 


Ser 


Asn 


115 






120 






Ser 


Arg Ala Ala 


Leu 


Leu 


Gin 


Leu 




-20 








-15 


Met 


Pro Pro Thr 


Arg Ala 


Arg 


Ser 




-5 






1 




Met 


Leu Gin His 


Trp 


Glu 


Leu 


Gly 






15 








Gly 


Phe Leu Asn 


Thr 


Ser 


Tyr 


His 




30 










Tyr 


Phe Asp Arg 


Thr 


Leu 


Met 


Ser 




45 








50 



wo 02/083898 



Ala Glu Ala Asn Leu Ala Gly Xaa Phe 
55 

Phe Asn Pro Asn lie Ser Trp Gin Pro 
70 75 

<210> 806 
<211> 57 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -32.. -1 

<400> 806 



Met Ala Thr 


Pro 


Pro Phe Arg Leu 


He 


Arg Lys 


Met 


Phe 


Ser 


Phe 


Lys 


-30 




-25 








-20 








Val Ser Arg 


Trp Met Gly Leu Ala Cys 


Phe Arg 


Ser 


Leu 


Ala 


Ala 


Ser 


-15 




-10 






-5 










Ser Pro Ser 


lie Arg Gin Lys Lys 


Leu 


Met His 


Lys 


Leu 


Gin 


Glu 


Glu 


1 




5 




10 








15 




Asn Ala Phe 


Arg Glu Glu Met Lys 


He 
















20 




25 














<210> 807 




















<211> 79 




















<212> PRT 




















<213> Homo sapiens 
















<220> 




















<221> SIGNAL 


















<222> -19..- 


-1 


















<400> 807 




















Met Val Gly 


He 


Leu Pro Leu Cys 


Cys 


Ser Gly Cys 


Val 


Pro 


Ser 


Leu 






-15 




-10 








-5 




Cys Cys Ser 


Ser 


Tyr Val Pro Ser 


Val 


Ala Pro 


Thr 


Ala 


Ala 


His 


Ser 




1 


5 








10 








Val Arg Val 


Pro 


His Ser Ala Gly His 


Cys Gly Gin Arg Val 


Leu 


Ala 


15 




20 






25 










Cys Ser Leu 


Pro 


Gin Val Phe Leu 


Lys 


Pro Trp 


He 


Phe 


Val 


Glu 


His 


30 




35 




40 










45 


Phe Ser Ser 


Trp 


Leu Ser Leu Glu 


Leu 


Phe Ser 


Phe 


Leu Arg 


Tyr 





50 55 60 



<210> 808 
<211> 147 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -20.. -1 



<400> 808 



Met 


Asp Ser 


Ser 


Thr 


Ala 


His 


Ser 


Pro 


Val 


Phe 


Leu 


Val 


Phe 


Pro 


Pro 


-20 








-15 










-10 










-5 


Glu 


He Thr 


Ala 


Ser 


Glu 


Tyr 


Glu 


Ser 


Thr 


Glu 


Leu 


Ser 


Ala 


Thr 


Thr 








1 








5 










10 






Phe 


Ser Thr 


Gin 


Ser 


Pro 


Leu 


Gin 


Lys 


Leu 


Phe 


Ala 


Arg 


Lys 


Met 


Lys 




15 










20 










25 








He 


Leu Gly 


Thr 


He 


Gin 


He 


Leu 


Phe 


Gly 


He 


Met 


Thr 


Phe 


Ser 


Phe 



30 35 40 
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Pro Pro Asn Gly Xaa Gin Arg 
60 65 
He Pro Val His 
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414 



Gly Val 


He 


Phe 


Leu Phe 


Thr 


Leu 


Leu 


Lys Pro Tyr 


Pro 


Arg 


Phe 


Pro 


45 








50 








55 








60 


Phe 


He 


Phe 


Leu 


Ser Gly Tyr 


Pro 


Phe Trp Gly Ser Val 


Leu 


Phe 


He 










65 








70 






75 




Asn 


Ser 


Glv 


Ala 


Phe Leu 


He 


Ala 


Val 


Lys Arg Lys 


Thr 


Thr 


Glu 


Thr 








80 








85 






90 






Leu 


He 


He 


Leu 


Ser Arg 


He 


Met 


Asn 


Phe Leu Ser 


Ala 


Leu 


Gly Ala 






95 








100 






105 








He 


Ala 


Gly 


He 


He Leu 


Leu 


Thr 


Phe 


Gly Phe His 


Pro 


Arg 


Ser 


Lys 




110 








115 






120 










Leu 


His 


Leu 






















125 



























<210> 809 
<211> 96 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -33. .-1 



<400> 809 






Met 


Lys 


Leu Lys 


Leu Lys Asn 


Val 






-30 






He 


Ala 


Gly Leu 


Leu Tyr Ala 


Leu 






-15 




-10 


Cys 


Leu 


Pro Pro 


Leu Arg Ala 


Ala 




1 




5 




Leu 


Arg 


He Ser 


Gin Leu Gin 


Ala 








20 




Pro 


Ala 


Gin Pro 


Pro Glu Pro 


Glu 






35 






Thr 


Pro 


Thr Tyr Ala Arg Leu 


Val 






50 




55 



Phe 


Leu 


Ala 


Tyr Phe 


Leu 


Val 


Ser 


-25 








-20 






Val 


Gin 


Leu Gly Gin 


Pro 


Cys 


Asp 


Ala 


Glu 


Gin 


Leu Arg 


Gin 


Lys 


Asp 






10 








15 


Glu 


Leu 


Arg 


Arg Pro 


Pro 


Pro 


Ala 




25 








30 




Ala 


Leu 


Pro 


Thr He 


Tyr 


Val 


Val 


40 








45 






Gin 


Lys Ala 


Glu Leu 


Val 


Arg 


Leu 



60 



<210> 810 
<211> 169 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -16. .-1 



<400> 810 








Met 


Ser 


Gly 


Cys 


Gly Leu Phe 


Leu 




-15 






-10 




Cys 


Arg 


Gly 


Leu 


Val Val Ser 


Thr 


1 








5 




Ser 


Pro 


Pro 


Val 


Arg Ala Phe 


Ala 








20 






Lys 


Lys 


Lys 


Glu 


Val Phe Pro 


Phe 






35 






40 


Asn 


Glu 


He 


Asn 


Gin Phe Leu 


Gly 




50 






55 




Glu 


Val 


Asp 


Ser 


Arg Lys He 


Asp 


65 








70 




Thr 


Leu 


Glu 


Lys 


Leu Lys Ser 


Leu 










85 




Glu 


Glu 


Tyr 


Gly 


Gly Leu Gly 


Phe 








100 






Gly 


Glu 


He 


He 


Ser Met Asp 


Gly 



Arg 


Thr 


Thr Ala 
-5 


Ala 


Ala 


Arg Ala 


Ala 


Asn 


Arg Arg 


Leu 


Leu 


Arg 


Thr 




10 








15 




Lys 


Glu 


Leu Phe 


Leu 


Gly 


Lys 


He 


25 








30 






Pro 


Glu 


Val Ser 


Gin 


Asp 


Glu 


Leu 








45 








Pro 


Val 


Glu Lys 


Phe 


Phe 


Thr 


Glu 






60 










Gin Glu Gly Lys 


He 


Pro 


Asp Glu 






75 








80 


Gly Leu 


Phe Gly Leu 


Gin 


Val 


Pro 




90 








95 




Ser 


Asn 


Thr Met 


Tyr 


Ser 


Arg 


Leu 


105 








110 






Ser 


He 


Thr Val 


Thr 


Leu 


Ala 


Ala 



wo 02/083898 



115 

His Gin Ala lie Gly Ser 
130 

Arg Lys Pro Asn Asn Leu 
145 150 

<210> 811 
<211> 136 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -36. .-1 



<400> 811 



Met 


Glu 


Glu Leu Gin Glu 


Pro Leu 


Arg 


Gly Glu Leu Arg Leu Cys Phe 




-35 




-30 




-25 


Thr 


Gin 


Ala Ala Arg Thr 


Ser Leu 


Leu 


Leu Leu Arg Leu Asn Asp Ala 


-20 




-15 






-10 -5 


Ala 


Leu 


Arg Ala Leu Gin 


Glu Cys 


Gin 


Arg Gin Gin Val Arg Pro Val 






1 




5 


10 


lie 


Ala 


Phe Gin Gly His 


Arg Gly 


Tyr 


Leu Arg Leu Pro Gly Pro Gly 






15 


20 




25 


Trp 


Ser 


Cys Leu Phe Ser 


Phe He 


Val 


Ser Gin Cys Cys Gin Glu Gly 




30 




35 




40 


Ala 


Gly 


Gly Ser Leu Asp 


Leu Val 


Cys 


Gin Arg Phe Leu Arg Ser Gly 


45 




50 






55 60 


Pro 


Asn 


Ser Leu His Cys 


Leu Gly 


Ser 


Leu Arg Glu Arg Leu He He 






65 






70 75 


Trp 


Ala 


Ala Met Asp Ser 


He Pro 


Ala 


Pro Ser Ser Val Gin Gly His 






80 




85 


90 


Asn 


Leu 


Thr Glu Asp Ala 


Arg His 










95 


100 







<210> 812 
<211> 109 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -23. .-1 

<220> 

<221> UNSURE 
<222> 81 

<223> Xaa = Pro, Ser 

<400> 812 



Met 


Cys 


Tyr 


Gly 


Lys Cys 


Ala Arg 


Cys 


He 


Gly His 


Ser 


Leu 


Val 


Gly 








-20 






-15 








-10 






Leu 


Ala 


Leu 


Leu 


Cys He 


Ala Ala 


Asn 


He 


Leu Leu 


Tyr 


Phe 


Pro 


Asn 






-5 






1 






5 










Gly Glu Thr 


Lys 


Tyr Ala 


Ser Glu 


Asn 


His 


Leu Ser Arg 


Phe 


Val 


Trp 


10 








15 








20 








25 


Phe 


Phe 


Ser 


Gly 


He Val 


Gly Gly 


Gly 


Leu 


Leu Met 


Leu 


Leu 


Pro 


Ala 










30 






35 








40 




Phe 


Val 


Phe 


He 


Gly Leu 


Glu Gin 


Asp 


Asp Cys Cys 


Gly Cys 


Cys Gly 








45 






50 








55 






His 


Glu 


Asn 


Cys 


Gly Lys 


Arg Cys 


Ala 


Met 


Leu Ser 


Ser 


Val 


Leu 


Ala 






60 






65 








70 








Ala 


Leu 


He 


Gly 


He Ala 


Gly Xaa 


Gly 


Tyr 


Cys Val 


He 
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120 ' 125 

Arg Gly Ser Ser Trp Leu Ala Leu Arg Ser 
135 140 
Pro Lys Leu 
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75 80 85 



<210> 813 
<211> 74 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -17. .-1 



<400> 813 

Met Trp Pro Asn Leu Leu Pro Leu 
-15 -10 
Gly Arg Phe Arg Cys Asn Gly Phe 

1 5 
Glu Glu Asn Cys Thr Ala Asn Pro 
20 

His Cys Lys Asn Gly Leu Cys lie 
35 

Gin Asn Asn Cys Gin Asp Asn Ser 
50 55 



Cys Gin Arg lie His Cys lie lie 
-5 

Glu Asp Cys Pro Asp Gly Ser Asp 

10 15 
Leu Leu Cys Ser Thr Ala Arg Tyr 

25 30 
Asp Lys Ser Phe lie Cys Asp Gly 
40 45 
Asp Glu 



<210> 814 
<211> 137 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -47.. -1 

<220> 

<221> UNSURE 
<222> 87 

<223> Xaa =» Pro, Arg 



<400> 814 








Met Thr 


Leu 


Gin Trp His 


Leu 


Tyr 




-45 






-40 


Gin Asp 


Met 


Leu Arg Ala 


Glu 


Val 


-30 






-25 




Gly Asp 


Met 


Ala Thr Met 


Leu 


Gin 


-15 




-10 






lie Lys 


Glu 


Thr Leu Arg 


Leu 


His 


Tyr Leu 


Val 


Asn Asp Leu Val 


Leu 




20 






25 


Thr Leu 


Val 


Gin Val Ala 


He 


Tyr 


35 






40 




Phe Phe 


Asp 


Pro Glu Asn 


Phe 


Asp 


50 




55 






Lys Asn 


He 


Thr Tyr Phe 


Arg 


Asn 






70 






Gin Cys 


Leu 


Gly Arg Xaa 


He 


Ala 



85 



Glu Met Ala Arg Asn Leu Lys Val 
-35 

Leu Ala Ala Arg His Gin Ala Gin 
-20 

Leu Val Pro Leu Leu Lys Ala Ser 

-5 1 
Pro He Ser Val Thr Leu Gin Arg 
10 15 
Arg Asp Tyr Met He Pro Ala Lys 
30 

Ala Leu Gly Arg Glu Pro Thr Phe 
45 

Pro Thr Arg Trp Leu Ser Lys Asp 

60 65 
Leu Gly Phe Gly Trp Gly Val Arg 
75 80 

Glu 
90 



<210> 815 
<211> 98 
<212> PRT 

<213> Homo sapiens 



<220> 
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417 

<221> SIGNAL 
<222> -20. .-1 



<400> 815 






Met Asp 


Lys Asp Ser 


Gin Gly Leu 


-20 






-15 


Gly Thr 


Ala 


Ser Arg 

1 


Ser Glu Asp 


Arg Ala 


Ser 


Ser Gin 


Ala Leu Gly 




15 




20 


Ser Arg 


Phe 


lie Lys 


Arg Lys Lys 


30 






35 


Ser Leu 


Ala 


Lys Ser 


Ser Thr Arg 


45 






50 


Gin Gly 


Ala 


Val Arg 


Val Val Asn 






65 




Tyr Gin 









Leu Asp Ser Ser Leu Met Ala Ser 
-10 -5 

Glu Glu Ser Leu Ala Gly Gin Lys 
5 10 
Thr lie Pro Lys Arg Arg Ser Ser 
25 

Phe Asp Asp Glu Leu Val Glu Ser 
40 

Ala Lys Gly Ala Ser Gly Val Glu 

55 60 
Pro Pro Pro Val Arg Arg Arg Arg 
70 75 



<210> 816 
<211> 113 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -36.. -1 



<400> 816 



Met 


Met 


Thr 


Glu 


Lys 


His 


Glu Asp His Cys Leu Lys 


Ser 


Cys 


Ser Gly 




-35 










-30 -25 










His 


Ser 


Tyr 


He 


Arg 


Lys 


Asn Trp Leu Gly Cys He 


Val 


Phe 


Pro 


Phe 


-20 










-15 


-10 








-5 


Ser 


Ala 


Leu 


Leu 


Gin 
1 


Gin 


Ser Glu He Ser Gly Thr 
5 


Phe 


Gin 

10 


Val 


Thr 


He 


Pro 


Pro 


Val 


Leu 


Leu Gly Tyr Thr Trp Ser Asn 


Thr 


Tyr 


Val 


Phe 






15 








20 


25 








Pro 


Lys 
30 


Glu 


Asp 


Ser 


Asn 


Glu Gin Asn Leu Lys Glu 
35 40 


Cys 


Thr 


Phe 


Leu 


Asn 


He 


Phe 


Ala 


Thr 


He 


Glu Pro Gin He Ser Tyr 


Val 


Thr 


Cys 


Asn 


45 










50 


55 








60 


Pro 


Thr 


Leu Asp 


Lys 


Phe 


Leu Asp Gin Thr Glu Val 


Leu 


Gin 


Arg 


Ala 










65 




70 






75 





Gin 



<210> 817 
<211> 136 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -41.. -1 



<220> 

<221> UNSURE 
<222> 91 

<223> Xaa = Phe, Leu 



<400> 817 

Met Lys Ala He Lys Lys Ser Leu Thr Glu Glu Glu Tyr Leu Tyr Leu 

-40 -35 -30 

Asp Phe Ser His Gin Thr Glu Gly Cys He Phe Pro Leu His Thr Ser 
-25 -20 -15 -10 
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Val Thr Leu Phe Leu Leu Ser Tyr Cys Asp Cys Lys lie Phe Lys lie 

-5 15 
Cys Leu Val Val Thr Lys Glu Val Ser Arg Asp Ser Ser Leu Leu Arg 

10 15 20 

Asp Asp Leu He Gin Asp Val Glu He Gin He He Ser Arg Gin Glu 

25 30 35 

Leu Pro Pro He Val Gin Asn Cys Cys Leu Pro Ala Val Val Glu Arg 
40 45 50 55 

Ser Asp Asn Phe Cys Arg Ala Gly Leu Ala Val Val Leu Arg His He 

60 65 70 

He Gin Lys Ser Tyr Glu Ala Asp Pro Leu Lys Lys Glu Leu Leu Glu 

75 80 85 

Leu Leu Gly Xaa Lys Arg Leu Ala 
90 95 

<210> 818 
<211> 137 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -16. .-1 

<220> 

<221> UNSURE 

<222> 57 

<223> Xaa = Asp, Gly 
<220> 

<221> UNSURE 

<222> 58 

<223> Xaa = Cys, Ser 

<400> 818 



Met 


Glu 


Asn 


Phe 


Ser 


Leu 


Leu 


Ser 


He Ser Gly Pro Pro He 


Ser 


Ser 




-15 










-10 




-5 






Ser 


Ala 


Leu 


Ser 


Ala 


Phe 


Pro 


Asp 


He Met Phe Ser Arg Ala 


Thr 


Ser 


1 








5 








10 


15 




Leu 


Pro 


Asp 


He 


Ala 


Lys 


Thr 


Ala 


Val Pro Thr Glu Ala Ser 


Ser 


Pro 






20 










25 30 






Ala 


Gin 


Ala 


Leu 


Pro 


Pro 


Gin 


Tyr 


Gin Ser He He Val Arg 


Gin 


Gly 






35 










40 


45 






He 


Gin 


Asn 


Thr 


Val 


Leu 


Ser 


Pro 


Xaa Xaa Ser Leu Gly Asp 


Thr 


Gin 




50 










55 




60 






His 


Gly 


Glu 


Lys 


Leu 


Arg Arg Asn 


Cys Thr He Tyr Arg Pro 


Trp 


Phe 


65 










70 






75 




80 


Ser 


Pro 


Tyr 


Ser 


Tyr 


Phe 


Val 


Cys 


Ala Asp Lys Glu Ser Gin 


Leu 


Glu 










85 








90 


95 




Ala 


Tyr 


'^sp 


Phe 


Pro 


Glu 


Val 


Gin Gin Asp Glu Gly Lys Trp 


Asp 


Asn 








100 










105 110 






Cys 


Leu 


Ser 


Glu 


Asp 


Met 


Ala 


Glu 


Asn 







115 120 



<210> 819 
<211> 113 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 34 

<223> Xaa « Ala, Glu 
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<400> 819 








Met Leu 


Gly 


Ser 


Arg Val Asp 


Arg 


1 






5 




Val Val 


Leu 


Glu 


Glu Asp Gin 


Val 






20 






Glu Xaa 


Gly 


Cys 


Arg Leu Arg 


Asp 




35 






40 


Lys Gly 


Lys 


Val 


He Leu Ser 


Met 


50 






55 




Val Phe 


Trp 


Glu 


Phe lie Asn 


Ser 


65 






70 




Tyr His 


Ser 


Lys 


Asn Pro Glu 


Val 








85 




Trp Trp 


Phe 


Leu 


Ser Trp Phe 


Asn 






100 






Gly 










<210> 820 








<211> 174 








<212> PRT 








<213> Homo sapiens 




<400> 820 








Met Ala 


Ala Gly 


Leu Phe Gly 


Leu 


1 






5 




Ala Ala 


Thr Arg 


Gly Leu Pro 


Ala 






20 






Phe Ser 


Arg 


Thr 


Val Val Ala 


Pro 




35 






40 


Pro Glu 


Pro 


Thr 


Thr Pro Trp 


Gin 


50 






55 




Asn Leu 


Tyr Glu 


Lys Asn Pro 


Asp 


65 






70 




Val Leu 


Asp Val 


Trp Asn Met 


Arg 








85 




He He 


Leu 


Val 


Leu Gly Ser 


Thr 






100 






Arg Met 


Lys 


Glu 


Trp Ser Arg 


Arg 




115 






120 


Arg Glu 


Ala 


Asn 


Gly Phe Pro 


Ser 


130 






135 




Ala Arg 


Ser 


Ser 


Cys Gin Arg 


Met 


145 






150 




Gin Glu 


Ala 


Pro 


Pro Ser Pro 


Pro 








165 





<210> 821 
<211> 159 
<212> PRT 

<213> Homo sapiens 
<400> 821 

Met Glu Arg Tyr Ala Ala Ala Leu 

1 5 

Gin Gin Glu Arg His Tyr Gin Leu 

20 

Lys Glu Leu Pro Ser Ser Phe Gin 

35 40 
Ser Asp Leu Ala Leu Ala Leu Leu 

50 55 
Gin Gly Leu Leu Glu He Gin His 
65 70 



419 



Pro Lys 


Leu Ser 


Val 


Ala 


Pro 


Ser 


10 








15 




Leu Val 


Ser Pro 


Ala 


Val 


Asp 


Leu 


25 






30 






Phe Thr 


Glu Lys 


He 


Met 


Asn 


Val 






45 








Leu Val 


Val Ser 


Thr 


Val 


He 


He 




60 










Thr Glu 


Gly Ser 


Phe 


Leu 


Trp 


He 




75 








80 


Asp Asp 


Ser Ser 


Ala 


Gin 


Lys 


Gly 


90 








95 




Asn Gly 


He His 


Asn 


Tyr 


Gin 


Gin 


105 






110 






Ser Ala 


Arg Arg 


Leu 


Leu 


Ala 


Ala 


10 








15 




Ala Arg 


Val Arg Trp Glu 


Ser 


Ser 


25 






30 






Ser Ala 


Val Ala 


Gly 


Lys 


Arg 


Pro 






45 








Glu Asp 


Pro Glu 


Pro 


Glu 


Asp 


Glu 




60 










Ser His 


Gly Tyr Asp 


Lys 


Asp 


Pro 




75 








80 


Leu Val 


Phe Phe 


Phe 


Gly 


Val 


Ser 


90 








95 




Phe Val 


Ala Tyr 


Leu 


Pro 


Asp 


Tyr 


105 






110 






Glu Ala 


Glu Arg 


Leu 


Val 


Lys 


Tyr 






125 








Trp Asn 


Pro Thr 


Ala 


Ser 


Thr 


Pro 




140 










Ser Asp 


Gin Leu 


Leu 


Ser 


Gly 


Ala 




155 








160 


Pro Ala 


Cys His 


Ser Asp 






170 












Glu Glu 


Val Ala Asp Gly 


Ala 


Arg 


10 








15 




Leu Ser 


Ala Leu 


Gin 


Ser 


Leu 


Val 


25 






30 






Gin Arg 


Leu Ser 


Tyr 


Thr 


Thr 


Leu 






45 








Asp Gly 


Thr Val 


Phe 


Glu 


He 


Val 




60 










Leu Thr 


Glu Lys 


Ser 


Leu 


Tyr 


Asn 




75 








80 



wo 02/083898 PCT/IBO 1/009 14 

420 



Gin Arg Leu Arg Leu 


Gin Asn Glu His Arg 


Val 


Leu 


Arg 


Gin 


Ala 


Leu 


85 


90 










95 




Arg Gin Lys His Gin 


Glu Ala Gin Gin Ala 


Cys Arg 


Pro 


His 


Asn 


Leu 


100 


105 








110 






Pro Val Val Gin Ala 


Ala Gin Gin Arg Glu 


Leu 


Glu 


Ala 


Val 


Glu 


His 


115 


120 






125 








Arg He Arg Glu Glu 


Gin Arg Ala Met Asp 


His 


Lys 


He 


He 


Leu 


Glu 


130 


135 




140 










Leu Asp Arg Lys Val 


Ala Asp Gin Gin Ser 


Thr 


Leu 


Glu 


Lys 


Ala 




145 


150 


155 












<210> 822 
















<211> 143 
















<212> PRT 
















<213> Homo sapiens 
















<400> 822 
















Met Ser Gly Phe Glu 


Asn Leu Asn Thr Asp 


Phe 


Tyr 


Gin 


Thr 


Ser 


Tyr 


1 5 


10 










15 




Ser lie Asp Asp Gin 


Ser Gin Gin Ser Tyr 


Asp Tyr 


Gly 


Gly 


Ser 


Gly 


20 


25 








30 






Gly Pro Tyr Ser Lys 


Gin Tyr Ala Gly Tyr 


Asp Tyr 


Ser 


Gin 


Gin 


Gly 


35 


40 






45 








Arg Phe Val Pro Pro 


Asp Met Met Gin Pro 


Gin 


Gin 


Pro 


Tyr 


Thr 


Gly 


50 


55 




60 










Gin He Tyr Gin Pro 


Thr Gin Ala Tyr Thr 


Pro 


Ala 


Ser 


Pro 


Gin 


Pro 


65 


70 


75 










80 


Phe Tyr Gly Asn Asn 


Phe Glu Asp Glu Pro 


Pro 


Leu 


Leu 


Glu 


Glu 


Leu 


85 


90 










95 




Gly He Asn Phe Asp 


His He Trp Gin Lys 


Thr 


Leu 


Thr 


Val 


Leu 


His 


100 


105 








110 






Pro Leu Lys Val Ala 


Asp Gly Ser He Met 


Asn 


Glu 


Thr 


Asp 


Leu 


Ala 


115 


120 






125 








Gly Pro Met Val Phe 


Cys Leu Ala Phe Trp 


Ser 


Thr 


Leu 


Leu 


Leu 




130 


135 




140 










<210> 823 
















<211> 138 
















<212> PRT 
















<213> Homo sapiens 
















<400> 823 
















Met Val Lys Thr Lys 


Leu Lys Thr Val Val 


He Asp 


He 


Thr 


Val 


He 


1 5 


10 










15 




Pro Phe Ala Asp Asp 


Phe Phe Cys Pro Gly 


Ala 


Pro 


Ala 


Gly 


Val 


Glu 


20 


25 








30 






He Ser Pro Ala Ala 


Asp Gly Pro Ser Pro 


Glu Gly 


Pro 


He 


Ala 


Glu 


35 


40 






45 








Pro Asp Gly Glu Ser 


Asp Gly Glu Glu Glu 


Val 


Glu 


Pro 


Pro 


Gly 


Ala 


50 


55 




60 










Asn Asp Gly Glu Thr 


Val Gly Glu Phe Glu 


Thr 


Gly 


Asp 


Glu 


Asp 


Leu 


65 


70 


75 










80 


Gly Ala He Asp Pro 


Gly Gly Glu Val He 


Gly Ala 


Glu 


Asp 


Gly 


Gly 


85 


90 










95 




He Val Glu Asp Asp 


Leu Gly Ala He Gly 


Ala Gly 


Gly 


Glu 


Val 


He 


100 


105 








110 






Gly Ala Glu Asp Gly 


Glu Thr Gly Asp Val 


Asp 


Leu 


Gly 


Gly 


Ala 


Gly 


115 


120 






125 









Gly Glu Val Val Gly Glu Thr Gly Glu Leu 
130 135 

<210> 824 
<211> 74 



wo 02/083898 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 53 

<223> Xaa = * , Ala, Asp, Glu, Phe, Leu, Ser, Val, Tyr 
<220> 

<221> UNSURE 
<222> 28 

<223> Xaa = Hi5,Asn,Tyr 
<220> 

<221> UNSURE 
<222> 26 

<223> Xaa = He, Lys, Met , Asn 



PCT/IBOl/00914 

421 



<220> 

<221> UNSURE 
<222> 27 

<223> Xaa = any one of the twenty amino acids 
<400> 824 

Met Arg Ser His Thr He Thr Met Thr Thr Thr Ser Val Ser Ser Trp 

15 10 15 

Pro Tyr Ser Ser His Arg Met Arg Phe Xaa Xaa Xaa His Ser Asp Gin 

20 25 30 

Pro Pro Gin Asn Phe Ser Ala Thr Pro Asn Val Thr Thr Cys Pro Met 

35 40 45 

Asp Glu Lys Leu Xaa Thr Thr Val Leu Thr Thr Ser Tyr Ser Val He 

50 55 60 

Phe He Val Gly Leu Val Gly Asn He He 
65 70 



<210> 825 
<211> 135 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 8 

<223> Xaa = Glu, Gin 



<220> 

<221> UNSURE 
<222> 63 

<223> Xaa - Glu, Gly 
<220> 

<221> UNSURE 
<222> 133 

<223> Xaa = His, Pro 



<400> 825 

Met Lys Glu Ala Gly Gin Met Xaa 
1 5 
Arg Ser Val Ser Thr Gin Thr Gly 
20 

Leu Ser Lys Val Asn Leu Phe Thr 

35 40 
Phe Pro Ala Glu Ala Gin Arg Gin 



Asn Leu Glu Ser Ala Arg Ala Gly 

10 15 
Ser Met Thr Gly Gin He Pro Arg 
25 30 
Leu Leu Ser Leu Trp Met Glu Leu 
45 

Lys Ser Gin Lys Asn Glu Xaa Gly 



wo 02/083898 
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50 




55 


60 








Lys 


His 


Gly 


Pro Leu Gly Asp Asn Glu 


Glu Arg Thr Arg 


Val 


Ser 


Thr 


65 






70 


75 






80 


Asp 


Lvs 


Arg 


Gin Val Lys Arg Thr Gly 


Leu Val Val Val 


Lys 


Asn 


Met 








85 


90 




95 




Lys 


lie 


Val 


Gly Leu His Cys Ser Ser 


Glu Asp Leu His 


Ala 


Gly 


Gin 








100 105 




110 






lie 


Ala 


Leu 


He Lys His Gly Ser Arg 


Leu Lys Asn Cys 


Asp 


Pro 


Tyr 






115 


120 


125 








Phe 


Ser 


Arg Lys Xaa Val Leu 












130 




135 











<210> 826 
<211> 61 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 

<222> 44 

<223> Xaa = * , Glu, Gin 



<400> 826 

Met Pro Gly Ala Ala Gly Arg Arg 

1 5 

Gly Pro Asp Phe Gly Ser Ser Gly 

20 

Ser Thr Cys Pro Asp Pro Ser Leu 

35 40 
Pro Glu Pro Lys Pro Cys Arg Gly 
50 55 



Cys 


Pro Asp Arg 


Ser 


Pro 


Pro 


Phe 




10 






15 




Val 


Leu Met Leu 


Leu 


Lys 


Pro 


Glu 


25 






30 






Tyr 


Asp Ala Xaa 


Leu 


Cys 


Pro 


Val 






45 








Val 


Ala Ser Asp 


Pro 









60 



<210> 827 
<211> 130 
<212> PRT 

<213> Homo sapiens 



<400> 827 




Met Arg 


Ser 


His 


1 






Pro Tyr 


Ser 


Ser 






20 


Pro Pro 


Gin 


Asn 




35 




Asp Glu 


Lys 


Leu 


50 






Phe He 


Val 


Gly 


65 






Gly He 


His 


Arg 


Ala lie 


Ala 


Asp 






100 


Tyr His 


He 


Asn 




115 




Val Val 






130 







Thr He Thr Met 
5 

His Arg Met Arg 

Phe Ser Ala Thr 
40 

Leu Ser Thr Val 
55 

Leu Val Gly Asn 

70 

Lys Arg Asn Ser 
85 

Leu Leu Leu He 

Gin Asn Lys Trp 
120 



Thr 


Thr 


Thr 


Ser 




10 






Phe 


He 


Thr 


Asn 


25 








Pro 


Asn 


Val 


Thr 


Leu 


Thr 


Thr 


Ser 








60 


He 


He 


Ala 


Leu 






75 




He 


Gin 


He 


Tyr 




90 






Phe 


Cys 


Leu 


Pro 


105 








Thr 


Leu Gly Val 



Val Ser Ser Trp 
15 

His Ser Asp Gin 
30 

Thr Cys Pro Met 
45 

Tyr Ser Val He 

Tyr Val Phe Leu 
80 

Leu Leu Asn Val 

95 

Phe Arg He Met 
110 

He Leu Cys Lys 
125 



<210> 828 
<211> 67 
<212> PRT 

<213> Homo sapiens 



<400> 828 



wo 02/083898 



Met 


Asn 


Phe 


Leu 


Ser Ala Leu Gly Ala 


1 








5 


Thr 


Phe 


Gly 


Phe 


He Leu Asp Gin Asn 








20 


25 


Gin 


Asn 


Ser 


Gin 


Cys Lys Ala Val Thr 






35 




40 


He 


Thr 


Leu 


Met 


Thr Phe Ser He He 




50 






55 


Ser 


Gin 


Phe 






65 











PCT/IBOl/00914 

423 

He Ala Gly He He Leu Leu 
10 15 
Tyr He Cys Gly Tyr Ser His 
30 

Val Leu Phe Leu Gly He Leu 
45 

Glu Leu Phe He Ser Leu Leu 
60 



<210> 829 
<211> 55 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 26 

<223> Xaa = His, Gin 



<400> 829 

Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg Gin 

15 10 15 

Tyr Thr Ser Pro Glu Glu He Asp Ala Xaa Cys Arg Leu Arg Ser Arg 

20 25 30 

Arg Pro Gly Lys Lys Arg Ser Lys Lys Gly Gly Asp Gly Ala Ala Gly 

35 40 45 

Asp Pro Lys Lys Lys Lys Lys 
50 55 



<210> 830 
<211> 55 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 26 

<223> Xaa = His, Gin 



<400> 830 

Met Pro Lys Gly Gly Arg Lys Gly Gly His Lys Gly Arg Ala Arg Gin 

15 10 15 

Tyr Thr Ser Pro Glu Glu He Asp Ala Xaa Cys Arg Leu Arg Ser Arg 

20 25 30 

Arg Pro Gly Lys Lys Arg Ser Lys Lys Gly Gly Asp Gly Ala Ala Gly 

35 40 45 

Asp Pro Lys Lys Lys Lys Lys 
50 55 



<210> 831 
<211> 86 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 9,72 

<223> Xaa = Gly,Arg,Trp 



<220> 

<221> UNSURE 
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<222> 67 




















<223> Xaa = 


Lys, 


Met, 


Arg, 


Thr 












<220> 




















<221> UNSURE 


















<222> 2 




















<223> Xaa « 


Phe, 


Leu 
















<220> 




















<221> UNSURE 


















<222> 85 




















<223> Xaa = 


Ser, Tyr 
















<400> 831 




















Met Xaa Leu 


Val 


Asp Trp 


Phe 


Tyr Xaa 


Val 


Leu Ser 


Ser Leu Gly Leu 


1 




5 








10 




15 




Trp Gin Lys 


Glu 


Ala 


Lvs 


He 


Leu Phe 


Leu Gly Leu 


Asp Asn Ala 


Gly 




20 








25 






30 




Lys Thr Thr 


Leu 


Leu 


His 


Met 


Leu Lys 


Asp Glu Arg 


Leu Val Gin 


His 


35 










40 






45 




Gin Pro Thr 


Gin 


Tyr 


Pro 


Thr 


Ser Glu 


Glu 


Leu Ser 


He Gly Arg 


He 


50 








55 






60 






Lys Phe Xaa 


Ala 


Phe 


Asp 


Leu 


Xaa Gly His Gin He 


Ala Arg Arg Val 


65 






70 








75 




80 


Trp Lys Asp 


Tyr 


Xaa 


Ala 


















85 
















<210> 832 




















<211> 131 




















<212> PRT 




















<213> Homo sapiens 
















<220> 




















<221> UNSURE 


















<222> 129 




















<223> Xaa = 


Leu, 


Val 
















<400> 832 




















Met Ser Pro 


Leu 


Glu 


Cys 


Ser 


Glu Cys 


Phe 


Gly Asp 


Gin Leu Leu 


His 


1 




5 








10 




15 




Arg Thr Tyr 


Thr 


Trp Gin 


Leu 


Thr Leu 


His 


Ser Arg 


Pro Asn Tyr 


Thr 




20 








25 






30 




Arg Lys Arg 


Asp 


Thr Arg 


Ser 


Glu Ser 


Leu 


Glu He 


Pro He Ser 


Val 


35 










40 






45 




Val Leu Pro 


Gin 


Arg Gly Thr 


Ala Glu 


Pro 


Phe Pro 


Arg Leu His 


Asn 


50 








55 






60 






Leu Tyr Ser 


Thr 


Pro Arg 


Cys 


Ala Gin 


Gin 


Ala Ala 


Leu Pro Arg 


Leu 


65 






70 








75 




80 


Ser Arg Arg 


Met 


Ala 


Ser 


Gin 


His Ser 


Tyr 


Pro Leu 


Asn Arg Phe 


Ser 






85 








90 




95 




Ser Val Pro 


Leu 


Asp 


Pro 


Met 


Glu Arg 


Pro 


Met Ser 


Gin Ala Asp 


Leu 




100 








105 






110 




Glu Leu Asp 


Tyr 


Asn 


Pro 


Pro 


Arg Val 


Gin 


Leu Ser 


Asp Glu Met 


Phe 


115 










120 






125 





Xaa Phe Gin 
130 

<210> 833 
<211> 99 
<212> PRT 

<213> Homo sapiens 



<220> 



wo 02/083898 

<221> UNSURE 

<222> 32 

<223> Xaa « * ,Lys,Leu,Met 
<220> 

<221> UNSURE 

<222> 26 

<223> Xaa = Asp,Glu 

<220> 

<221> UNSURE 

<222> 70,78 

<223> Xaa « His, Gin 

<220> 

<221> UNSURE 
<222> 27 

<223> Xaa = Leu,Val 



<400> 833 



Met Ala 


Thr 


Val 


Met Ala 


Ala 


Thr 


Ala Ala Glu Arg Ala 


Val 


Leu Glu 


1 






5 






10 




15 


Glu Glu 


Phe 


Arg 


Trp Leu 


Leu 


His 


Asp Xaa Xaa His Ala 


Val 


Leu Xaa 






20 








25 


30 




Gin Leu 


Gin 


Asp 


lie Leu 


Lys 


Glu 


Ala Ser Leu Arg Phe 


Thr 


Leu Pro 




35 








40 


45 






Gly Ser 


Gly 


Thr 


Glu Gly 


Pro 


Ala 


Lys Gin Glu Asn Phe 


He 


Leu Gly 


50 








55 




60 






Ser Cys 


Gly 


Thr 


Asp Xaa 


Val 


Lys 


Gly Val Leu Thr Leu 


Xaa 


Gly Asp 


65 






70 






75 




80 


Ala Leu 


Ser 


Gin 


Ala Asp Val 


Asn 


Leu Lys Met Pro Arg 


Asn 


Asn Gin 








85 






90 




95 


Leu Leu 


His 

















<210> 834 
<211> 96 
<212> PRT 

<213> Homo sapiens 



PCT/TBOl/00914 
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<400> 834 






















Met Val 


Ser His 


Thr Phe His 


Met 


Arg 


Thr 


Glu 


Glu 


Ser 


Asp Ala 


Ser 


1 




5 






10 










15 




Gin Glu 


Gly Asp 


Asp Leu Pro 


Lys 


Ser 


Ser 


Ala 


Asn 


Thr 


Ser 


His 


Pro 




20 






25 










30 






Lys Gin 


Asp Asp 


Ser Pro Lys 


Ser 


Ser 


Glu 


Glu 


Thr 


He 


Gin 


Pro Lys 




35 




40 










45 








Glu Gly 


Asp He 


Pro Lys Ala 


Pro 


Glu 


Glu 


Thr 


He 


Gin 


Ser 


Lys 


Lys 


50 




55 










60 










Glu Asp 


Leu Pro 


Lys Ser Ser 


Glu 


Lys 


Ala 


He 


Gin 


Pro 


Lys 


Glu 


Ser 


65 




70 








75 










80 


Asn He 


Pro Lys 


Ser Ser Ala 


Lys 


Pro 


He 


Gin 


Pro 


Lys 


Leu 


Ala 


He 






85 






90 










95 





<210> 835 
<211> 97 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 94 

<223> Xaa = Leu, Met 



wo 02/083898 



426 
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<400> 835 
















Met 


Ser 


Leu 


Leu Asp Gly 


Leu 


Ala 


Ser Ser Pro 


Arg 


Ala Pro 


Leu Gin 


1 






5 






10 






15 


Ser 


Ser 


Lys 


Ala Arg Met 


Lys 


Lys 


Leu Pro Lys 


Lys 


Ser Gin 


Asn Glu 








20 






25 




30 




Lys 


Tyr 


Arg 


Leu Lys Tyr 


Leu Arg 


Leu Arg Lys 


Ala 


Ala Lys 


Ala Thr 






35 






40 






45 




Val 


Phe 


Glu 


Asn Ala Ala 


He 


Cys 


Asp Glu He 


Ala 


Arg Leu 


Glu Glu 




50 






55 






60 






Lys 


Phe 


Leu 


Lys Ala Lys 


Glu 


Glu 


Arg Arg Tyr 


Leu 


Leu Lys 


Lys Leu 


65 






70 






75 






80 


Leu 


Gin 


Leu 


Gin Ala Leu 


Thr 


Glu 


Gly Val Ser Thr 


Gly Xaa Gin Leu 








85 






90 






95 


Leu 





















<210> 836 
<211> 107 
<212> PRT 

<213> Homo sapiens 



<400> 836 








Met 


Ala 


Gin 


Glu 


Val Ser Glu Tyr 


1 








5 




Ala 


Trp 


Val 


Glu 


Ala 


Leu Arg Cys 








20 






Arg 


His 


Arg 


Arg 


Asp 


Phe Leu Leu 






35 






40 


Ala 


Gly 


Gly 


Ala 


Ala 


Ser Ala Ser 




50 








55 


Ser 


Gly 


Thr 


Arg 


Asn 


Arg Gin Leu 


65 










70 


Ala 


Trp 


Ala 


Asn 


His 


Val Phe Leu 










85 




Met 


Asp 


Lys 


He 


Leu 


Ser Met Ala 








100 







Leu Ser Gin Asn Pro Arg Val Ala 

10 15 
Asp Gly Glu Thr Asp Lys His Trp 
25 30 
Arg Asn Ala Gly Asp Leu Ala Pro 
45 

Thr Asp Glu Ala Ala Asp Ala Glu 
60 

Gin Gin Leu He Ser Phe Ser Met 

75 80 
Gly Cys Arg Tyr Pro Gin Lys Val 

90 95 
Glu Gly He 
105 



<210> 837 
<211> 65 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 20 

<223> Xaa = Pro, Arg 
<400> 837 

Met Arg Leu Leu Gin Pro Arg Ser Arg Leu Gin Arg Thr Ser Gin Lys 

15 10 15 

Trp Leu Phe Xaa Leu His Gly Asp Glu Ser Leu Ala Pro Lys His Pro 

20 25 30 

Gly Ser Arg Lys Asn Met Asp Cys Tyr Trp Gin Asn Thr Ser Val His 

35 40 45 

Cys Arg Arg Thr Ser Leu Trp Asn Leu His Leu Pro Gly Lys Thr Leu 
50 55 60 

Gly 
65 



<210> 838 

<211> 81 

<212> PRT 

<213> Homo sapiens 



wo 02/083898 



427 



PCT/IBOl/00914 



<220> 

<221> UNSURE 
<222> 10 

<223> Xaa = * ,Trp 
<220> 

<221> UNSURE 
<222> 9 

<223> Xaa » Ala, Pro 
<220> 

<221> UNSURE 
<222> 16 

<223> Xaa = Arg,Ser 
<220> 

<221> UNSURE 
<222> 12 

<223> Xaa = Gly, Trp 
<220> 

<221> UNSURE 
<222> 8 

<223> Xaa = Lys,Arg 



<400> 838 


















Met 


Leu 


Ala Arg Gly Leu Val Xaa 


Xaa Xaa Asn Xaa Gly 


Asn 


Arg 


Xaa 


1 






5 






10 






15 




Pro 


Val 


Phe Thr 


He 


Ser 


Phe His 


His Leu Pro Thr 


Trp 


Asn 


Leu 


Tyr 






20 








25 




30 






He 


Ser 


Pro Ser 


Leu 


Ser 


Phe Leu 


He Cys Glu Thr 


Ala 


Ala 


Asn 


Thr 






35 






40 




45 








Gly 


Phe 


Gin Arg 


Val 


Val 


Lys Asn 


Pro Cys Glu His 


Leu 


Leu 


Gin 


Leu 


50 








55 


60 










Pro 


Ser 


Thr Gin 


Asn 


Arg 


Leu Phe 


His Trp Arg Arg 


Arg 


Arg 


Asp 


Gly 


65 








70 




75 








80 


Arg 























<210> 839 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 47 

<223> Xaa - Phe, Ser 

<400> 839 

Met Val Arg Pro Arg Arg Ala Pro 
1 5 
Leu Gly Gly Arg Gly Arg Pro Pro 
20 

Arg Ser Arg Ser Trp Pro Ala Ser 
35 40 

Asp Pro 
50 



Tyr Arg Ser Gly Ala Gly Gly Pro 

10 15 
Arg Pro Leu Val Val Arg Ala Val 
25 30 
Pro Pro Arg Pro Ala Ala Xaa Ala 
45 



<210> 840 
<211> 85 
<212> PRT 

<213> Homo sapiens 
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<400> 840 



Met Pro Ser Phe Val 


Asp Arg 


He 


Phe Gly Gly Glu 


Leu 


Thr Ser Met 


1 5 






10 




15 


lie Met Tyr Asp Gin 


Cys Arg 


Thr 


Val Ser Leu Val 


His 


Glu Ser Phe 


20 






25 




30 


Leu Asp Leu Ser Leu 


Pro Val 


Leu Asp Asp Gin Ser 


Gly Lys Lys Ser 


35 




40 




45 




Val Asn Asp Lys Asn 


Leu Lys 


Lys 


Thr Val Glu Asp 


Glu Asp Gin Asp 


50 


55 




60 






Ser Glu Glu Glu Lys 


Asp Asn 


Asp 


Ser Tyr He Lys 


Glu Arg Ser Asp 




f\J 




75 




80 


lie Pro Ser Gly Thr 












85 












<210> 841 












<211> 66 












<212> PRT 












<213> Homo sapiens 












<220> 












<221> UNSURE 












<222> 59 












<223> Xaa = Lys, Asn 












<400> 841 












Met Ala Ser He Phe 


Ser Lys 


Leu 


Leu Thr Gly Arg 


Asn 


Ala Ser Leu 


1 5 






10 




15 


Leu Phe Ala Thr Met 


Gly Thr 


Ser 


Val Leu Thr Thr 


Gly Tyr Leu Leu 


20 






25 




30 


Asn Arg Gin Lys Val 


Cys Ala 


Glu 


Val Arg Glu Gin 


Pro 


Arg Leu Phe 


35 




40 




45 




Pro Pro Ser Ala Asp 


Tyr Pro 


Asp 


Leu Arg Xaa Thr 


Thr 


Thr Ala Trp 


50 


55 




60 






Pro Ser 












65 












<210> 842 












<211> 74 












<212> PRT 












<213> Homo sapiens 












<400> 842 












Met Ala Ala Ala Ala 


Gly Leu 


Gly Leu Gin Arg Glu 


Ala 


Ala Glu Gly 


1 5 






10 




15 


Lys Trp Ser Ala Trp 


Arg Met 


Thr Glu Glu Thr Arg 


He 


Val Tyr Trp 


20 






25 




30 


He Lys Asp Arg Gin 


Leu Thr 


Asn 


Arg Asp Ser Thr 


He 


Leu Glu Leu 


35 




40 




45 




Gin Lys Val Leu Lys 


Thr Cys 


Trp Phe Val Cys Trp 


Ser 


He Phe Gin 








60 






Leu Lys Phe Leu Phe 


Leu Phe 


Phe 


Phe Phe 






65 


70 










<210> 843 












<211> 68 












<212> PRT 












<213> Homo sapiens 












<400> 843 












Met Lys Lys Asn He 


Trp Leu 


Leu 


Ser Gly Ser Arg 


Gin 


Gin Tyr Leu 


1 5 






10 




15 


He Ala Leu Trp Arg 


Leu Leu 


Leu 


Leu Glu He Tyr 


He 


Ser Glu Gly 


20 






25 




30 
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Thr His Ser Thr Glu Glu Asp lie Asn Lys Gin lie Asn Asp Lys Glu 

35 40 45 

Arg Val Ala Ala Ala Met Glu Asn Pro Asn Leu Arg Glu lie Val Asp 

50 55 60 

Ser Val Ser Leu 
65 



<210> 844 
<211> 53 
<212> PRT 

<213> Homo sapiens 



<400> 844 

Met Phe Pro Pro Pro Ala Ala Ser 

1 5 
Arg Asp His Pro Pro Val Gly Pro 
20 

lie Thr Leu Thr Thr Ala Glu Leu 

35 40 
Leu Ala Trp Thr Pro 
50 



Leu Cys Ser His Val Gly Cys Ser 

10 15 
Gly Gin Thr Asn Arg Lys Gly Pro 
25 30 
Thr Ser Thr Thr Val Ser Val Ala 
45 



<210> 845 
<211> 73 
<212> PRT 

<213> Homo sapiens 



<400> 845 

Met Cys Pro Gly Asn Trp Leu Trp Ala Ser Met Thr Phe Met Ala Arg 

15 10 15 

Phe Ser Arg Ser Ser Ser Arg Ser Pro Val Arg Thr Arg Gly Thr Leu 

20 25 30 

Glu Glu Met Pro Thr Val Gin His Pro Phe Leu Asn Val Phe Glu Leu 

35 40 45 

Glu Arg Leu Leu Tyr Thr Gly Lys Thr Ala Cys Asn His Ala Asp Glu 

50 55 60 

Val Trp Pro Gly Phe Tyr Leu Gly Asp 
65 70 



<210> 846 
<211> 143 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 109 

<223> Xaa = Lys, Arg 



<400> 846 



Met Thr Ser 


Gly 


Ser 


Lys 


Cys 


Pro 


1 




5 








Tyr lie Ala 


Thr 
20 


Phe 


Lys 


Gly 


Ser 


Gin Asn Pro 


He 


Gin 


Ser 


Ser 


Ser 


35 










40 


Thr Leu Gin 


Arg 


Asn 


Gly 


Leu 


Met 


50 








55 




Tyr Val Asn 


Leu 


Ala 


Leu 


Lys 


Asn 


65 






70 






Leu Gly Ser 


Gly 


Ala 
85 


Phe 


Glu 


Ala 


Phe Asn Asp 


Asn 


Ala 


Trp 


His 


Asp 



Ser Thr Asp Ser Gly Lys Glu Glu 

10 15 

Glu Tyr Phe Cys Tyr Asp Leu Ser 

25 30 

Asp Glu He Thr Leu Ser Phe Lys 
45 

Leu His Thr Gly Lys Ser Ala Asp 
60 

Gly Ala Val Ser Leu Val He Asn 

75 80 

Leu Val Glu Pro Val Asn Gly Lys 

90 95 

Val Lys Val Thr Xaa Asn Leu Arg 
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100 

Gin Val Thr lie Ser Val Asp Gly 
115 120 
Gin Glu Asp Tyr Thr Met Leu Gly 
130 135 

<210> 847 
<211> 112 
<212> PRT 

<213> Homo sapiens 



<400> 847 



Met 


Gin Leu 


Tyr 


Leu Tyr 


Ser 


Met 


1 






5 






Glu 


Gin Ala 


Arg 


Thr Leu 


Glu Arg 






20 








Ala 


lie Arg 


Ser 


Asp Arg 


Cys 


Val 




35 








40 


Ala 


Asn 'Asn 


He 


Ser Phe 


Tyr 


Leu 




50 






55 




Met 


Leu Arg 


Phe 


Ser Ser 


Ala 


Ser 


65 






70 






Pro 


Leu Leu 


Leu 


Leu Phe 


Leu 


Leu 








85 






Met Arg Met 


Lys 


Thr Phe 


Met 


Met 






100 









<210> 848 
<211> 54 
<212> PRT 

<213> Homo sapiens 
<400> 848 

Met He Gin Arg Asn Leu Ser Asn 
1 5 
Met Asp Ala Asp Met Asp Ala Val 
20 

Arg Asn Ala Arg Cys Asp Pro Ala 

35 40 
He Phe Ser Thr His Ser 
50 

<210> 849 
<211> 138 
<212> PRT 

<213> Homo sapiens 



<400> 849 



Met 

1 


Lys 


Lys 


Leu 


Lys 
5 


Ala 


Arg 


Met 


Asp 


Lys 


Met 


Gin 


Asn 


Ala 


Glu 


Ser 








20 










Leu 


Ala 


Glu 


Leu 


Glu Asp 


Asp 


Phe 






35 










40 


Ala 


Ala 


Tyr 


Tyr 


Glu 


Glu 


Gin 


His 




50 










55 




Lys 


Glu 


Arg 


Asp 


Gly Leu 


Arg 


Cys 


65 










70 






Asp 


Val 


Glu 


Asp 


Pro 


Ala 


Thr 


Glu 










85 








Lys 


Val 


Met 


Arg 


Trp 


Phe 


Gin 


Ala 








100 










Trp 


His 


Gly 


Val 


Leu 


Ala 


Trp Val 



105 






110 




He 


Leu 


Thr Thr Thr 


Gly 


Tyr Thr 






125 






Ser 


Asp Asp Phe Phe 


Tyr 


Val 






140 






Ser 


Thr 


Ala He Met 


Gly 


Leu Asp 




10 






15 


Glu 


Glu 


Lys Lys Glu 


Asn 


Ser Gly 


25 






30 




Lvs 


Ala 


Ser Phe Gin 


Gin 


Gly Leu 






45 






Cys 


Pro 


Pro Gly Tyr 


Ser 


Pro Arg 






60 






Ala 


Ala 


Leu Arg Gin 


Gin 


Asp Gin 






75 




80 


Leu 


Ser 


Leu Leu Asn 


Val 


Lys Arg 




90 






95 


Thr 


Arg 


He His Leu 


Met 


Asn Ser 


105 






110 




Asp 


Asp Asn Ser Ser 


Pro 


Ala Cys 




10 






15 


Ser 


His 


Ser Tyr His 


Trp 


Met Ser 


25 






30 




Ser 


Leu 


Tyr Ser Thr 


Leu 


Ser Ser 






45 






His 


Gin 


Ala He Glu 


Arg 


Phe Tyr 




10 






15 


Gly 


Arg 


Gly Gin Val 


Met 


Ser Ser 


25 






30 




Lvs 


Glu Gly Tyr Leu 


Glu 


Thr Val 






45 






Pro 


Glu 


Leu Thr Pro 


Leu 


Leu Glu 






60 






Arg 


Gly Asn Arg Ser 


Pro 


Val Pro 






75 




80 


Glu 


Pro Gly Glu Ser 


Phe 


Cys Asp 




90 






95 


Met 


Leu Gin Arg Leu 


Gin 


Thr Trp 


105 






110 




Lys 


Glu 


Lys Val Val 


Ala 


Leu Val 
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115 120 125 

His Ala Val Gin Ala Leu Trp Lys Gin Phe 
130 135 

<210> 850 

<211> 70 

<212> PRT 

<213> Homo sapiens 

<400> 850 



Met lie Cys Thr Phe 


Leu Arg 


Ala 


Val 


Gin Tyr 


Thr 


Glu Lys 


Leu 


His 


1 5 










10 










15 




Arg Ser Ser Ala Lys 


Aro 


Leu 


Leu 


Leu 


Pro 


Tyr 


He 


Val 


Leu 


Asn 


Lys 


20 








25 










30 






Ala Cys Leu Lys Thr 


Glu 


Pro 


Ser 


Leu Arg 


Cys 


Giy Leu Gin 


Tyr 


Gin 


35 






40 










45 








Lys Lys Thr Leu Arg 


Pro 


Arg 


Cvs 


He Leu Gly 


Val 


Thr 


Gin 


Lys 


Thr 


50 




55 










60 










lie Trp Thr Gin Gly 


Pro 






















65 


70 






















<210> 851 
























<211> 152 
























<212> PRT 
























<213> Homo sapiens 
























<400> 851 
























Met Asn Val Thr E^ro 


Glu 


Val 


Lys 


Ser Arg Gly 


Met 


Lys 


Phe 


Ala 


Glu 


1 5 










10 










15 




Glu Gin Leu Leu Lys 


His 


Gly 


Trp 


Thr 


Gin 


Gly 


Lys 


Gly 


Leu 


Gly Arg 


20 








25 










30 






Lys Glu Asn Giy lie 


Thr 


Gin 


Ala 


Leu 


Arg 


Val 


Thr 


Leu 


Lys 


Gin Asp 


35 






40 










45 








Thr His Gly Val Gly 


His 


Asp 


Pro 


Ala 


Lys 


Glu 


Phe 


Thr 


Asn 


His 


Trp 


50 




55 










60 










Trp Asn Glu Leu Phe 


Asn 


Lys 


Thr 


Ala 


Ala 


Asn 


Leu 


Val 


Val 


Glu 


Thr 


65 


70 










75 










80 


Gly Gin Asp Gly Val 


Gin 


lie Arg 


Ser 


Leu 


Ser 


Lys 


Glu 


Thr 


Thr Arg 


85 










90 










95 




Tyr Asn His Pro Lys 


Pro 


Asn 


Leu 


Leu 


Tyr 


Gin 


Lvs 


Phe 


Val 


Lys 


Met 


100 








105 










110 






Ala Thr Leu Thr Ser 


Gly Gly Glu 


Lys 


Pro 


Asn 


Lys 


Asp Leu 


Glu 


Ser 


115 






120 










125 








Cys Ser Asp Asp Asp 


Asn 


Gin 


Gly 


Ser 


Ser 


Pro 


Gin 


Arg 


Phe 


Leu 


Leu 


130 




135 










140 










Met Arg Cys Cys Ser 


Lys 


Pro 


Val 


















145 


150 






















<210> 852 
























<211> 120 
























<212> PRT 
























<213> Homo sapiens 
























<400> 852 
























Met Ala Lys Glu Glu 


Pro 


Gin 


Ser 


He 


Ser Arg 


Asp 


Leu 


Gin 


Glu 


Leu 


1 5 










10 










15 




Gin Lys Lys Leu Ser 


Leu 


Leu 


He 


Asp 


Ser 


Phe 


Gin 


Asn 


Asn 


Ser 


Lys 


20 








25 










30 






Val Val Ala Phe Met 


Lys 


Ser 


Pro 


Val 


Gly Gin 


Tyr 


Leu Asp 


Ser 


His 


35 






40 










45 








Pro Phe Leu Ala Phe 


Thr 


Leu 


Leu 


Val 


Phe 


He 


Val 


Met 


Ser 


Ala 


Val 


50 




55 










60 










Pro Val Gly Phe Phe 


Leu 


Leu 


He 


Val 


Val 


Leu 


Thr 


Thr 


Leu 


Ala 


Ala 
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65 70 
Leu Leu Gly Val lie lie Leu Glu Gly 
85 

Phe Ser Leu Leu Cys lie Leu Cys Gly 
100 105 
Met Ser Gly Met Met lie Ala Ser 
115 120 

<210> 853 
<211> 103 
<212> PRT 

<213> Homo sapiens 



PCT/IBOl/00914 

432 

75 80 
Leu Val He Ser Val Gly Gly 
90 95 
Leu Ser Phe Val Ser Leu Ala 
110 



<400> 853 








Met Asp 


Gly 


Thr 


Val Leu Cys 


Lys 


1 






5 




Gly Gin 


Glu 


Pro 


He Pro Lys 


He 






20 






Gin Met 


Glu 


Gin 


He Ser Gin 


Phe 




35 






40 


Val Arg 


Thr 


Thr 


Asp He Phe 


Gin 


50 






55 




Asp Met 


Ala 


Ala 


Val Gin Arg 


Thr 


65 






70 




Val Thr 


Lys 


Asp 


Asp Gly Cys 


Tyr 








85 




Arg Lys 


Ala 


Gin 


Gin Asn Arg 








100 







Leu He Asn Ser Leu Tyr Pro Pro 

10 15 
Ser Glu Ser Lys Met Ala Phe Lys 
25 30 
Leu Lys Ala Ala Glu Thr Tyr Gly 
45 

Thr Val Asp Leu Trp Glu Gly Lys 
60 

Leu Met Ala Leu Gly Ser Val Ala 

75 80 
Arg Gly Glu Pro Ser Trp Phe His 
90 95 



<210> 854 
<211> 78 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 68 

<223> Xaa » Gly, Arg 
<220> 

<221> UNSURE 
<222> 76 

<223> Xaa = Ly3,Asn 
<400> 854 

Met Pro Leu Pro Arg Lys Asp Asn Pro Gin Cys Ser Pro Leu Leu Pro 

15 10 15 

Pro Leu Ala Ser Val Gly Lys Lys Met Ala Lys Lys He Ala Ala Ala 

20 25 30 

Pro Pro Pro Pro Pro Pro Pro Pro Ser Glu Arg Arg Gly Lys Arg Ala 

35 40 45 

Glu Gin Lys Gly Ala Val Gly Gin Arg Pro Asp Trp He Phe Ser Pro 

50 55 60 

Gin Pro Gin Xaa Met Phe Leu Met Thr Gin Gly Xaa Leu Val 
65 70 75 



<210> 855 
<211> 136 
<212> PRT 

<213> Homo sapiens 



<400> 855 

Met Ser Glu Pro Gly Lys Leu Ser Gin Lys He Lys Val Trp Leu Gin 
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1 




5 






Glu 


Tyr 


Trp Asn He 


Thr 


Asp Leu 






20 






Gly Ala 


He Leu Arg 


Leu 


Gin Asn 






35 




40 


Val 


He 


Tyr Cys Val 


Asp 


He He 




50 






55 


He 


Phe 


Gly Val Asn 


Lys 


Tyr Leu 


65 






70 




Lys 


Met 


Met He Asp 


Met 


Leu Tyr 






85 






Leu 


Met 


Ser Phe Gly 


Val Ala Arg 






100 






Lys 


Pro 


Ser Trp Lys 


Leu 


Ala Arg 






115 




120 


Met 


He 


Tyr Gly Glu 


Val 


Leu Gin 




130 






135 



<210> 856 
<211> 118 
<212> PRT 

<213> Homo sapiens 



<400> 856 



Met Val 


Arg 


Gly 


Trp 


Ala Pro Thr 


1 






5 




Val Trp 


Thr 


Leu 


Thr 


Gly Thr Pro 






20 






Thr Thr 


Ala 


Val 


Val 


Ser Gly Thr 




35 






40 


Val Asn 


Val 


Lys 


Glu 


His Ser Arg 


50 








55 


Arg Asp 


Met 


Thr 


Met 


Phe Val Thr 


65 








70 


Leu Phe 


Asp 


Ser 


Thr 


Thr Leu Glu 








85 




Arg Pro 


Val 


Asn 


Ser 


Ala Ala Leu 






100 






Leu Gly 


Gly 


Gly 


Gin 


Glu 




115 









<210> 857 
<211> 122 
<212> PRT 

<213> Homo sapiens 
<400> 857 



Met 


Arg 


Pro 


Ser 


Cys 


Ala Pro Gly 


1 








5 




Tyr 


Gly 


Tyr 


Phe 


Arg 


Lys Leu Arg 








20 






Cys 


Gin 


Ser 


Tyr 


Gly 


Asn Gly Ala 






35 






AO 


Lys 


Glu 


Ala 


Ser 


Thr 


He Ala Glu 




50 








55 


Gin 


Pro 


He 


Trp 


He 


Gly Leu His 


65 










70 


Gin 


Trp 


He 


Asp 


Gly 


Ala Met Tyr 










85 




Ser 


Met 


Gly 


Gly 


Asn 


Lys His Cys 








100 






Phe 


Leu 


Thr 


Trp 


Ser 


Ser Asn Glu 






115 






120 



433 





10 






15 




Val 


Ala 


He Ser Thr 


Phe 


Met. 


He 


25 






30 






Gin 


Pro 


Tyr Met Gly 


Tyr 


Gly 


Arg 






45 








Phe 


Trp Tyr He Arg 


Val 


Leu 


Asp 






60 








Gly 


Pro Tyr Val Met 


Met 


He 


Gly 






75 






80 


Phe 


Val 


Val He Met 


Leu 


Val 


Val 




90 






95 




Gin 


Ala 


He Leu His 


Pro 


Glu 


Glu 


105 






HO 






Asn 


He 


Phe Tyr Met 


Pro 


Tyr 


Trp 






125 








Trp 


Ala 


He Pro Glu 


Leu 


Cys 


Gly 




10 






15 




Ser 


Met 


Ser Ser Leu 


Ala 


Gin 


Leu 


25 






30 






Val 


Lys 


Gin Ser Gly 

A C 


Glu 


Val 


Leu 


Gin 


He Asn Asp He 


Gin 


Leu 


Ser 






60 








Ala 


Ser 


Lys Asp Asn 


Thr 


Ala 


Lys 






75 






80 


His 


Gin 


Lys Thr Phe 


Arg 


Thr 


Glu 




90 






95 




Ser 


Pro 


Asn Tyr Asp 


His 


Val 


Val 


105 






110 






Trp 


Phe 


Tyr His Lys 


Ser 


Asn 


Cys 




10 






15 




Asn 


Trp Ser Asp Ala 


Glu 


Leu 


Glu 


25 






30 






His 


Leu 


Ala Ser He 


Leu 


Ser 


Leu 






45 








Tyr 


He Ser Gly Tyr 


Gin 


Arg 


Ser 






60 








Asp 


Pro 


Gin Lys Arg 


Gin 


Gin 


Trp 






75 






80 


Leu 


Tyr Arg Ser Trp 


Ser 


Gly 


Lys 




90 






95 




Ala 


Glu 


Met Ser Ser 


Asn 


Asn 


Asn 


105 






110 






Cys 


Asn 
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<210> 858 
<211> 107 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -23. .-1 

<400> 858 



Met Leu 


Leu Gin Ser 


Gly Leu Ser 


Phe Arg 


Arg 


Leu 


Leu Leu Leu 


Ser 




-20 




-15 






-10 




Leu Val 


Ser Gly Ala 
-5 


Leu Gly Leu 

1. 


Gly Gly 


Ala 


Val 
5 


Leu Gly Val 


Gly 


Leu Ser 


Leu Gly Pro 


Val Pro Leu 


Thr Pro 


Trp 


Val 


Phe Gly Val 


Thr 


10 




15 




20 






25 


Ala Gly 


Val Phe Leu 


Tyr Val Ala 


Leu Val 


Asp Met 


Leu Pro Ala 


Leu 




30 




35 






40 




Leu Arg 


Pro Pro Glu 


Pro Leu Pro 


Thr Pro 


His 


Val 


Leu Leu Gin Gly 


45 




50 






55 




Leu Gly 


Leu Leu Leu 


Gly Gly Gly Leu Met 


Leu 


Ala 


He Thr Leu 


Leu 




60 


65 








70 




Glu Glu 


Arg Leu Leu 


Pro Val Thr 


Thr Glu 


Gly 









75 80 



<210> 859 

<211> 92 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -24. .-1 

<220> 

<221> UNSURE 
<222> 52 

<223> Xaa = Arg, Thr 
<400> 859 



Met 


Val 


Met 


Gly Leu 


Gly Val Leu Leu Leu 


Val Phe Val Leu Gly 


Leu 








-20 


-15 


-10 




Gly 


Leu 


Thr 


Pro Pro 
-5 


Thr Leu Ala Gin Asp 
1 


Asn Ser Arg Tyr Thr 
e; 


His 


Phe 


Leu 


Thr 


Gin His 


Tyr Asp Ala Lys Pro 


Gin Gly Arg Asp Asp Arg 




10 






15 


20 




Tyr Cys 


Glu 


Ser He 


Met Arg Arg Arg Gly 


Leu Thr Ser Pro Cys 


Lys 


25 








30 


35 


40 


Asp 


He 


Asn 


Thr Phe 


lie His Gly Asn Lys 


Arg Xaa Ser Arg Pro 


Ser 






45 


50 


55 




Val 


Lys 


Thr 


Arg Met 


Glu Thr Leu Thr Glu 


Lys Thr 










60 


65 







<210> 860 
<211> 80 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -32. .-1 
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<220> 

<221> UNSURE 
<222> 44 

<223> Xaa = Ala, Pro 
<400> 860 



Met Met 


Leu 


Val 


Gly 


Phe Leu 


Gly Cys Cys Gly Ala 


Val Gin 


Glu Ser 




-30 








-25 




-20 




Gin Cys 


Met 


Leu 


Gly Leu Phe 


Phe Gly Phe Leu 


Leu 


Val He 


Phe Ala 


-15 








-10 




-5 






He Glu 


He 


Ala 


Ala 


Ala He 


Trp Gly Tyr Ser 


His 


Lys Asp Glu Val 


1 






5 




10 






15 


He Lys 


Glu 


Val 


Gin 


Glu Phe 


Tyr Lys Asp Thr 


Tyr 


Asn Lys 


Leu Lys 






20 






25 




30 




Thr Lys Asp Glu 


Pro Gin Arg 


Glu Thr Leu Lys 


Xaa 


Ser Thr Met Arg 




35 








40 




45 




<210> 861 
















<211> 166 
















<212> PRT 
















<213> Homo sapiens 












<220> 


















<221> SIGNAL 














<222> -15..- 


-1 














<220> 


















<221> UNSURE 














<222> 142 
















<223> Xaa = 


Ser, 


Thr 












<400> 861 
















Met Leu 


Pro 


Leu 


Leu 


He He 


Cys Leu Leu Pro 


Ala 


He Glu Gly Lys 


-15 








-10 


-5 






1 


Asn Cys 


Leu 


Arg 


Cys 


Trp Pro 


Glu Leu Ser Ala 


Leu 


He Asp Tyr Asp 






5 






10 




15 




Leu Gin 


He 


Leu 


Trp 


Val Thr 


Pro Gly Pro Pro 


Thr 


Glu Leu 


Ser Gin 




20 








25 




30 




Asn Arg 


Asp 


His 


Leu 


Glu Glu 


Glu Thr Ala Lys 


Phe 


Phe Thr 


Gin Val 


35 








40 




45 






His Gin 


Ala 


He 


Lys 


Thr Leu 


Arg Asp Asp Lys 


Thr 


Val Leu 


Leu Glu 


50 








55 


60 






65 


Glu He 


Tyr 


Thr 


His 


Lys Asn 


Leu Phe Thr Glu Arg 


Leu Asn 


Lys He 








70 




75 






80 


Ser Asp 


Gly 


Leu 


Lys 


Glu Lys 


Asp He Gin Ser 


Thr 


Leu Lys 


Val Thr 






85 






90 




95 




Ser Cys 


Ala 


Asp Cys 


Arg Thr 


His Phe Leu Ser "Cys 


Asn Asp 


Pro Thr 




100 








105 




110 




Phe Cys 


Pro 


Ala 


Arg Asn Arg 


Arg Thr Ser Leu 


Trp 


Ala Val 


Ser Leu 


115 








120 




125 






Ser Ser 


Ala 


Leu 


Leu 


Leu Ala 


He Ala Gly Asp Val 


Xaa Phe 


Thr Gly 


130 








135 


140 






145 


Lys Gly 


Arg 


Arg 


Arg 

1 crt 


Gin 











150 



<210> 862 
<211> 54 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -26. .-1 
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<400> 862 




















Met Val Lys 


Val 


Met Trp 


Val 


Ser 


Val 


Leu Ala Leu Ala 


Ala 


Ala 


lie 


-25 






-20 






-15 








Leu Leu Leu 


Thr 


Val Pro 


Val 


Ala 


Glu 


Gly Val Thr Cys 


Ser 


Pro 


Met 


-10 




-5 








1 




5 




Gin Leu Ala 


Ser 


Cys Ala 


Ala 


Ala 


Met 


Thr Ser Ser Ser 


Pro 


Pro 


Ser 




10 






15 




20 






Glu Ala Val 


Ala 


Gin Ser 

















25 



<210> 863 
<211> 55 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -26.. -1 

<400> 863 



Met Lys Pro 


Val 


Ser Arg 


Arg 


Thr 


Leu 


Asp Trp 


He 


Tyr Ser Val 


Leu 


-25 






-20 








-15 






Leu Leu Ala 


He 


Val Leu 


He 


Ser 


Trp 


Gly Cys 


He 


He Tyr Ala 


Ser 


-10 




-5 








1 




5 




Met Val Ser 


Ala 


Arg Arg 


Gin 


Leu 


Arg 


Lys Lys 


Tyr 


Pro Asp Lys 


He 




10 








15 






20 




Phe Gly Thr 


Asn 


Glu Asn 


Leu 















25 



<210> 864 
<211> 101 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -22. .-1 

<400> 864 



Met 


Arg 


Thr 


Leu 


Ala He 


Leu Ala Ala 


He 


Leu 


Leu 


Val 


Ala 


Leu 


Gin 






-20 






-15 








-10 








Ala 


Arg 


Leu 


Ser 


His Ser 


Arg Gin Glu 


Leu 


Met 


Arg 


Leu 


Leu 


Gin 


Pro 




-5 








1 




5 










10 


Arg 


Ser 


Arg 


Leu 


Gin Arg 


Thr Ser Gin 


Lys 


Trp 


Leu 


Phe 


Pro 


Leu 


His 










15 




20 










25 




Gly Thr 


Lys 


Ala 


Trp Leu 


Gin Ser He 


Gin Ala Gin Gly Lys 


Thr 


Trp 








30 




35 










40 






Pro 


Ala 


He 


Ala 


Glu Tyr 


Gin Arg Ala 


Leu 


Gin 


Glu 


Asn 


Val 


Ala 


Met 






45 






50 








55 








Glu 


Pro 


Ala 


Ser 


Thr Arg 


Glu Asp Ser 


Gly 


His 


Ser 


Ala 


Ala 


Glu 


Leu 




60 








65 






70 










Ala 


Glu 


Lys 


Glu 


Lys 



















75 

<210> 865 

<211> 57 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -21.. -1 
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<400> 865 

Met Val Leu Leu Ser Ser Thr Met Ser Val Leu lie Phe Cys 

-20 -15 -10 

Asn Leu Ala lie Ser Asp Arg Arg Val Met Lys Ser Pro Thr 
-5 1 5 

Val lie His Leu Phe Phe Pro His Ser Ser Thr Ser lie Cys 

15 20 25 

Asn Leu Val Thr Leu Leu Leu Gly Thr 
30 35 

<210> 866 
<211> 129 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -26. .-1 



<400> 866 



Met Val 


Ser 


Arg Met 


Val 


Ser 


Thr Met Leu 


Ser Gly Leu Leu 


Phe 


Trp 


-25 








-20 




-15 






Leu Ala 


Ser 


Gly Trp 


Thr 


Pro 


Ala Phe Ala 


Tyr Ser Pro Arg 


Thr 


Pro 


-10 






-5 






1 


5 




Asp Arg 


Val 


Ser Glu 


Ala 


Asp He Gin Arg Leu Leu His Gly 


Val 


Met 






10 






15 


20 






Glu Gin 


Leu 


Gly He 


Ala 


Arg 


Pro Arg Val 


Glu Tyr Pro Ala 


His 


Gin 




25 








30 


35 






Ala Met 


Asn 


Leu Val 


Gly 


Pro 


Gin Ser He 


Glu Gly Gly Ala 


His 


Glu 


40 








45 




50 






Gly Leu 


Gin 


His Leu 


Gly 


Pro 


Phe Gly Asn 


He Pro Asn He 


Val 


Ala 


55 






60 






65 




70 


Glu Leu 


Thr 


Gly Asp Asn 


He 


Pro Lys Asp 


Phe Ser Glu Asp 


Gin 


Gly 






75 






80 




85 




Tyr Gin 


Thr 


Leu Gin 


He 


Pro 


Val Leu Leu 


Glu Lys Gin Met 


Met 


Asp 






90 






95 


100 







Val 



<210> 867 
<211> 99 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGNAL 
<222> -41. .-1 

<220> 

<221> UNSURE 
<222> -36 

<223> Xaa - Ala, CyS/ Gly, Ser, Trp 
<400> 867 

Met Ala Ala Ala Val Xaa Ala Ala Pro Gly Ala Leu Gly Ser 

-40 -35 -30 

Ala Gly Gly Ala Arg Leu Val Ala Ala Cys Ser Ala Trp Leu 
-25 -20 -15 

Gly Leu Arg Leu Pro Gly Ser Leu Ala Gly Arg Arg Ala Gly 

-5 15 
He Trp Ala Gin Gly Trp Val Pro Ala Ala Gly Gly Pro Ala 

10 15 20 

Arg Gly Tyr Ser Ser Glu Met Lys Thr Glu Asp Glu Leu Arg 



Leu Leu 

He He 

10 

He Glu 



Leu His 

Cys Pro 
-10 
Pro Ala 

Pro Lys 

Val Arg 
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25 30 35 

His Leu Glu Glu Glu Asn Arg Asp Arg Asp Lys Asn Met Val Lys Gly 
40 45 50 55 

Met Glu Met 



<210> 868 
<211> 108 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -19. .-1 



<400> 868 

Met Ala Ala Ala Thr Gly Ala Val Ala Ala Ser Ala Ala Ser Gly Gin 

-15 -10 -5 

Ala Glu Gly Lys Lys He Thr Asp Leu Arg Val lie Asp Leu Lys Ser 

15 10 
Glu Leu Lys Arg Arg Asn Leu Asp He Thr Gly Val Lys Thr Val Leu 

15 20 25 

He Ser Arg Leu Lys Gin Ala He Glu Glu Glu Gly Gly Asp Pro Asp 
30 35 40 45 

Asn He Glu Leu Thr Val Ser Thr Asp Thr Pro Asn Lys Lys Pro Thr 

50 55 60 

Lys Gly Lys Gly Lys Lys His Glu Ala Asp Glu Leu Ser Gly Asp Ala 

65 70 75 

Ser Val Glu Asp Asp Ala Phe He Lys Val Lys Cys 
80 85 

<210> 869 
<2il> 60 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -41. .-1 



<400> 869 

Met Pro Glu Ser Leu Trp Gin His 

-40 -35 
Gin Leu Lys Glu Ala Gly Phe Leu 
-25 -20 
Ser Phe Tyr Leu Ser Arg Val Asn 
-5 

Leu Pro Lys Pro Ala Arg His Leu 
10 15 



Ser His Trp Cys Gly Glu Ser Phe 
-30 

Gly Thr Leu Glu Val Phe Trp Met 
-15 -10 
Cys Thr Thr Lys Glu Thr Ser Pro 

1 5 
Gly Leu Glu Pro 



<210> 870 
<211> 60 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -45.. -1 



<400> 870 

Met His Ser Leu Phe He Ala Ser 
-45 -40 
Phe Ser Phe Arg Phe Asn Trp Phe 
-25 



Leu Lys Val Leu Phe Tyr Tyr Ser 
-35 -30 
Asp Cys Leu Leu His Asn Leu Gly 
-20 -15 
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Glu Asn Phe Leu 


Ser 


Leu 


Leu Ser 


Lys 


Ser 


Cys Ser Ala Asp 


Pro Ser 


-10 








-5 




1 




Gly Ser Thr Phe 


Met 


Arg 


Asp He 


ulu 


1 nr 


ASn lays 




5 






10 






15 




<210> 871 
















<211> 133 
















<212> PRT 
















<213> Homo sapiens 














<220> 
















<221> SIGNAL 
















<222> -13. .-1 
















<400> 871 
















Met Ala His Cys 


Val 


Thr 


Leu Val 


Gin 


Leu 


Ser He Ser Cys 


Asp His 


-10 








-5 




1 




Leu lie Asp Lys 


Asp 


He 


Gly Ser 


Lys 


Ser 


Asp Pro Leu Cys 


Val Leu 


5 






10 






15 




Leu Gin Asp Val 


Gly Gly 


Gly Ser Trp Ala 


Glu Leu Gly Arg 


Thr Glu 


20 




25 








30 


35 


Arg Val Arg Asn 


Cys 


Ser 


Ser Pro 


Glu 


Phe 


Ser Lys Thr Leu 


Gin Leu 




40 








45 




50 


Glu Tyr Arg Phe 


Glu 


Thr 


Val Gin 


Lys 


Leu 


Arg Phe Gly He 


Tyr Asp 


55 








60 




65 




lie Asp Asn Lys 


Thr 


Pro 


Glu Leu Arg Asp 


Asp Asp Phe Leu Gly Gly 


70 






75 






80 




Ala Glu Cys Ser 


Leu 


Gly 


Gin He 


Val 


Ser 


Ser Gin Val Leu 


Thr Leu 


85 






90 






95 




Pro Leu Met Leu 


Lys 


Pro 


Gly Lys 


Pro 


Ala 


Gly Arg Gly Thr 


Ser Arg 


100 




105 








110 


115 


Ser Gin Leu Arg 


Asn 
















120 














<210> 872 
















<211> 68 
















<212> PRT 
















<213> Homo sapiens 














<220> 
















<221> SIGNAL 
















<222> -47.. -1 
















<400> 872 
















Met Val Lys Ser 


Ser 


Leu 


Gin Arg 


He 


Leu 


Asn Ser His Cys 


Phe Ala 


-45 






-40 






-35 




Arg Glu Lys Glu 


Gly Asp 


Lys Pro 


Ser 


Ala 


Thr He His Ala 


Ser Arg 


-30 






-25 






-20 




Thr Met Pro Leu 


Leu 


Ser 


Leu His 


Ser Arg 


Gly Gly Ser Ser 


Ser Glu 


-15 




-10 








-5 


1 


Ser Ser Arg Val 


Ser 


Leu 


His Cys 


Cys 


Ser 


Asn Pro Gly Pro 


Gly Pro 


5 








10 




15 




Arg Trp Cys Ser 

















20 



<210> 873 
<211> 67 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SIGNAL 
<222> -31. ,-1 
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<400> 873 

Met Ser Arg Leu Gly Ser Ala Val Pro Asp Cys His lie Asn Ala Pro 

-30 -25 -20 

Glu Pro Ala Ser Trp Ala Leu Gly Arg Trp Gly Ser Cys Arg Ala Leu 
-15 -10 -5 1 

Asp Lys Gly Glu Gly Gly Arg Glu Gly Asp Pro Lys His Pro Ser Leu 

5 10 15 

Pro Gly Pro Arg Arg Gin Thr Cys Pro Met Arg Ser Thr Leu Met Asn 

20 25 30 

Cys Trp Ser 

35 

<210> 874 
<211> 81 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -19. .-1 



<400> 874 



Met 


Ala 


Ala 


Ala 


Val 


Gly Arg Leu Leu Arg Ala Ser Val Ala Arg 


His 










-15 


-10 








-5- 




Val 


Ser 


Ala 


He 


Pro 


Trp Gly He Ser Ala Thr 


Ala 


Ala 


Leu 


Arg 


Pro 








1 




5 




10 








Ala 


Ala 


Cys 


Gly 


Arg 


Thr Ser Leu Thr Asn Leu 


Leu 


Cys 


Ser 


Gly 


Ser 




15 








20 


25 










Ser 


Gin 


Ala 


Lys 


He 


He Ser Ser He Leu Trp 


He 


Ser 


Pro 


Leu 


Cys 


30 










35 40 










45 


Val 


Leu 


Gin 


Lys 


Leu 


Leu Leu Leu Val Thr Lys 


Leu 


Thr 


Asn 


Phe 


Thr 








50 


55 








60 





Met 



<210> 875 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SIGNAL 
<222> -35.. -1 

<400> 875 

Met Glu Gly Arg Ser Arg Ser Arg Lys Ala Cys Leu Glu Val Ala Thr 
-35 -30 -25 -20 

Val Asn Gin Val Phe Val Gly Leu Val Glu Val Val Ala Val Gly Val 

-15 -10 -5 

Val Arg Ser Asp Gin He Lys Asn Met Tyr Leu Lys Tyr Ser Gin His 

15 10 
Asp Leu Leu Met Asp 
15 

<210> 876 

<211> 56 

<212> PRT 

<213> Homo sapiens 

<400> 876 

Met Gin Glu Met Ser Arg Pro He Asn Lys Lys Phe Leu Ser Leu Ala 

15 10 15 

Glu Asn Gly Trp Pro His Glu Ala Thr Thr Thr Gly Gly Lys Glu Gly 
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20 25 30 

Cys Ser Thr Pro Arg Leu lie Gly Lys Pro Asn Gin lie Trp Asp Leu 

35 40 45 

Tyr Ser Val Ala Glu Ala Gly Arg 
50 55 

<210> 877 

<211> 53 
<212> PRT 

<213> Homo sapiens 
<400> 877 

Met Lys Lys Pro Gin Ser Glu Asn Phe Gly Gly Gly Pro Arg Thr Trp 

15 10 15 

He Asp Leu Leu Val Val Phe Pro Lys Val His Val Leu Glu He Phe 

20 25 30 

Ser Leu Ser Tyr Tyr Ser Leu Lys Lys Lys Met Leu Thr Val Tyr Leu 

35 40 45 

Phe Ser Ser He Leu 
50 

<210> 878 
<211> 58 
<212> PRT 

<213> Homo sapiens 
<400> 878 

Met Ser Thr Lys Arg Arg Leu Glu Glu Glu Gin Glu Pro Leu Arg Lys 

1 5 10 15 

Gin Phe Leu Ser Glu Glu Asn Met Ala Thr His Phe Ser Gin Leu Ser 

20 25 30 

Leu His Asn Asp Gin Pro Leu Leu Gin Pro His His Asp Leu Leu Pro 

35 40 45 

Ser Pro Ala Pro Thr Gin Glu Pro Leu Leu 
50 55 



<210> 879 

<211> 61 

<212> PRT 

<213> Homo sapiens 



<400> 879 

Met Ala Ser Asp Leu Asp Phe Ser Pro Pro Glu Cys Pro Ser Pro Leu 

15 10 15 

Ser Trp Arg Thr Cys Tyr Gly Thr Asp Ser Ser Trp Glu Pro Ser Ser 

20 25 30 

Ser Ser Ser Val Cys Trp Pro Ser Ser Tyr Pro Phe Pro Ser Pro Thr 

35 40 45 

Arg Arg Arg Leu Asn Arg Leu Ser Pro Glu Val Leu Arg 
50 55 60 



<210> 880 

<211> 61 

<212> PRT 

<213> Homo sapiens 



<400> 880 

Met Leu Ser Leu Asp Phe Leu Asp 
1 5 
Gin Leu Tyr Tyr Gin Val Leu Asn 
20 

Leu Met He Trp Lys Gly Leu Met 
35 40 



Asp Val Arg Arg Met Asn Lys Arg 

10 15 
Phe Gly Met He Val Ser Ser Ala 
25 30 
Val He Thr Gly Ser Glu Ser Pro 
45 
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lie Val Val Val Ser Gly Lys Met Gly lie Ser 
50 55 

<210> 881 
<211> 113 

<212> PRT 

<213> Homo sapiens 



<400> 881 



Met 


He 


Val 


Ser 


Ser 


Ala Leu 


Met 


He 


Trp Lys 


Gly Leu Met Val 


He 


1 








5 








10 






15 




Thr 


Gly 


Ser 


Glu 


Ser 


Pro He 


Val 


Val 


Val 


Leu 


Ser Gly Ser 


Met 


Glu 








20 








25 






30 






Pro 


Ala 


Phe 


His 


Arg 


Gly Asp 


Leu 


Leu 


Phe 


Leu 


Thr Asn Arg 


Val 


Glu 






35 








40 








45 






Asp 


Pro 


He 


Arg 


Val 


Gly Glu 


He 


Val 


Val 


Phe 


Arg He Glu 


Gly Arg 




50 








55 










60 






Glu 


He 


Pro 


He 


Val His Arg Val 


Leu 


Lys 


He 


His Glu Lys 


Phe 


Val 


65 










70 








75 






80 


Pro 


Tyr 


He 


Gly 


He 


Val Thr 


He 


Leu 


Met 


Asn 


Asp Tyr Pro 


Lys 


Phe 










85 








90 






95 




Lys 


Tyr 


Ala 


Val 


Leu 


Phe Leu 


Leu Gly Leu 


Phe 


Val Leu Val 


His 


Arg 








100 








105 






110 







Glu 



<210> 882 

<211> 72 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 17 

<223> Xaa = Cys,Gly, Arg, Ser 

<220> 

<221> UNSURE 
<222> 61 

<223> Xaa = His, Gin 
<400> 882 



Met 


Gly Ser Ser Arg 


Asp 


Thr 


Ser Arg 


Ser 


Leu 


Gly Gly Cys 


Ser 


Pro 


1 






5 








10 










15 




Xaa 


Ser Ser 


Leu 


Asn 


Gly 


Phe 


Gly Phe 


Lys 


Asp 


Asp 


His 


Leu Arg 


Leu 






20 








25 










30 






Leu 


Gly Cys 


His 


Ser 


Ala 


Thr Asp His 


Leu 


Gin 


Leu 


Leu 


Ala 


Thr 


Leu 




35 










40 








45 








Gly 


Phe Leu 


His 


Phe 


Cys 


Leu 


Tyr Val 


Leu 


Leu 


Leu 


Xaa 


Arg 


Leu 


Ser 




50 








55 








60 










His 


Asp He 


Tyr 


Leu 


Ser 


Val 


Met 
















65 








70 





















<210> 883 

<211> 70 

<212> PRT 

<213> Homo sapiens 

<400> 883 

Met Trp Tyr Ala Lys 
1 5 
Gin Glu Arg Ala Val 
20 

Val Gin Leu Leu Asn 



PCT/IBOl/00914 



Ser Phe 
60 



Lys Gin Gly Gin Lys Asn Lys Glu Ala Glu Arg 

10 15 
Val His Met Asp Glu Arg Arg Glu Glu Gin He 

25 30 
Ser Val Gin Ala Lys Asn Asp Lys Glu Ser Glu 
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35 

Ala Gin lie Ser Thr Pro 
50 

Gin Pro Phe Arg Ala Cys 
65 70 

<210> 884 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<400> 884 

Met Ser Gly Phe Glu Asn 
1 5 
Ser lie Asp Asp Gin Ser 
20 

Gly Pro Tyr Asn Ser Met 
35 

Leu Ser Leu Gin Thr 
50 

<210> 885 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 885 

Met lie Phe Cys Ser His 
1 5 
His Thr Pro Gly Val Cys 
20 

Arg Val Phe Thr Ser Val 
35 

Leu Phe His Leu Thr Val 
50 

<210> 886 
<211> 103 
<212> PRT. 
<213> Homo sapiens 



<400> 886 



Met 


Met 


Gly 


Ser 


Lys Met 


1 








5 


Lys 


Pro 


His 


Thr 


Pro Leu 








20 




Lys 


Pro 


Asn 


Val 


Ser Glu 






35 






Ser 


Ser 


Val 


He 


Ser Gin 




50 








Leu 


Met 


Tyr 


Gin 


Gly Pro 


65 








70 


Pro 


Gin 


Lys 


Tyr 


Arg Arg 










85 


He 


Gin 


Arg 


Gly 


Gly Pro 








100 





<210> 887 
<211> 124 
<212> PRT 

<213> Homo sapiens 



PCT/lBOl/00914 

443 





40 


45 




Asp 


Leu 


Ala Val Arg Ser Ala 


Pro Ser Gly 


55 




60 




Leu 


Asn 


Thr Asp Phe Tyr Gin 


Thr Ser Tyr 






10 


15 


Gin 


Gin 


Ser Tyr Asp Tyr Gly 


Gly Ser Gly 






25 


30 


Leu 


Ala 


Met Thr He Arg Ser 


Lys Ala Asp 




40 


45 




Lys 


Leu 


Ala Leu Leu Ser He 


Phe Pro Glu 






10 


15 


Lys 


Tyr 


Ser Leu Pro Leu Leu 


His Thr His 






25 


30 


Thr 


Phe 


Cys Ala Tyr Cys Ser 


Ala Pro Cys 




40 


45 




He 


He 






55 








Ala 


Ser 


Ala Ser Arg Val Val 


Gin Val Val 






10 


15 


He Arg 


Phe Pro Asp Arg Arg 


Asp Asn Pro 






25 


30 


Ala 


Leu 


Arg Ser Ala Gly Leu 


Pro Ser His 




40 


45 




His 


Ser 


Lys Gly Ser Lys Ser 


Pro Asp Leu 


55 




60 




Pro Asp 


Thr Ala Glu He He 


Lys Thr Leu 






75 


80 


Lys 


Leu 


Val Ser Gin Glu Glu 


Met Glu Phe 






90 


95 


Glu 









<400> 887 
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444 



Met 


Ala 


Arg 


Val 


He 


He Asp Asn Lys 


Asp Gly Pro 


Thr 


Gin 


Lys Tyr 


1 








5 




10 






15 


Leu 


Leu 


He 


Phe 


Gly 


Ala Phe Val Ser 


Val Tyr He 


Gin 


Glu 


Met Phe 








20 




25 






30 




Arg 


Phe 


Ala 


Tyr 


Tyr 


Lys Leu Leu Lys 


Lys Ala Ser 


Glu 


Gly 


Leu Lys 






35 






40 




45 






Ser 


He 


Asn 


Pro 


Gly 


Glu Thr Ala Pro 


Ser Met Arg 


Leu 


Leu 


Ala Tyr 




50 








55 


60 








Val 


Ser 


Gly 


Leu 


Gly 


Phe Gly He Met 


Ser Gly Val 


Phe 


Ser 


Phe Val 


65 










70 


75 






80 


Asn 


Thr 


Leu 


Ser 


Asp 


Ser Leu Gly Pro 


Gly Thr Val 


Gly 


He 


His Gly 










85 




90 






95 


Asp 


Ser 


Pro 


Gin 


Phe 


Phe Leu Leu Phe 


Ser Phe His 


Asp 


Ala 


Gly His 








100 




105 






110 




Tyr 


Leu 


Ala 


Ala 


Cys 


He Leu Gly His 


Cys He Phe 












115 






120 











<210> 888 

<211> 62 

<212> PRT 

<213> Homo sapiens 



<400> 888 



Met 


Ala 


Ser Ser 


Thr 


Val Pro 


Val 


Ser 


Ala Ala Gly 


Ser 


Ala 


Asn 


Glu 


1 






5 








10 






15 




Thr 


Pro 


Glu He 


Pro 


Asp Asn 


Val 


Gly 


Asp Trp Leu 


Arg 


Gly 


Val 


Tyr 






20 








25 






30 






Pro 


Leu 


Pro Leu 


He 


Gly Met 


Thr 


Ser 


Gly Glu Leu 


Asp 


Thr 


Lys 


Phe 






35 






40 






45 








Gly 


Thr 


Leu Cys 


Cys 


Gly Ser 


Leu 


Ala 


Gly Gin Glu 


Leu 


Glu 








50 






55 






60 











<210> 889 

<211> 65 

<212> PRT 

<213> Homo sapiens 



<400> 889 



Met Leu Arg His Ser 


Pro Ser Leu Trp 


Glu Leu Val 


Glu 


Glu 


His Val 


1 5 




10 






15 


Pro Leu Arg Glu Arg 


Arg Glu Val Lys 


Arg He Leu 


Gly 


Gly 


Gly Gly 


20 


25 






30 




Gly Gly Pro Glu Pro 


Gly Ala Ala Gly 


Gly Gly Gly 


Asp 


Val 


Thr Gly 


35 


40 




45 






Thr Ala Pro Arg Gly 


Ser He Leu Ser 


Ser Pro Gin 


Leu 


Pro 


Pro His 


50 


55 


60 









Leu 
65 

<210> 890 
<211> 140 
<212> PRT 
<213> Homo sapiens 

<400> 890 



Met Gly Gin Asn Arg 


Glu Met Leu 


Pro 


Phe Trp Met 


Asn 


Ser 


Thr Gly 


1 5 






10 






15 


Arg Arg Glu Gly Trp 


Gin Arg Gly 


Trp 


His Gly Tyr 


Asp 


Asn 


Glu Leu 


20 




25 






30 




Met Asp Met Arg Gly 


He Phe Leu 


Ala 


Phe Gly Pro 


Asp 


Phe 


Lys Ser 


35 


40 






45 






Asn Phe Arg Ala Ala 


Pro He Arg 


Ser 


Val Asp Val 


Tyr 


Asn 


Val Met 



50 55 60 



wo 02/083898 

Cys Asn Val Val Gly lie 
65 70 
Arg Val Met Cys Met Leu 
85 

Gly Pro Ala Thr Val Pro 
100 

Thr Asp His lie Ala Cys 

115 

Pro Pro Lys Pro Asp Asp 
130 

<210> 891 
<211> 62 
<212> PRT 

<213> Homo sapiens 
<400> 891 



Met Glu Met Ala 


Ser 


Ser 


Ala 


Gly Ser 


Trp Leu Ser Gly 


Cys 


Leu 


He 


1 


5 








10 




15 




Pro Leu Val Phe 


Leu Arg 


Leu 


Ser Val 


His Val Ser Gly 


His 


Ala 


Gly 


20 








25 




30 






Asp Ala Gly Lys 


Phe 


His 


Val 


Ala Leu 


Leu Gly Gly Thr 


Ala 


Glu 


Leu 


35 








40 


45 








Leu Cys Pro Leu 


Ser 


Leu 


Trp 


Pro Gly 


Thr Tyr Pro Arg 


Arg 






50 






55 




60 









<210> 892 
<211> 114 

<212> PRT 

<213> Homo sapiens 



PCT/IBOl/00914 

445 



Thr Pro Leu 


Pro Asn Asn Gly 


Ser 


Trp 


Ser 






75 






80 


Lys Gly Arg 


Ala 


Ala Leu Pro 


Arg 


Leu 


Ser 




90 






95 




Trp His Asp 


Ser 


Ser Leu Pro 


Ala 


Cys 


He 


105 






110 






Leu Arg Lys 


Lys 


His His Gin 


Gin 


Ser 


Gly 



120 125 



Phe His Phe Met Cys Glu 
135 140 



<400> 892 



Met 


Asp 


He 


Val 


Ala 


His 


Cys 


Glu Lys 


Ala Met Ala 


Glu 


Ser 


Val 


Lys 


1 








5 








10 






15 




Lys 


His 


Gin 


Phe 


Ser 


Thr 


Ser 


Glu Ser 


Tyr Leu Pro 


Ala 


Thr 


Leu 


He 








20 








25 






30 






Leu 


Asn 


Ser 


He 


Ser 


Phe 


Asn 


Thr Asp 


Gly Ser He 


Gin 


Tyr 


Lys 


Glu 






35 










40 




45 








Lys 


Leu 


Tyr 


Phe 


Ser 


Ala 


Ser 


Glu Ala 


Leu Asp Ala 


Tyr 


He 


Asp 


Asp 




50 










55 




60 










Phe 


His 


Leu 


Asn 


Tyr 


Glu 


Pro 


Pro Asp 


He Asp Thr Lys 


Val 


Asn 


Leu 


65 










70 






75 








80 


Asp Gin 


Ser 


Pro 


Leu 


Glu 


Phe 


Leu Ala 


Lys Ser Asn 


Ser 


Gly 


Val 


His 










85 








90 






95 




Gly 


He 


Ala 


Pro 


Gly Gin 


Ser Gly Met 


Glu Lys He 


He 


Gin 


Trp 


Tyr 








100 








105 


/■ 




110 







Ala Ala 



<210> 893 
<211> 55 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 27 

<223> Xaa = Asn, Thr 
<220> 

<221> UNSURE 
<222> 39 

<223> Xaa = Leu, Val 



wo 02/083898 
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446 

<220> 

<221> UNSURE 

<222> 34,37 

<223> Xaa ^ Lys,Asn 

<400> 893 

Met Arg Cys Arg Pro Ser Ser Val His Val Ala lie Leu Thr Glu Ala 

15 10 15 

He Pro Pro Lys Met He Ser Phe Ser Thr Xaa Ser Lys Arg He Phe 

20 25 30 

Lys Xaa Ser Pro Xaa Met Xaa Lys Ser Gin Val Cys Pro Leu He Gly 

35 40 45 

Glu He Ser Gly Ser Trp Met 
50 55 



<210> 894 
<211> 123 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 26 

<223> Xaa = His, Gin 



<400> 894 






Met 


Pro 


Lys 


Gly 


Gly Arg Lys Gly 


1 








5 


Tyr 


Thr 


Ser 


Pro 


Glu Glu He Asp 








20 




Arg 


Pro 


Gly 


Lys 


Lys Arg Ser Lys 






35 




40 


Asp 


Pro 


Lys 


Lys 


Glu Lys Lys Ser 




50 






55 


Glu 


Glu 


Asp 


Asp 


Tyr Gin Gin Lys 


65 








70 


Asp 


He 


Glu 


Asn 


Pro Asn Arg Val 










85 


Gin 


Leu 


Asp 


Leu 


Asp Gly Pro Arg 








100 




Arg 


Leu 


Arg 


Ser 


Arg Arg Gin Lys 






115 




120 



Gly 


His Lys Gly Arg 


Ala Arg Gin 




10 


15 


Ala 


Xaa Cys Arg Leu 


Arg Ser Arg 


25 




30 


Lys 


Gly Gly Asp Gly 


Ala Ala Gly 




45 




Leu 


Asp Ser Asp Glu 


Ser Glu Asp 




60 




Arg 


Lys Gly Val Glu 


Gly Leu He 




75 


80 


Ala 


Gin Thr Thr Lys 


Lys Val Thr 




90 


95 


Ser 


Phe Arg Gly Glu 


Asn Glu Lys 


105 




110 


Ser 


Val Thr 





<210> 895 
<211> 123 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 26 

<223> Xaa = His, Gin 



<400> 895 

Met Pro Lys Gly Gly Arg Lys Gly 
1 5 
Tyr Thr Ser Pro Glu Glu He Asp 
20 

Arg Pro Gly Lys Lys Arg Ser Lys 

35 40 
Asp Pro Lys Lys Glu Lys Lys Ser 

50 55 
Glu Glu Asp Asp Tyr Gin Gin Lys 



Gly His Lys Gly Arg Ala Arg Gin 

10 15 
Ala Xaa Cys Arg Leu Arg Ser Arg 
25 30 
Lys Gly Gly Asp Gly Ala Ala Gly 
45 

Leu Asp Ser Asp Glu Ser Glu Asp 
60 

Arg Lys Gly Val Glu Gly Leu He 
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PCT/IBOl/00914 



447 

65 70 75 80 



Asp lie 


Glu 


Asn Pro Asn 


Arg Val 


Ala Gin Thr Thr Lys 


Lys 


Val 


Thr 






85 


90 




95 




Gin Leu Asp Leu Asp Gly 


Pro Arg 


Ser Phe Arg Gly Glu 


Asn 


Glu 


Lys 






100 




105 


110 






Arg Leu Arg Ser Arg Arg Gin Lys 


Ser Val Thr 










115 




120 










<210> 896 














<211> 81 














<212> PRT 














<213> Homo ! 


sapiens 












<220> 
















<221> UNSURE 












<222> 78 














<223> Xaa 


Ser, Thr 












<400> 896 














Met Val Gly 


Leu Lys Lys 


Lys Ser 


Ala Asp Asp Val Lys 


Lys 


Val 


Phe 


1 




5 




10 




15 




His Met 


Leu 


Asp Lys Asp Lys Ser 


Gly Phe He Glu Glu 


Asp 


Glu 


Leu 






20 




25 


30 






Gly Phe 


He 


Leu Lys Gly 


Phe Ser 


Pro Asp Ala Arg Asp 


Leu 


Ser 


Ala 




35 




40 


45 








Lys Glu 


Thr 


Lys Met Leu 


Met Ala 


Ala Gly Asp Lys Asp 


Gly Asp Gly 


50 






55 


60 








Lys lie Gly 


Val Asp Glu 


Phe Ser 


Thr Leu Val Ala Glu 


Xaa 


Lys 


Lys 


65 




70 




75 






80 


His 
















<210> 897 














<211> 143 














<212> PRT 














<213> Homo i 


sapiens 












<400> 897 














Met Tyr 


Thr 


Thr Val Asp Ala Asn 


Gly Tyr Leu Lys Asn 


Gly Ser 


Ala 


1 




5 




10 




15 




Gly Gin 


Leu 


Ser Gin Ser 


Ser His 


Leu Ala Leu Gin Leu 


Pro 


Tyr 


Asn 






20 




25 


30 






Val Leu 


Gly Leu Gly Arg 


Ser Ala 


Asn Phe Leu Asp His 


Leu 


Tyr 


Val 




35 




40 


45 








Gly He 


Pro Arg Pro Ser 


Gly Glu 


Lys Ser He Arg Lys 


Gin 


Glu 


Trp 


50 






55 


60 








Thr Ala 


He 


He Pro Asn 


Ser Gin 


Leu He Val He Pro 


Tyr 


Pro 


His 


65 




70 




75 






80 


Asn Val 


Pro Arg Ser Trp 


Ser Ala 


Lys Leu Tyr Leu Thr 


Pro 


Ser 


Asn 






85 




90 




95 




He Val 


Leu 


Leu Thr Ala 


He Ala 


Leu He Gly Val Cys 


Val 


Phe 


He 






100 




105 


110 






Leu Ala 


He 


He Gly He 


Leu His 


Trp Gin Glu Lys Lys 


Ala 


Asp 


Asp 




115 




120 


125 








Arg Glu 


Lys Arg Gin Glu Ala His 


Arg Phe His Phe Asp 


Ala 


Met 




130 






135 


140 









<210> 898 
<211> 68 
<212> PRT 

<213> Homo sapiens 
<400> 898 

Met Met Asp Thr Met Asp Pro Phe Trp Phe Asn Ala Phe Lys Arg Thr 
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15 10 15 

Asn Thr lie Leu His His Leu Arg Met Ser Lys His Thr Asp Ala Ala 

20 25 30 

Glu Glu Val Leu Leu Glu Lys Lys Gly Cys Thr Gly Val lie Thr Leu 

35 40 45 

Asn Arg Pro Lys Phe Leu Asn Ala Leu Thr Leu Asn Met lie Arg Gin 

50 55 60 

lie Tyr His Ser 
65 



<210> 899 
<211> 74 
<212> PRT 

<213> Homo sapiens 



<400> 899 

Met Ala Ser Ser Thr Val Pro Val 
1 5 
Thr Pro Glu lie Pro Asp Asn Val 
20 

Arg Phe Ala Thr Asp Arg Asn Asp 

35 40 
Leu Gly Leu Phe Ala Ala Gly Val 

50 55 
lie Asp Leu Met Ala Pro Gin Pro 
65 70 



Ser Ala Ala Gly Ser Ala Asn Glu 

10 15 
Gly Asp Trp Leu Arg Gly Val Tyr 
25 30 
Phe Arg Arg Asn Leu lie Leu Asn 
45 

Trp Leu Ala Arg Asn Leu Ser Asp 
60 

Gly Val 



<210> 900 
<211> 81 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 19 

<223> Xaa = Arg, Ser 

<220> 

<221> UNSURE 
<222> 75 

<223> Xaa = Leu, Arg 

<220> 

<221> UNSURE 
<222> 74 

<223> Xaa = Lys, Asn 



<400> 900 

Met Gly Ala Thr Gly Pro Lys Leu 
1 5 
Gly Arg Xaa Gly Gly Ser Glu Ala 
20 

Arg Pro Arg Gly Arg Ala Val Pro 

35 40 
Ser Met Phe Tyr Asp Glu Asp Gly 

50 55 
Glu Thr He Val Thr Lys Asn Gly 
65 70 
Ser 



Gly Ala Cys Gly Leu Arg Leu Gly 

10 15 
Ala Gly Ala Glu Gin Ala Leu Val 
25 30 
Pro Phe Val Phe Thr Arg Arg Gly 
45 

Asp Leu Ala His Glu Phe Tyr Glu 
60 

Gin Xaa Xaa Ala Lys Leu Arg Ala 
75 80 



<210> 901 
<211> 68 
<212> PRT 



wo 02/083898 
<213> Homo sapiens 



449 



PCT/IBOl/00914 



<400> 901 

Met Trp Ala Phe Pro Trp Glu Glu 
1 5 
He Val Ala Val Glu Ala Lys Trp 
20 

Val Val Leu Gin Val He Leu Glu 

35 40 
Gin Pro Met Gin Lys Pro Val Tyr 

50 55 
Pro Pro Lys Lys 
65 



Pro Leu Tyr Val He Thr Phe Tyr 

10 15 
Thr Glu Ser Pro Arg Gin His Glu 
25 30 
Lys Leu Thr Tyr His Thr Thr His 
45 

Val Gin Ser Ala Glu Cys Leu Gly 
60 



<210> 902 
<211> 107 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 100 

<223> Xaa = * ,Trp 
<220> 

<221> UNSURE 
<222> 27 

<223> Xaa = * ,Tyr 
<220> 

<221> UNSURE 
<222> 23 

<223> Xaa = Arg, Ser 

<220>* 

<221> UNSURE 
<222> 19 

<223> Xaa = Glu, Gly 

<220> 

<221> UNSURE 
<222> 74,107 
<223> Xaa = Gly, Val 

<220> 

<221> UNSURE 
<222> 102,104 
<223> Xaa = His, Gin 



<220> 

<221> UNSURE 
<222> 28 

<223> Xaa = Lys, Thr 



<400> 902 

Met Pro Gin Leu Lys Leu Thr Ser 

1 5 
Tyr Thr Xaa Thr Arg Thr Xaa Ser 
20 

Pro Met Asp He Gin Met Ala Asn 

35 40 
Pro Gin Gly Asn Asp Cys Asp Leu 
50 55 



Val Trp Thr Thr Thr His Ser Tyr 

10 15 
Pro Gly Xaa Xaa Pro Gly Pro Pro 
25 30 
Asn Phe Thr Pro Pro Ser Ala Thr 
45 

Tyr Ala His His Ser Thr Ala Ser 
60 



wo 02/083898 PCT/IBO 1/009 14 

4S0 



He Val Met 


Pro Leu His 


Tyr 


Ser 


Leu Xaa Phe He 


He Gly 


Leu 


Val 


65 


70 






75 






80 


Gly Asn Leu 


Leu Ala Leu 


Val 


Val 


He Val Gin Asn Arg Lys 


Lys 


He 




85 






90 




95 




He Ser Pro 


Xaa Ala Xaa 


Pro 


Aaa 


Pro He Xaa 










100 






105 








<210> 903 
















<211> 72 
















<212> PRT 
















<213> Homo sapiens 














<400> 903 
















Met Leu Asp 


Ser He Trp 


Lys 


Pro Asp Leu Phe Phe 


Ala Asn 


Glu 


Lys 


1 


5 






10 




15 




Gly Ala Asn 


Phe His Glu 


Val 


Thr Thr Asp Asn Lys 


Leu Leu 


Arg 


He 




20 






25 


30 






Phe Lys Asn 


Gly Asn Val 


Leu 


Tvr 


Ser He Arg Leu Thr Leu 


Thr 


Leu 


35 






40 




45 






Ser Cys Pro 


Met Asp Leu 


Lys 


Asn 


Phe Pro Met Asp Val Gin 


Thr 


Cys 


50 




55 




60 








He Met Gin 


Leu Glu Ser 


Leu 


Val 










65 


70 














<210> 904 
















<211> 57 
















<212> PRT 
















<213> Homo sapiens 














<400> 904 
















Met Gly Lys Gin Arg Asn Leu 


Thr 


Leu Phe Phe Ser 


Phe Pro 


Phe 


Cys 


1 


5 






10 




15 




Glu Thr Val 


Phe Leu Asn 


His 


Leu 


Leu Asn Ser Val 


Phe Phe 


Pro 


Ser 




20 






25 


30 






Tyr Ser Phe 


Val Glu Pro 


Leu 


Leu 


Ser Tyr Leu Arg 


Phe Pro 


Gin 


Thr 


35 






40 




45 






His He Ser 


Leu Leu Phe 


Gin 


Leu 


Tyr 








50 




55 










<210> 905 
















<211> 102 
















<212> PRT 
















<213> Homo sapiens 














<400> 905 
















Met Glu Arg 


Leu Thr Leu 


Pro 


Leu 


Gly Gly Ala Ala 


Ala Val 


Asp 


Glu 


1 


5 






10 




15 




Tyr Leu Glu 


Tyr Arg Arg 


He 


Val 


Gly Glu Asp Asp Gly Gly 


Lys 


Leu 




20 






25 


30 






Phe Thr Pro 


Glu Glu Tyr 


Glu 


Glu 


Tyr Lys Arg Lys 


Val Leu 


Pro 


Leu 


35 






40 




45 






Arg Leu Gin 


Asn Arg Leu 


Phe 


Val 


Ser Trp Arg Ser 


Pro Thr 


Gly 


Met 


50 




55 




60 








Asp Cys Lys 


Leu Val Gly 


Pro 


Glu 


Thr Leu Cys Phe 


Cys Thr 


His 


Arg 


65 


70 






75 






80 


Tyr Lys Gin 


His Lys Thr Asp 


Leu 


Glu Ala He Pro 


Gin Gin 


Cys 


Pro 




85 






90 




95 




He Asp Pro 


Pro Ala Lys 
















100 















<210> 906 
<211> 104 
<212> PRT 



wo 02/083898 
<213> Homo sapiens 



45! 



PCT/IBOl/00914 



<220> 

<221> UNSURE 
<222> 64 

<223> Xaa = His, Gin 



<400> 906 










Met 


Ser 


Thr 


Leu 


Leu 


He 


Ser Ser 


1 








5 






Glu 


Arg 


Ser 


Thr 


Leu 


Gin Asp Ser 








20 








Leu 


Pro 


Leu 
35 


Asp 


Gin 


His 


Ser Arg 
40 


Ser 


Glu 
50 


He 


Leu 


Ser 


He 


Gin Asp 
55 


Ser 


Cys 


He 


Leu 


Pro 


He 


Ser Arg 


65 










70 




Lys 


Ser 


Gin 


Arg 


Asn 
85 


His 


His Arg 


Ser 


Trp 


Met 


Ser 
100 


Ser 


Leu 


Leu Thr 



Ser Asp Ala Leu Leu Glu Glu Leu 

10 15 
Asp Glu Tyr Ser Asn Pro Ala Pro 
25 30 
Lys Glu Thr Asn Leu Asp Glu Thr 
45 

Asn Thr Ser Pro Leu Pro Ala Xaa 
60 

Ser Ser Met Ser Thr Val Lys Pro 

75 80 
Leu Leu Lys Arg Gin Gin Leu Leu 
90 95 



<210> 907 
<211> 63 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 57 

<223> Xaa « Ala, Pro 
<220> 

<221> UNSURE 
<222> 60 

<223> Xaa = Gly,Val 
<220> 

<221> UNSURE 
<222> 39 

<223> Xaa = Phe,Val 



<220> 

<221> UNSURE 
<222> 56 

<223> Xaa = Pro, Ser 



<400> 907 

Met Asp Tyr Gly Tyr Val Gin Thr Thr Ser Thr Glu Met Leu Arg Asn 

15 10 15 

Phe He Gin Thr Glu Ala Val Val Ser Lys Pro Phe Ser Leu Phe Asp 

20 25 30 

Leu Ser Ser Val Gly Leu Xaa Gly Ala Glu Thr Gin Gin Ser Lys Val 

35 40 45 

Ala Pro Gin Gin Cys Ser Gin Xaa Xaa Arg Pro Xaa Gin Ser Leu 
50 55 60 



<210> 908 
<211> 65 
<212> PRT 

<213> Homo sapiens 
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<400> 908 

Met Leu Ser Ala Leu Pro Gly Trp Gly Pro Ala His Leu Gin Arg Pro 

15 10 15 

Leu Leu Gly Pro Ala Ser Cys Leu Gly lie Leu Arg Pro Ala Met Thr 

20 25 30 

Ala His Ser Phe Ala Leu Pro Val He He Phe Thr Thr .Phe Trp Gly 

35 40 45 

Leu Val Gly He Ala Gly Pro Trp Phe Val Pro Lys Asp Pro Thr Ala 
50 55 60 

Glu 
65 



<210> 909 
<211> 83 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> UNSURE 
<222> 42 

<223> Xaa = Asp, Glu 



<400> 909 



Met 


Leu 


Val 


Glu 


Arg 


Lys 


Val Asn Gly 


Leu 


Val 


Pro Arg 


Leu 


Met 


Pro 


1 








5 










10 








15 




Val 


He 


Ala 


Ala 


His 


Trp 


Glu 


Ala 


Gin 


Val 


Gly Ala Ser 


Leu 


Glu 


Gly 








20 










25 








30 






Ser 


Ser 


He 


Leu 


Asp Arg 


Ala 


Val 


He 


Xaa 


His 


Asn Leu 


Leu 


Ser 


Ala 






35 










40 








45 








Ser 


Lys 


Leu 


Tyr 


Asn 


Asn 


He 


Thr 


Phe 


Glu 


Glu 


Leu Gly Ala 


Leu 


Leu 




50 










55 










60 








Glu 


He 


Pro 


Ala 


Ala 


Lys 


Ala 


Glu 


Lys 


He 


Ala 


Ser Gin 


Met 


He 


Thr 



65 70 75 80 

Glu Asp Val 

<210> 910 
<211> 94 
<212> PRT 

<213> Homo sapiens 



<400> 910 






Met Ala 


Ser Asn 


Glu 


Val He Ala 


1 




5 




Lys Ser 


Asn Leu 


Gly Trp Thr Pro 




20 






His Arg 


Gin Val 


Val 


Gin Asp Leu 




35 




40 


Val Leu 


Asn Asp 


Met 


Gly Asp Thr 


50 






55 


Gly Arg 


Lys Val 


Lys 


He He Leu 


65 






70 


Phe Gly 


Gly Val 


Val 


Thr He Val 



85 



Asp He Asn Cys Lys Gly Arg Ser 

10 15 
Leu His Leu Ala Cys Tyr Phe Gly 
25 30 
Leu Lys Ala Gly Ala Glu Val Asn 
45 

Pro Leu His Arg Ala Ala Phe Thr 
60 

Cys Ser Met Phe Val Ser Glu Val 

75 80 
Phe Ser Val He Thr He 
90 



<210> 911 
<211> 75 
<212> PRT 

<213> Homo sapiens 



<400> 911 

Met Pro Gly Pro Thr Pro Ser Gly Thr Asn Val Gly Ser Ser Gly Arg 

15 10 15 

Ser Pro Ser Lys Ala Val Ala Ala Arg Ala Gly Asp Pro Leu Ser Gly 
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20 25 
Arg Gly Lys Met Pro Ala Val Gly Gin Gly Val Gin 

35 40 
Arg Gin Ala Pro Gly Gly Cys Gly Asp Ser Thr Gin 

50 55 60 

Gly Ser Lys Leu Ala Leu Phe Gin Tyr Trp Leu 
65 70 75 

<210> 912 
<211> 160 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> UNSURE 
<222> 107 

<223> Xaa = * ,Trp 
<220> 

<221> UNSURE 
<222> 97 

<223> Xaa = Cys, Trp 



<400> 912 



Met Arg 


Pro 


Leu 


Pro 


Pro 


Val 


Gly 


Asp Val Arg Leu 


Glu Leu 


Ser 


Pro 


1 






5 








10 






15 




Pro Pro 


Pro 


Leu 


Leu 


Pro 


Val 


Pro 


Val Val 


Ser Gly 


Ser Pro 


Val 


Gly 






20 










25 




30 






Ser Ser 


Gly Arg Leu 


Met 


Ala 


Ser 


Ser Ser 


Ser Leu 


Val Pro Asp Arg 




35 










40 






45 






Leu Arg 


Leu 


Pro 


Leu 


Cys 


Phe 


Leu 


Gly Val 


Phe Val 


Cys Tyr 


Phe 


Tyr 


50 










55 






60 








Tyr Gly 


He 


Leu 


Gin 


Glu Lys 


He 


Thr Arg Gly Lys 


Tyr Gly Glu Gly 


65 








70 








75 






80 


Ala Lys 


Gin 


Glu 


Thr 


Phe 


Thr 


Phe 


Ala Leu 


Thr Leu 


Val Phe 


He 


Gin 








85 








90 






95 




Xaa Val 


He 


Asn 


Ala 


Val 


Phe 


Ala 


Lys He 


Xaa Trp 


Asp Arg 


Thr 


Arg 






100 










105 




110 






Ser Trp 


Leu 


Tyr 


Ala 


Ala Cys 


Ser 


He Ser 


Tyr Leu 


Gly Ala 


Met 


Val 




115 










120 






125 






Ser Ser 


Asn 


Ser 


Ala 


Leu 


Gin 


Phe 


Val Asn 


Tyr Pro 


Thr Gin 


Val 


Leu 


130 










135 






140 








Gly Lys 


Ser Cys 


Lys 


Pro 


He 


Pro 


Val Met 


Leu Leu 


Gly Val 


Thr 


Ser 


145 








150 








155 






160 



<210> 913 
<211> 60 
<212> PRT 

<213> Homo sapiens 
<400> 913 

Met Arg Leu Ser Pro Phe His Leu 
1 5 
Lys Arg Glu Asp Leu Gin He Ser 
20 

Met Ala Val Asn He Gin Pro Lys 

35 40 
Arg Gin Val Arg Cys Leu Thr Phe 
50 55 
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30 

Ala Ala Gin Pro 
45 

Lys He His Leu 



Gin Leu Glu Phe 
10 

Trp Ser Phe He 
25 

His Trp Gly Arg 

Ser Arg Thr Ser 
60 



His Met Lys Glu 
15 

Ser Val Pro Glu 

30 

Thr Arg Trp Leu 
45 
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<211> 137 
<212> PRT 
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<400> 914 






Met 


Glu 


Tyr Leu 


He Gly He Gin 


1 






5 




Ser Asp 


Arg Val 


Ala Ala 


Ser Asn 






20 






Asp 


Lys 


Met Phe 


Lys Met 


Ser Glu 






35 




40 


Glu 


Ala 


Gly Asp Thr Val 


Gin Phe 




50 






55 


Gin 


Leu 


Tyr Lys 


Met Arg Asn Gly 


65 






70 




Ala 


Asn 


Phe Thr 


Arg Arg 


Asn Leu 








85 




Pro 


Tyr 


His Val 


Asn Leu 


Leu Leu 






100 






Pro 


Ala 


Leu Tyr 


Tyr Met 


Asp Tyr 






115 




120 


Leu 


Gin 


Pro Thr 


Ala Met 


Val Pro 




130 






135 



Gly Pro Asp Tyr Val Leu Val Ala 

10 15 
He Val Gin Met Lys Asp Asp His 
25 30 
Lys He Leu Leu Leu Cys Val Gly 
45 

Ala Glu Tyr He Gin Lys Asn Val 
60 

Tyr Glu Leu Ser Pro Thr Ala Ala 

75 80 
Ala Asp Cys Leu Arg Ser Arg Thr 

90 95 
Ala Gly Tyr Asp Glu His Glu Gly 
105 110 
Leu Ala Ala Leu Ala Lys Ala Leu 
125 

Ser 



<210> 915 
<211> 55 
<212> PRT 

<213> Homo sapiens 



<400> 915 

Met Ala Asp Arg Gly Gly Val Gly 
1 5 
Pro Ala Ser Val Pro Gly Leu Asn 
20 

Leu Arg Pro Gly Trp Arg Gly Leu 

35 40 
Gly Trp Gly Cys Phe Thr Pro 
50 55 



Glu Ala Ala Ala Val Gly Ala Ser 

10 15 
Pro Thr Leu Gly Trp Arg Glu Arg 
25 30 
Gly Pro Arg Cys Gly Ser Trp Arg 
45 



<210> 916 
<211> 132 
<212> PRT 

<213> Homo sapiens 



<400> 916 



Met 


He 


Ser 


He 


Thr Glu Trp 


Gin Lys 


He Gly Val 


Gly He 


Thr 


Gly 


1 








5 




10 




15 




Phe 


Gly 


He 


Phe 


Phe He Leu 


Phe Gly Thr Leu Leu 


Tyr Phe 


Asp 


Ser 








20 




25 




30 






Val 


Leu 


Leu 


Ala 


Phe Gly Asn Leu Leu 


Phe Leu Thr 


Gly Leu 


Ser 


Leu 






35 






40 




45 






He 


He 
50 


Gly 


Leu 


Arg Lys Thr 
55 


Phe Trp 


Phe Phe Phe 
60 


Gin Arg 


His 


Lys 


Leu 


Lys 


Gly 


Thr 


Ser Phe Leu 


Leu Gly Gly Val Val 


He Val 


Leu 


Leu 


65 








70 




75 






80 


Arg 


Trp 


Pro 


Leu 


Leu Gly Met 

85 


Phe Leu 


Glu Thr Tyr 

90 


Gly Phe 


Phe 

95 


Ser 


Leu 


Phe 


Lys 


Gly 


Phe Phe Pro 


Val Ala 


Phe Gly Phe 


Leu Gly Asn 


Val 








100 




105 




110 






Cys 


Asn 


He 


Pro 


Phe Leu Gly Ala Leu 


Phe Arg Arg 


Leu Gin Gly 


Thr 






115 






120 




125 






Ser 


Ser 
130 


Met 


Val 
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<211> 143 
<212> PRT 

<213> Homo sapiens 
<400> 917 



Met 


Leu 


Gin Ala 


Pro 


Arg Gly Arg Ala Arg Ala Ala 


Leu Arg Arg Leu 


1 






5 


10 


15 




Ala 


Val 


Ala Thr 


Arg 


Thr Met Ala Gly Ala Pro Thr 


Val Ser Leu 


Pro 






20 




25 


30 




Glu 


Leu 


Arg Ser 


Leu 


Leu Ala Ser Gly Arg Ala Arg 


Leu Phe Asp Val 






35 




40 


45 




Arg 


Ser 


Arg Glu Glu 


Ala Ala Ala Gly Thr He Pro 


Gly Ala Leu 


Asn 




50 






55 60 






lie 


Pro 


Val Ser 


Glu 


Leu Glu Ser Ala Leu Gin Met 


Glu Pro Ala 


Ala 


65 








70 75 




80 


Phe 


Gin 


Ala Leu 


Tyr 


Ser Ala Glu Lys Pro Lys Leu 


Glu Asp Glu 


His 








85 


90 


95 




Leu 


Val 


Phe Phe 


Cys 


Gin Met Gly Lys Arg Gly Leu 


Gin Ala Arg 


Ser 






100 




105 


110 




Trp 


Pro 


Gly Val 


Leu 


Asp Thr Leu Gly Leu Ala Thr 


Thr Leu Glu 


Pro 






115 




120 


125 




He 


Glu 


Asn Gly Trp 


Arg Lys Arg Val Arg Gin Glu 


Ala Ala Tyr 






130 






135 140 







<210> 918 
<211> 56 
<212> PRT 

<213> Homo sapiens 



<400> 918 

Met Lys Met Ser Arg Asn Ser Ser 
1 5 
Gly Ala Ser Trp Glu Ala Lys His 
20 

Ser Ser Leu Gly Phe Arg Pro Leu 

35 40 
Leu Lys Glu Ala Pro Asp Ser Met 
50 55 



Leu Gly Gly Ala Thr His Cys Thr 

10 15 

Gly Pro Ala Ser Gly Ala Ser Pro 
25 30 

Ser Pro Ser Leu Pro Ala Leu Gly 
45 
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